tensorNet DNN Vision Library (jetson-inference)
DNN abstract base class that provides TensorRT functionality underneath. These functions aren't typically accessed by end users unless they are implementing their own DNN class like imageNet or detectNet. More...
|
|
| class | tensorNet |
| | Abstract class for loading a tensor network with TensorRT. More...
|
| |
|
|
| #define | TENSORRT_VERSION_CHECK(major, minor, patch) (NV_TENSORRT_MAJOR > major || (NV_TENSORRT_MAJOR == major && NV_TENSORRT_MINOR > minor) || (NV_TENSORRT_MAJOR == major && NV_TENSORRT_MINOR == minor && NV_TENSORRT_PATCH >= patch)) |
| | Macro for checking the minimum version of TensorRT that is installed. More...
|
| |
| #define | DEFAULT_MAX_BATCH_SIZE 1 |
| | Default maximum batch size. More...
|
| |
| #define | LOG_TRT "[TRT] " |
| | Prefix used for tagging printed log output from TensorRT. More...
|
| |
|
|
| enum | precisionType {
TYPE_DISABLED = 0, TYPE_FASTEST, TYPE_FP32, TYPE_FP16,
TYPE_INT8, NUM_PRECISIONS
} |
| | Enumeration for indicating the desired precision that the network should run in, if available in hardware. More...
|
| |
| enum | deviceType {
DEVICE_GPU = 0, DEVICE_DLA, DEVICE_DLA_0 = DEVICE_DLA, DEVICE_DLA_1,
NUM_DEVICES
} |
| | Enumeration for indicating the desired device that the network should run on, if available in hardware. More...
|
| |
| enum | modelType {
MODEL_CUSTOM = 0, MODEL_CAFFE, MODEL_ONNX, MODEL_UFF,
MODEL_ENGINE
} |
| | Enumeration indicating the format of the model that's imported in TensorRT (either caffe, ONNX, or UFF). More...
|
| |
| enum | profilerQuery {
PROFILER_PREPROCESS = 0, PROFILER_NETWORK, PROFILER_POSTPROCESS, PROFILER_VISUALIZE,
PROFILER_TOTAL
} |
| | Profiling queries. More...
|
| |
| enum | profilerDevice { PROFILER_CPU = 0, PROFILER_CUDA } |
| | Profiler device. More...
|
| |
|
|
| const char * | precisionTypeToStr (precisionType type) |
| | Stringize function that returns precisionType in text. More...
|
| |
| precisionType | precisionTypeFromStr (const char *str) |
| | Parse the precision type from a string. More...
|
| |
| const char * | deviceTypeToStr (deviceType type) |
| | Stringize function that returns deviceType in text. More...
|
| |
| deviceType | deviceTypeFromStr (const char *str) |
| | Parse the device type from a string. More...
|
| |
| const char * | modelTypeToStr (modelType type) |
| | Stringize function that returns modelType in text. More...
|
| |
| modelType | modelTypeFromStr (const char *str) |
| | Parse the model format from a string. More...
|
| |
| modelType | modelTypeFromPath (const char *path) |
| | Parse the model format from a file path. More...
|
| |
| const char * | profilerQueryToStr (profilerQuery query) |
| | Stringize function that returns profilerQuery in text. More...
|
| |
DNN abstract base class that provides TensorRT functionality underneath. These functions aren't typically accessed by end users unless they are implementing their own DNN class like imageNet or detectNet.
| class tensorNet |
Abstract class for loading a tensor network with TensorRT.
For example implementations, see imageNet and detectNet.
Inheritance diagram for tensorNet:
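As a rough illustration of how a derived class uses this interface, here is a minimal sketch of a custom network built on tensorNet. The class name myNet, the ONNX filename, and the layer names input_0/output_0 are hypothetical placeholders, not part of the library.

```cpp
// Minimal sketch of a tensorNet subclass (hypothetical names throughout).
// tensorNet's constructor is protected, so it is used via derived classes.
#include <jetson-inference/tensorNet.h>

class myNet : public tensorNet
{
public:
    static myNet* Create()
    {
        myNet* net = new myNet();

        // load an ONNX model with one input and one output layer
        if( !net->LoadNetwork(NULL, "my_model.onnx", NULL,
                              "input_0", "output_0") )
        {
            delete net;
            return NULL;
        }

        return net;
    }

    bool Process()
    {
        // ...fill GetInputPtr() with pre-processed data here...
        return ProcessNetwork();   // run inference (blocks until done)
    }
};
```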
|
|
| virtual | ~tensorNet () |
| | Destructor. More...
|
| |
| bool | LoadNetwork (const char *prototxt, const char *model, const char *mean=NULL, const char *input_blob="data", const char *output_blob="prob", uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) |
| | Load a new network instance. More...
|
| |
| bool | LoadNetwork (const char *prototxt, const char *model, const char *mean, const char *input_blob, const std::vector< std::string > &output_blobs, uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) |
| | Load a new network instance with multiple output layers. More...
|
| |
| bool | LoadNetwork (const char *prototxt, const char *model, const char *mean, const std::vector< std::string > &input_blobs, const std::vector< std::string > &output_blobs, uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) |
| | Load a new network instance with multiple input layers. More...
|
| |
| bool | LoadNetwork (const char *prototxt, const char *model, const char *mean, const char *input_blob, const Dims3 &input_dims, const std::vector< std::string > &output_blobs, uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) |
| | Load a new network instance (this variant is used for UFF models) More...
|
| |
| bool | LoadNetwork (const char *prototxt, const char *model, const char *mean, const std::vector< std::string > &input_blobs, const std::vector< Dims3 > &input_dims, const std::vector< std::string > &output_blobs, uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) |
| | Load a new network instance with multiple input layers (used for UFF models) More...
|
| |
| bool | LoadEngine (const char *engine_filename, const std::vector< std::string > &input_blobs, const std::vector< std::string > &output_blobs, nvinfer1::IPluginFactory *pluginFactory=NULL, deviceType device=DEVICE_GPU, cudaStream_t stream=NULL) |
| | Load a network instance from a serialized engine plan file. More...
|
| |
| bool | LoadEngine (char *engine_stream, size_t engine_size, const std::vector< std::string > &input_blobs, const std::vector< std::string > &output_blobs, nvinfer1::IPluginFactory *pluginFactory=NULL, deviceType device=DEVICE_GPU, cudaStream_t stream=NULL) |
| | Load a network instance from a serialized engine plan file. More...
|
| |
| bool | LoadEngine (nvinfer1::ICudaEngine *engine, const std::vector< std::string > &input_blobs, const std::vector< std::string > &output_blobs, deviceType device=DEVICE_GPU, cudaStream_t stream=NULL) |
| | Load network resources from an existing TensorRT engine instance. More...
|
| |
| bool | LoadEngine (const char *filename, char **stream, size_t *size) |
| | Load a serialized engine plan file into memory. More...
|
| |
| void | EnableLayerProfiler () |
| | Manually enable layer profiling times. More...
|
| |
| void | EnableDebug () |
| | Manually enable debug messages and synchronization. More...
|
| |
| bool | AllowGPUFallback () const |
| | Return true if GPU fallback is enabled. More...
|
| |
| deviceType | GetDevice () const |
| | Retrieve the device being used for execution. More...
|
| |
| precisionType | GetPrecision () const |
| | Retrieve the type of precision being used. More...
|
| |
| bool | IsPrecision (precisionType type) const |
| | Check if a particular precision is being used. More...
|
| |
| cudaStream_t | GetStream () const |
| | Retrieve the stream that the device is operating on. More...
|
| |
| cudaStream_t | CreateStream (bool nonBlocking=true) |
| | Create and use a new stream for execution. More...
|
| |
| void | SetStream (cudaStream_t stream) |
| | Set the stream that the device is operating on. More...
|
| |
| const char * | GetPrototxtPath () const |
| | Retrieve the path to the network prototxt file. More...
|
| |
| const char * | GetModelPath () const |
| | Retrieve the full path to model file, including the filename. More...
|
| |
| const char * | GetModelFilename () const |
| | Retrieve the filename of the model, excluding the directory. More...
|
| |
| modelType | GetModelType () const |
| | Retrieve the format of the network model. More...
|
| |
| bool | IsModelType (modelType type) const |
| | Return true if the model is of the specified format. More...
|
| |
| uint32_t | GetInputLayers () const |
| | Retrieve the number of input layers to the network. More...
|
| |
| uint32_t | GetOutputLayers () const |
| | Retrieve the number of output layers to the network. More...
|
| |
| Dims3 | GetInputDims (uint32_t layer=0) const |
| | Retrieve the dimensions of network input layer. More...
|
| |
| uint32_t | GetInputWidth (uint32_t layer=0) const |
| | Retrieve the width of network input layer. More...
|
| |
| uint32_t | GetInputHeight (uint32_t layer=0) const |
| | Retrieve the height of network input layer. More...
|
| |
| uint32_t | GetInputSize (uint32_t layer=0) const |
| | Retrieve the size (in bytes) of network input layer. More...
|
| |
| float * | GetInputPtr (uint32_t layer=0) const |
| | Get the CUDA pointer to the input layer's memory. More...
|
| |
| Dims3 | GetOutputDims (uint32_t layer=0) const |
| | Retrieve the dimensions of network output layer. More...
|
| |
| uint32_t | GetOutputWidth (uint32_t layer=0) const |
| | Retrieve the width of network output layer. More...
|
| |
| uint32_t | GetOutputHeight (uint32_t layer=0) const |
| | Retrieve the height of network output layer. More...
|
| |
| uint32_t | GetOutputSize (uint32_t layer=0) const |
| | Retrieve the size (in bytes) of network output layer. More...
|
| |
| float * | GetOutputPtr (uint32_t layer=0) const |
| | Get the CUDA pointer to the output memory. More...
|
| |
| float | GetNetworkFPS () |
| | Retrieve the network frames per second (FPS). More...
|
| |
| float | GetNetworkTime () |
| | Retrieve the network runtime (in milliseconds). More...
|
| |
| const char * | GetNetworkName () const |
| | Retrieve the network name (its filename). More...
|
| |
| float2 | GetProfilerTime (profilerQuery query) |
| | Retrieve the profiler runtime (in milliseconds). More...
|
| |
| float | GetProfilerTime (profilerQuery query, profilerDevice device) |
| | Retrieve the profiler runtime (in milliseconds). More...
|
| |
| void | PrintProfilerTimes () |
| | Print the profiler times (in milliseconds). More...
|
| |
|
|
| static bool | LoadClassLabels (const char *filename, std::vector< std::string > &descriptions, int expectedClasses=-1) |
| | Load class descriptions from a label file. More...
|
| |
| static bool | LoadClassLabels (const char *filename, std::vector< std::string > &descriptions, std::vector< std::string > &synsets, int expectedClasses=-1) |
| | Load class descriptions and synset strings from a label file. More...
|
| |
| static bool | LoadClassColors (const char *filename, float4 *colors, int expectedClasses, float defaultAlpha=255.0f) |
| | Load class colors from a text file. More...
|
| |
| static bool | LoadClassColors (const char *filename, float4 **colors, int expectedClasses, float defaultAlpha=255.0f) |
| | Load class colors from a text file. More...
|
| |
| static float4 | GenerateColor (uint32_t classID, float alpha=255.0f) |
| | Procedurally generate a color for a given class index with the specified alpha value. More...
|
| |
| static precisionType | SelectPrecision (precisionType precision, deviceType device=DEVICE_GPU, bool allowInt8=true) |
| | Resolve a desired precision to a specific one that's available. More...
|
| |
| static precisionType | FindFastestPrecision (deviceType device=DEVICE_GPU, bool allowInt8=true) |
| | Determine the fastest native precision on a device. More...
|
| |
| static std::vector< precisionType > | DetectNativePrecisions (deviceType device=DEVICE_GPU) |
| | Detect the precisions supported natively on a device. More...
|
| |
| static bool | DetectNativePrecision (const std::vector< precisionType > &nativeTypes, precisionType type) |
| | Detect if a particular precision is supported natively. More...
|
| |
| static bool | DetectNativePrecision (precisionType precision, deviceType device=DEVICE_GPU) |
| | Detect if a particular precision is supported natively. More...
|
| |
|
|
| | tensorNet () |
| | Constructor. More...
|
| |
| bool | ProcessNetwork (bool sync=true) |
| | Execute processing of the network. More...
|
| |
| bool | ProfileModel (const std::string &deployFile, const std::string &modelFile, const std::vector< std::string > &inputs, const std::vector< Dims3 > &inputDims, const std::vector< std::string > &outputs, uint32_t maxBatchSize, precisionType precision, deviceType device, bool allowGPUFallback, nvinfer1::IInt8Calibrator *calibrator, char **engineStream, size_t *engineSize) |
| | Create and output an optimized network model. More...
|
| |
| bool | ConfigureBuilder (nvinfer1::IBuilder *builder, uint32_t maxBatchSize, uint32_t workspaceSize, precisionType precision, deviceType device, bool allowGPUFallback, nvinfer1::IInt8Calibrator *calibrator) |
| | Configure builder options. More...
|
| |
| bool | ValidateEngine (const char *model_path, const char *cache_path, const char *checksum_path) |
| | Validate that the model already has a built TensorRT engine that exists and doesn't need updating. More...
|
| |
| void | PROFILER_BEGIN (profilerQuery query) |
| | Begin a profiling query, before network is run. More...
|
| |
| void | PROFILER_END (profilerQuery query) |
| | End a profiling query, after the network is run. More...
|
| |
| bool | PROFILER_QUERY (profilerQuery query) |
| | Query the CUDA part of a profiler query. More...
|
| |
|
| | tensorNet::Logger | gLogger |
| | tensorNet::Profiler | gProfiler |
| | std::string | mPrototxtPath |
| | std::string | mModelPath |
| | std::string | mModelFile |
| | std::string | mMeanPath |
| | std::string | mCacheEnginePath |
| | std::string | mCacheCalibrationPath |
| | std::string | mChecksumPath |
| | deviceType | mDevice |
| | precisionType | mPrecision |
| | modelType | mModelType |
| | cudaStream_t | mStream |
| | cudaEvent_t | mEventsGPU [PROFILER_TOTAL *2] |
| | timespec | mEventsCPU [PROFILER_TOTAL *2] |
| | nvinfer1::IRuntime * | mInfer |
| | nvinfer1::ICudaEngine * | mEngine |
| | nvinfer1::IExecutionContext * | mContext |
| | float2 | mProfilerTimes [PROFILER_TOTAL+1] |
| | uint32_t | mProfilerQueriesUsed |
| | uint32_t | mProfilerQueriesDone |
| | uint32_t | mWorkspaceSize |
| | uint32_t | mMaxBatchSize |
| | bool | mEnableProfiler |
| | bool | mEnableDebug |
| | bool | mAllowGPUFallback |
| | void ** | mBindings |
| | std::vector< layerInfo > | mInputs |
| | std::vector< layerInfo > | mOutputs |
|
| virtual tensorNet::~tensorNet | ( | | ) | |
| virtual |
Destructor.
|
| tensorNet::tensorNet | ( | | ) | |
| protected |
Constructor.
|
| bool tensorNet::AllowGPUFallback | ( | | ) | const |
| inline |
Return true if GPU fallback is enabled.
|
| bool tensorNet::ConfigureBuilder | ( | nvinfer1::IBuilder * | builder, |
| | | uint32_t | maxBatchSize, |
| | | uint32_t | workspaceSize, |
| | | precisionType | precision, |
| | | deviceType | device, |
| | | bool | allowGPUFallback, |
| | | nvinfer1::IInt8Calibrator * | calibrator |
| | ) | | |
| protected |
Configure builder options.
| cudaStream_t tensorNet::CreateStream | ( | bool | nonBlocking = true | ) | |
Create and use a new stream for execution.
|
| static bool tensorNet::DetectNativePrecision | ( | const std::vector< precisionType > & | nativeTypes, | | | | precisionType | type | | | ) | | |
| static |
Detect if a particular precision is supported natively.
|
| static bool tensorNet::DetectNativePrecision | ( | precisionType | precision, |
| | | deviceType | device = DEVICE_GPU |
| | ) | | |
| static |
Detect if a particular precision is supported natively.
|
| static std::vector<precisionType> tensorNet::DetectNativePrecisions | ( | deviceType | device = DEVICE_GPU | ) | |
| static |
Detect the precisions supported natively on a device.
| void tensorNet::EnableDebug | ( | | ) | |
Manually enable debug messages and synchronization.
| void tensorNet::EnableLayerProfiler | ( | | ) | |
Manually enable layer profiling times.
|
| static precisionType tensorNet::FindFastestPrecision | ( | deviceType | device = DEVICE_GPU, |
| | | bool | allowInt8 = true |
| | ) | | |
| static |
Determine the fastest native precision on a device.
|
| static float4 tensorNet::GenerateColor | ( | uint32_t | classID, |
| | | float | alpha = 255.0f |
| | ) | | |
| static |
Procedurally generate a color for a given class index with the specified alpha value.
This function can be used to generate a range of colors when a colors.txt file isn't available.
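A minimal sketch of generating procedural colors, assuming a hypothetical class count of 21:

```cpp
// Sketch: procedurally generate per-class colors when no colors.txt
// file is available (the class count of 21 is a hypothetical example).
for( uint32_t n = 0; n < 21; n++ )
{
    const float4 color = tensorNet::GenerateColor(n, 255.0f);

    printf("class %u -> RGBA(%.0f, %.0f, %.0f, %.0f)\n",
           n, color.x, color.y, color.z, color.w);
}
```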
|
| deviceType tensorNet::GetDevice | ( | | ) | const |
| inline |
Retrieve the device being used for execution.
|
| Dims3 tensorNet::GetInputDims | ( | uint32_t | layer = 0 | ) | const |
| inline |
Retrieve the dimensions of network input layer.
|
| uint32_t tensorNet::GetInputHeight | ( | uint32_t | layer = 0 | ) | const |
| inline |
Retrieve the height of network input layer.
|
| uint32_t tensorNet::GetInputLayers | ( | | ) | const |
| inline |
Retrieve the number of input layers to the network.
|
| float* tensorNet::GetInputPtr | ( | uint32_t | layer = 0 | ) | const |
| inline |
Get the CUDA pointer to the input layer's memory.
|
| uint32_t tensorNet::GetInputSize | ( | uint32_t | layer = 0 | ) | const |
| inline |
Retrieve the size (in bytes) of network input layer.
|
| uint32_t tensorNet::GetInputWidth | ( | uint32_t | layer = 0 | ) | const |
| inline |
Retrieve the width of network input layer.
|
| const char* tensorNet::GetModelFilename | ( | | ) | const |
| inline |
Retrieve the filename of the model, excluding the directory.
|
| const char* tensorNet::GetModelPath | ( | | ) | const |
| inline |
Retrieve the full path to model file, including the filename.
|
| modelType tensorNet::GetModelType | ( | | ) | const |
| inline |
Retrieve the format of the network model.
|
| float tensorNet::GetNetworkFPS | ( | | ) | |
| inline |
Retrieve the network frames per second (FPS).
|
| const char* tensorNet::GetNetworkName | ( | | ) | const |
| inline |
Retrieve the network name (its filename).
|
| float tensorNet::GetNetworkTime | ( | | ) | |
| inline |
Retrieve the network runtime (in milliseconds).
|
| Dims3 tensorNet::GetOutputDims | ( | uint32_t | layer = 0 | ) | const |
| inline |
Retrieve the dimensions of network output layer.
|
| uint32_t tensorNet::GetOutputHeight | ( | uint32_t | layer = 0 | ) | const |
| inline |
Retrieve the height of network output layer.
|
| uint32_t tensorNet::GetOutputLayers | ( | | ) | const |
| inline |
Retrieve the number of output layers to the network.
|
| float* tensorNet::GetOutputPtr | ( | uint32_t | layer = 0 | ) | const |
| inline |
Get the CUDA pointer to the output memory.
|
| uint32_t tensorNet::GetOutputSize | ( | uint32_t | layer = 0 | ) | const |
| inline |
Retrieve the size (in bytes) of network output layer.
|
| uint32_t tensorNet::GetOutputWidth | ( | uint32_t | layer = 0 | ) | const |
| inline |
Retrieve the width of network output layer.
|
| precisionType tensorNet::GetPrecision | ( | | ) | const |
| inline |
Retrieve the type of precision being used.
|
| float2 tensorNet::GetProfilerTime | ( | profilerQuery | query | ) | |
| inline |
Retrieve the profiler runtime (in milliseconds).
|
| float tensorNet::GetProfilerTime | ( | profilerQuery | query, |
| | | profilerDevice | device |
| | ) | | |
| inline |
Retrieve the profiler runtime (in milliseconds).
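A brief sketch of querying per-stage timing after inference has run (net is assumed to point at a tensorNet-derived instance):

```cpp
// Sketch: query CUDA and CPU timings for individual pipeline stages.
const float networkCUDA = net->GetProfilerTime(PROFILER_NETWORK, PROFILER_CUDA);
const float preprocCPU  = net->GetProfilerTime(PROFILER_PREPROCESS, PROFILER_CPU);

printf("network %.2f ms (CUDA), pre-process %.2f ms (CPU)\n",
       networkCUDA, preprocCPU);
```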
|
| const char* tensorNet::GetPrototxtPath | ( | | ) | const |
| inline |
Retrieve the path to the network prototxt file.
|
| cudaStream_t tensorNet::GetStream | ( | | ) | const |
| inline |
Retrieve the stream that the device is operating on.
|
| bool tensorNet::IsModelType | ( | modelType | type | ) | const |
| inline |
Return true if the model is of the specified format.
|
| bool tensorNet::IsPrecision | ( | precisionType | type | ) | const |
| inline |
Check if a particular precision is being used.
|
| static bool tensorNet::LoadClassColors | ( | const char * | filename, |
| | | float4 ** | colors, |
| | | int | expectedClasses, |
| | | float | defaultAlpha = 255.0f |
| | ) | | |
| static |
Load class colors from a text file.
If the expected number of colors isn't parsed from the file, the remaining colors will be generated. The float4 color array will automatically be allocated in shared CPU/GPU memory by cudaAllocMapped(). If a line in the text file only has RGB, the defaultAlpha value will be used for the alpha channel.
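A minimal sketch of this variant, assuming a hypothetical colors.txt and class count:

```cpp
// Sketch: let LoadClassColors() allocate the color array in shared
// CPU/GPU (mapped) memory.  Filename and class count are hypothetical.
float4* colors = NULL;

if( !tensorNet::LoadClassColors("colors.txt", &colors, 21) )
    printf(LOG_TRT "failed to load class colors\n");
```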
|
| static bool tensorNet::LoadClassColors | ( | const char * | filename, |
| | | float4 * | colors, |
| | | int | expectedClasses, |
| | | float | defaultAlpha = 255.0f |
| | ) | | |
| static |
Load class colors from a text file.
If the expected number of colors isn't parsed from the file, the remaining colors will be generated. The float4 color array should be expectedClasses long, and would typically be in shared CPU/GPU memory. If a line in the text file only has RGB, the defaultAlpha value will be used for the alpha channel.
|
| static bool tensorNet::LoadClassLabels | ( | const char * | filename, |
| | | std::vector< std::string > & | descriptions, |
| | | int | expectedClasses = -1 |
| | ) | | |
| static |
Load class descriptions from a label file.
Each line of the text file should include one class label (and optionally a synset). If the expected number of labels isn't parsed from the file, the remaining labels will be generated automatically.
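A minimal sketch, assuming a hypothetical labels.txt with one label per line:

```cpp
// Sketch: load class descriptions from a label file (hypothetical
// filename and expected class count).
std::vector<std::string> labels;

if( tensorNet::LoadClassLabels("labels.txt", labels, 1000) )
    printf("loaded %zu class labels\n", labels.size());
```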
|
| static bool tensorNet::LoadClassLabels | ( | const char * | filename, |
| | | std::vector< std::string > & | descriptions, |
| | | std::vector< std::string > & | synsets, |
| | | int | expectedClasses = -1 |
| | ) | | |
| static |
Load class descriptions and synset strings from a label file.
Each line of the text file should include one class label (and optionally a synset). If the expected number of labels isn't parsed from the file, the remaining labels will be generated automatically.
| bool tensorNet::LoadEngine | ( | char * | engine_stream, |
| | | size_t | engine_size, |
| | | const std::vector< std::string > & | input_blobs, |
| | | const std::vector< std::string > & | output_blobs, |
| | | nvinfer1::IPluginFactory * | pluginFactory = NULL, |
| | | deviceType | device = DEVICE_GPU, |
| | | cudaStream_t | stream = NULL |
| | ) | | |
Load a network instance from a serialized engine plan file.
Parameters
| engine_stream | Memory containing the serialized engine plan file. |
| engine_size | Size of the serialized engine stream (in bytes). |
| input_blobs | List of names of the input blob data to the network. |
| output_blobs | List of names of the output blobs from the network. |
| bool tensorNet::LoadEngine | ( | const char * | engine_filename, |
| | | const std::vector< std::string > & | input_blobs, |
| | | const std::vector< std::string > & | output_blobs, |
| | | nvinfer1::IPluginFactory * | pluginFactory = NULL, |
| | | deviceType | device = DEVICE_GPU, |
| | | cudaStream_t | stream = NULL |
| | ) | | |
Load a network instance from a serialized engine plan file.
Parameters
| engine_filename | Path to the serialized engine plan file. |
| input_blobs | List of names of the input blob data to the network. |
| output_blobs | List of names of the output blobs from the network. |
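A minimal sketch of loading a pre-built engine, with hypothetical file and layer names (net is assumed to be a tensorNet-derived instance):

```cpp
// Sketch: load a serialized TensorRT engine plan directly, skipping
// the builder/optimization step (names are hypothetical).
std::vector<std::string> inputs  = { "input_0" };
std::vector<std::string> outputs = { "output_0" };

if( !net->LoadEngine("my_model.engine", inputs, outputs) )
    printf(LOG_TRT "failed to load engine\n");
```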
| bool tensorNet::LoadEngine | ( | const char * | filename, |
| | | char ** | stream, |
| | | size_t * | size |
| | ) | | |
Load a serialized engine plan file into memory.
| bool tensorNet::LoadEngine | ( | nvinfer1::ICudaEngine * | engine, |
| | | const std::vector< std::string > & | input_blobs, |
| | | const std::vector< std::string > & | output_blobs, |
| | | deviceType | device = DEVICE_GPU, |
| | | cudaStream_t | stream = NULL |
| | ) | | |
Load network resources from an existing TensorRT engine instance.
Parameters
| engine | The existing TensorRT engine instance to load the network resources from. |
| input_blobs | List of names of the input blob data to the network. |
| output_blobs | List of names of the output blobs from the network. |
| bool tensorNet::LoadNetwork | ( | const char * | prototxt, |
| | | const char * | model, |
| | | const char * | mean, |
| | | const char * | input_blob, |
| | | const Dims3 & | input_dims, |
| | | const std::vector< std::string > & | output_blobs, |
| | | uint32_t | maxBatchSize = DEFAULT_MAX_BATCH_SIZE, |
| | | precisionType | precision = TYPE_FASTEST, |
| | | deviceType | device = DEVICE_GPU, |
| | | bool | allowGPUFallback = true, |
| | | nvinfer1::IInt8Calibrator * | calibrator = NULL, |
| | | cudaStream_t | stream = NULL |
| | ) | | |
Load a new network instance (this variant is used for UFF models)
Parameters
| prototxt | File path to the deployable network prototxt |
| model | File path to the caffemodel |
| mean | File path to the mean value binary proto (NULL if none) |
| input_blob | The name of the input blob data to the network. |
| input_dims | The dimensions of the input blob (used for UFF). |
| output_blobs | List of names of the output blobs from the network. |
| maxBatchSize | The maximum batch size that the network will be optimized for. |
| bool tensorNet::LoadNetwork | ( | const char * | prototxt, |
| | | const char * | model, |
| | | const char * | mean, |
| | | const char * | input_blob, |
| | | const std::vector< std::string > & | output_blobs, |
| | | uint32_t | maxBatchSize = DEFAULT_MAX_BATCH_SIZE, |
| | | precisionType | precision = TYPE_FASTEST, |
| | | deviceType | device = DEVICE_GPU, |
| | | bool | allowGPUFallback = true, |
| | | nvinfer1::IInt8Calibrator * | calibrator = NULL, |
| | | cudaStream_t | stream = NULL |
| | ) | | |
Load a new network instance with multiple output layers.
Parameters
| prototxt | File path to the deployable network prototxt |
| model | File path to the caffemodel |
| mean | File path to the mean value binary proto (NULL if none) |
| input_blob | The name of the input blob data to the network. |
| output_blobs | List of names of the output blobs from the network. |
| maxBatchSize | The maximum batch size that the network will be optimized for. |
| bool tensorNet::LoadNetwork | ( | const char * | prototxt, |
| | | const char * | model, |
| | | const char * | mean, |
| | | const std::vector< std::string > & | input_blobs, |
| | | const std::vector< Dims3 > & | input_dims, |
| | | const std::vector< std::string > & | output_blobs, |
| | | uint32_t | maxBatchSize = DEFAULT_MAX_BATCH_SIZE, |
| | | precisionType | precision = TYPE_FASTEST, |
| | | deviceType | device = DEVICE_GPU, |
| | | bool | allowGPUFallback = true, |
| | | nvinfer1::IInt8Calibrator * | calibrator = NULL, |
| | | cudaStream_t | stream = NULL |
| | ) | | |
Load a new network instance with multiple input layers (used for UFF models)
Parameters
| prototxt | File path to the deployable network prototxt |
| model | File path to the caffemodel |
| mean | File path to the mean value binary proto (NULL if none) |
| input_blobs | List of names of the input blob data to the network. |
| input_dims | List of the dimensions of the input blobs (used for UFF). |
| output_blobs | List of names of the output blobs from the network. |
| maxBatchSize | The maximum batch size that the network will be optimized for. |
| bool tensorNet::LoadNetwork | ( | const char * | prototxt, |
| | | const char * | model, |
| | | const char * | mean, |
| | | const std::vector< std::string > & | input_blobs, |
| | | const std::vector< std::string > & | output_blobs, |
| | | uint32_t | maxBatchSize = DEFAULT_MAX_BATCH_SIZE, |
| | | precisionType | precision = TYPE_FASTEST, |
| | | deviceType | device = DEVICE_GPU, |
| | | bool | allowGPUFallback = true, |
| | | nvinfer1::IInt8Calibrator * | calibrator = NULL, |
| | | cudaStream_t | stream = NULL |
| | ) | | |
Load a new network instance with multiple input layers.
Parameters
| prototxt | File path to the deployable network prototxt |
| model | File path to the caffemodel |
| mean | File path to the mean value binary proto (NULL if none) |
| input_blobs | List of names of the input blob data to the network. |
| output_blobs | List of names of the output blobs from the network. |
| maxBatchSize | The maximum batch size that the network will be optimized for. |
| bool tensorNet::LoadNetwork | ( | const char * | prototxt, |
| | | const char * | model, |
| | | const char * | mean = NULL, |
| | | const char * | input_blob = "data", |
| | | const char * | output_blob = "prob", |
| | | uint32_t | maxBatchSize = DEFAULT_MAX_BATCH_SIZE, |
| | | precisionType | precision = TYPE_FASTEST, |
| | | deviceType | device = DEVICE_GPU, |
| | | bool | allowGPUFallback = true, |
| | | nvinfer1::IInt8Calibrator * | calibrator = NULL, |
| | | cudaStream_t | stream = NULL |
| | ) | | |
Load a new network instance.
Parameters
| prototxt | File path to the deployable network prototxt |
| model | File path to the caffemodel |
| mean | File path to the mean value binary proto (NULL if none) |
| input_blob | The name of the input blob data to the network. |
| output_blob | The name of the output blob data from the network. |
| maxBatchSize | The maximum batch size that the network will be optimized for. |
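A minimal sketch of this variant for a caffe model, with hypothetical file paths (net is assumed to be a tensorNet-derived instance):

```cpp
// Sketch: load a caffe classification model in FP16 on the GPU.
const bool ok = net->LoadNetwork("deploy.prototxt",    // network description
                                 "model.caffemodel",   // trained weights
                                 NULL,                 // no mean binaryproto
                                 "data",               // input blob name
                                 "prob",               // output blob name
                                 1,                    // max batch size
                                 TYPE_FP16,            // precision
                                 DEVICE_GPU);          // device
```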
|
| void tensorNet::PrintProfilerTimes | ( | | ) | |
| inline |
Print the profiler times (in milliseconds).
|
| bool tensorNet::ProcessNetwork | ( | bool | sync = true | ) | |
| protected |
Execute processing of the network.
Parameters
| sync | If true (default), the device will be synchronized after processing and the thread/function will block until processing is complete. If false, the function will return immediately after processing has been enqueued to the CUDA stream indicated by GetStream(). |
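A sketch of the asynchronous path, assuming this code runs inside a tensorNet subclass (ProcessNetwork() is protected):

```cpp
// Sketch: enqueue inference without blocking, then synchronize later.
cudaStream_t stream = CreateStream();   // create & attach a CUDA stream

if( ProcessNetwork(false) )             // returns once enqueued
{
    // ...overlap other CPU work here...
    cudaStreamSynchronize(stream);      // wait for inference to complete
}
```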
|
| bool tensorNet::ProfileModel | ( | const std::string & | deployFile, |
| | | const std::string & | modelFile, |
| | | const std::vector< std::string > & | inputs, |
| | | const std::vector< Dims3 > & | inputDims, |
| | | const std::vector< std::string > & | outputs, |
| | | uint32_t | maxBatchSize, |
| | | precisionType | precision, |
| | | deviceType | device, |
| | | bool | allowGPUFallback, |
| | | nvinfer1::IInt8Calibrator * | calibrator, |
| | | char ** | engineStream, |
| | | size_t * | engineSize |
| | ) | | |
| protected |
Create and output an optimized network model.
Note: this function is automatically used by LoadNetwork, but can also be used individually to perform the network operations offline.
Parameters
| deployFile | name for network prototxt |
| modelFile | name for model |
| outputs | network outputs |
| maxBatchSize | maximum batch size |
| engineStream | output model stream |
|
| void tensorNet::PROFILER_BEGIN | ( | profilerQuery | query | ) | |
| inlineprotected |
Begin a profiling query, before network is run.
|
| void tensorNet::PROFILER_END | ( | profilerQuery | query | ) | |
| inlineprotected |
End a profiling query, after the network is run.
|
| bool tensorNet::PROFILER_QUERY | ( | profilerQuery | query | ) | |
| inlineprotected |
Query the CUDA part of a profiler query.
|
| static precisionType tensorNet::SelectPrecision | ( | precisionType | precision, |
| | | deviceType | device = DEVICE_GPU, |
| | | bool | allowInt8 = true |
| | ) | | |
| static |
Resolve a desired precision to a specific one that's available.
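For example, a sketch resolving TYPE_FASTEST to a concrete precision on the DLA:

```cpp
// Sketch: resolve the 'fastest' precision to one the device supports.
const precisionType precision =
    tensorNet::SelectPrecision(TYPE_FASTEST, DEVICE_DLA, /*allowInt8=*/false);

printf("selected precision: %s\n", precisionTypeToStr(precision));
```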
| void tensorNet::SetStream | ( | cudaStream_t | stream | ) | |
Set the stream that the device is operating on.
|
| bool tensorNet::ValidateEngine | ( | const char * | model_path, |
| | | const char * | cache_path, |
| | | const char * | checksum_path |
| | ) | | |
| protected |
Validate that the model already has a built TensorRT engine that exists and doesn't need updating.
|
| tensorNet::Logger tensorNet::gLogger |
| protected |
|
| tensorNet::Profiler tensorNet::gProfiler |
| protected |
|
| bool tensorNet::mAllowGPUFallback |
| protected |
|
| void** tensorNet::mBindings |
| protected |
|
| std::string tensorNet::mCacheCalibrationPath |
| protected |
|
| std::string tensorNet::mCacheEnginePath |
| protected |
|
| std::string tensorNet::mChecksumPath |
| protected |
|
| nvinfer1::IExecutionContext* tensorNet::mContext |
| protected |
|
| deviceType tensorNet::mDevice |
| protected |
|
| bool tensorNet::mEnableDebug |
| protected |
|
| bool tensorNet::mEnableProfiler |
| protected |
|
| nvinfer1::ICudaEngine* tensorNet::mEngine |
| protected |
|
| timespec tensorNet::mEventsCPU[PROFILER_TOTAL *2] |
| protected |
|
| cudaEvent_t tensorNet::mEventsGPU[PROFILER_TOTAL *2] |
| protected |
|
| nvinfer1::IRuntime* tensorNet::mInfer |
| protected |
|
| std::vector<layerInfo> tensorNet::mInputs |
| protected |
|
| uint32_t tensorNet::mMaxBatchSize |
| protected |
|
| std::string tensorNet::mMeanPath |
| protected |
|
| std::string tensorNet::mModelFile |
| protected |
|
| std::string tensorNet::mModelPath |
| protected |
|
| modelType tensorNet::mModelType |
| protected |
|
| std::vector<layerInfo> tensorNet::mOutputs |
| protected |
|
| precisionType tensorNet::mPrecision |
| protected |
|
| uint32_t tensorNet::mProfilerQueriesDone |
| protected |
|
| uint32_t tensorNet::mProfilerQueriesUsed |
| protected |
|
| float2 tensorNet::mProfilerTimes[PROFILER_TOTAL+1] |
| protected |
|
| std::string tensorNet::mPrototxtPath |
| protected |
|
| cudaStream_t tensorNet::mStream |
| protected |
|
| uint32_t tensorNet::mWorkspaceSize |
| protected |
| #define DEFAULT_MAX_BATCH_SIZE 1 |
Default maximum batch size.
| #define LOG_TRT "[TRT] " |
Prefix used for tagging printed log output from TensorRT.
| #define TENSORRT_VERSION_CHECK | ( | major, |
| | | minor, |
| | | patch |
| | ) | (NV_TENSORRT_MAJOR > major || (NV_TENSORRT_MAJOR == major && NV_TENSORRT_MINOR > minor) || (NV_TENSORRT_MAJOR == major && NV_TENSORRT_MINOR == minor && NV_TENSORRT_PATCH >= patch)) |
Macro for checking the minimum version of TensorRT that is installed.
This evaluates to true if TensorRT is newer or equal to the provided version.
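For example, this sketch compiles a code path only when TensorRT 8.5.0 or newer is installed (the version numbers are illustrative):

```cpp
// Sketch: guard code that requires a minimum TensorRT version.
#if TENSORRT_VERSION_CHECK(8, 5, 0)
    // ...use an API that requires TensorRT >= 8.5.0 here...
#endif
```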
| enum deviceType |
Enumeration for indicating the desired device that the network should run on, if available in hardware.
| Enumerator | Description |
|---|---|
| DEVICE_GPU | GPU (if multiple GPUs are present, a specific GPU can be selected with cudaSetDevice()). |
| DEVICE_DLA | Deep Learning Accelerator (DLA) Core 0 (only on Jetson Xavier). |
| DEVICE_DLA_0 | Deep Learning Accelerator (DLA) Core 0 (only on Jetson Xavier). |
| DEVICE_DLA_1 | Deep Learning Accelerator (DLA) Core 1 (only on Jetson Xavier). |
| NUM_DEVICES | Number of device types defined. |
|
| enum modelType |
Enumeration indicating the format of the model that's imported in TensorRT (either caffe, ONNX, or UFF).
| Enumerator | Description |
|---|---|
| MODEL_CUSTOM | Created directly with TensorRT API. |
| MODEL_CAFFE | caffemodel |
| MODEL_ONNX | ONNX |
| MODEL_UFF | UFF |
| MODEL_ENGINE | TensorRT engine/plan |
|
| enum precisionType |
Enumeration for indicating the desired precision that the network should run in, if available in hardware.
| Enumerator | Description |
|---|---|
| TYPE_DISABLED | Unknown, unspecified, or disabled type. |
| TYPE_FASTEST | The fastest detected precision should be used (i.e. try INT8, then FP16, then FP32). |
| TYPE_FP32 | 32-bit floating-point precision (FP32). |
| TYPE_FP16 | 16-bit floating-point half precision (FP16). |
| TYPE_INT8 | 8-bit integer precision (INT8). |
| NUM_PRECISIONS | Number of precision types defined. |
|
| enum profilerDevice |
Profiler device.
| Enumerator | Description |
|---|---|
| PROFILER_CPU | CPU walltime. |
| PROFILER_CUDA | CUDA kernel time. |
|
| enum profilerQuery |
Profiling queries.
See also: tensorNet::GetProfilerTime()
| Enumerator |
|---|
| PROFILER_PREPROCESS |
| PROFILER_NETWORK |
| PROFILER_POSTPROCESS |
| PROFILER_VISUALIZE |
| PROFILER_TOTAL |
| deviceType deviceTypeFromStr | ( | const char * | str | ) | |
Parse the device type from a string.
| const char* deviceTypeToStr | ( | deviceType | type | ) | |
Stringize function that returns deviceType in text.
| modelType modelTypeFromPath | ( | const char * | path | ) | |
Parse the model format from a file path.
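A brief sketch, with a hypothetical model path:

```cpp
// Sketch: infer the model format from a file extension.
const modelType type = modelTypeFromPath("networks/resnet18.onnx");

if( type == MODEL_ONNX )
    printf("detected format: %s\n", modelTypeToStr(type));
```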
| modelType modelTypeFromStr | ( | const char * | str | ) | |
Parse the model format from a string.
| const char* modelTypeToStr | ( | modelType | type | ) | |
Stringize function that returns modelType in text.
| precisionType precisionTypeFromStr | ( | const char * | str | ) | |
Parse the precision type from a string.
| const char* precisionTypeToStr | ( | precisionType | type | ) | |
Stringize function that returns precisionType in text.
| const char* profilerQueryToStr | ( | profilerQuery | query | ) | |
Stringize function that returns profilerQuery in text.