Jetson Inference: tensorNet Class Reference - Jetson Inference

| | Jetson Inference

DNN Vision Library |

tensorNet Class Reference DNN Vision Library (jetson-inference) » tensorNet

Abstract class for loading a tensor network with TensorRT. More...

#include <tensorNet.h>

Inheritance diagram for tensorNet:

Classes

Public Member Functions

| | virtual | ~tensorNet () | | | Destory. More...
| | | | bool | LoadNetwork (const char *prototxt, const char *model, const char *mean=NULL, const char *input_blob="data", const char *output_blob="prob", uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) | | | Load a new network instance. More...
| | | | bool | LoadNetwork (const char *prototxt, const char *model, const char *mean, const char *input_blob, const std::vector< std::string > &output_blobs, uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) | | | Load a new network instance with multiple output layers. More...
| | | | bool | LoadNetwork (const char *prototxt, const char *model, const char *mean, const std::vector< std::string > &input_blobs, const std::vector< std::string > &output_blobs, uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) | | | Load a new network instance with multiple input layers. More...
| | | | bool | LoadNetwork (const char *prototxt, const char *model, const char *mean, const char *input_blob, const Dims3 &input_dims, const std::vector< std::string > &output_blobs, uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) | | | Load a new network instance (this variant is used for UFF models) More...
| | | | bool | LoadNetwork (const char *prototxt, const char *model, const char *mean, const std::vector< std::string > &input_blobs, const std::vector< Dims3 > &input_dims, const std::vector< std::string > &output_blobs, uint32_t maxBatchSize=DEFAULT_MAX_BATCH_SIZE, precisionType precision=TYPE_FASTEST, deviceType device=DEVICE_GPU, bool allowGPUFallback=true, nvinfer1::IInt8Calibrator *calibrator=NULL, cudaStream_t stream=NULL) | | | Load a new network instance with multiple input layers (used for UFF models) More...
| | | | bool | LoadEngine (const char *engine_filename, const std::vector< std::string > &input_blobs, const std::vector< std::string > &output_blobs, nvinfer1::IPluginFactory *pluginFactory=NULL, deviceType device=DEVICE_GPU, cudaStream_t stream=NULL) | | | Load a network instance from a serialized engine plan file. More...
| | | | bool | LoadEngine (char *engine_stream, size_t engine_size, const std::vector< std::string > &input_blobs, const std::vector< std::string > &output_blobs, nvinfer1::IPluginFactory *pluginFactory=NULL, deviceType device=DEVICE_GPU, cudaStream_t stream=NULL) | | | Load a network instance from a serialized engine plan file. More...
| | | | bool | LoadEngine (nvinfer1::ICudaEngine *engine, const std::vector< std::string > &input_blobs, const std::vector< std::string > &output_blobs, deviceType device=DEVICE_GPU, cudaStream_t stream=NULL) | | | Load network resources from an existing TensorRT engine instance. More...
| | | | bool | LoadEngine (const char *filename, char **stream, size_t *size) | | | Load a serialized engine plan file into memory. More...
| | | | void | EnableLayerProfiler () | | | Manually enable layer profiling times. More...
| | | | void | EnableDebug () | | | Manually enable debug messages and synchronization. More...
| | | | bool | AllowGPUFallback () const | | | Return true if GPU fallback is enabled. More...
| | | | deviceType | GetDevice () const | | | Retrieve the device being used for execution. More...
| | | | precisionType | GetPrecision () const | | | Retrieve the type of precision being used. More...
| | | | bool | IsPrecision (precisionType type) const | | | Check if a particular precision is being used. More...
| | | | cudaStream_t | GetStream () const | | | Retrieve the stream that the device is operating on. More...
| | | | cudaStream_t | CreateStream (bool nonBlocking=true) | | | Create and use a new stream for execution. More...
| | | | void | SetStream (cudaStream_t stream) | | | Set the stream that the device is operating on. More...
| | | | const char * | GetPrototxtPath () const | | | Retrieve the path to the network prototxt file. More...
| | | | const char * | GetModelPath () const | | | Retrieve the full path to model file, including the filename. More...
| | | | const char * | GetModelFilename () const | | | Retrieve the filename of the file, excluding the directory. More...
| | | | modelType | GetModelType () const | | | Retrieve the format of the network model. More...
| | | | bool | IsModelType (modelType type) const | | | Return true if the model is of the specified format. More...
| | | | uint32_t | GetInputLayers () const | | | Retrieve the number of input layers to the network. More...
| | | | uint32_t | GetOutputLayers () const | | | Retrieve the number of output layers to the network. More...
| | | | Dims3 | GetInputDims (uint32_t layer=0) const | | | Retrieve the dimensions of network input layer. More...
| | | | uint32_t | GetInputWidth (uint32_t layer=0) const | | | Retrieve the width of network input layer. More...
| | | | uint32_t | GetInputHeight (uint32_t layer=0) const | | | Retrieve the height of network input layer. More...
| | | | uint32_t | GetInputSize (uint32_t layer=0) const | | | Retrieve the size (in bytes) of network input layer. More...
| | | | float * | GetInputPtr (uint32_t layer=0) const | | | Get the CUDA pointer to the input layer's memory. More...
| | | | Dims3 | GetOutputDims (uint32_t layer=0) const | | | Retrieve the dimensions of network output layer. More...
| | | | uint32_t | GetOutputWidth (uint32_t layer=0) const | | | Retrieve the width of network output layer. More...
| | | | uint32_t | GetOutputHeight (uint32_t layer=0) const | | | Retrieve the height of network output layer. More...
| | | | uint32_t | GetOutputSize (uint32_t layer=0) const | | | Retrieve the size (in bytes) of network output layer. More...
| | | | float * | GetOutputPtr (uint32_t layer=0) const | | | Get the CUDA pointer to the output memory. More...
| | | | float | GetNetworkFPS () | | | Retrieve the network frames per second (FPS). More...
| | | | float | GetNetworkTime () | | | Retrieve the network runtime (in milliseconds). More...
| | | | const char * | GetNetworkName () const | | | Retrieve the network name (it's filename). More...
| | | | float2 | GetProfilerTime (profilerQuery query) | | | Retrieve the profiler runtime (in milliseconds). More...
| | | | float | GetProfilerTime (profilerQuery query, profilerDevice device) | | | Retrieve the profiler runtime (in milliseconds). More...
| | | | void | PrintProfilerTimes () | | | Print the profiler times (in millseconds). More...
| | |

Static Public Member Functions

| | static bool | LoadClassLabels (const char *filename, std::vector< std::string > &descriptions, int expectedClasses=-1) | | | Load class descriptions from a label file. More...
| | | | static bool | LoadClassLabels (const char *filename, std::vector< std::string > &descriptions, std::vector< std::string > &synsets, int expectedClasses=-1) | | | Load class descriptions and synset strings from a label file. More...
| | | | static bool | LoadClassColors (const char *filename, float4 *colors, int expectedClasses, float defaultAlpha=255.0f) | | | Load class colors from a text file. More...
| | | | static bool | LoadClassColors (const char *filename, float4 **colors, int expectedClasses, float defaultAlpha=255.0f) | | | Load class colors from a text file. More...
| | | | static float4 | GenerateColor (uint32_t classID, float alpha=255.0f) | | | Procedurally generate a color for a given class index with the specified alpha value. More...
| | | | static precisionType | SelectPrecision (precisionType precision, deviceType device=DEVICE_GPU, bool allowInt8=true) | | | Resolve a desired precision to a specific one that's available. More...
| | | | static precisionType | FindFastestPrecision (deviceType device=DEVICE_GPU, bool allowInt8=true) | | | Determine the fastest native precision on a device. More...
| | | | static std::vector< precisionType > | DetectNativePrecisions (deviceType device=DEVICE_GPU) | | | Detect the precisions supported natively on a device. More...
| | | | static bool | DetectNativePrecision (const std::vector< precisionType > &nativeTypes, precisionType type) | | | Detect if a particular precision is supported natively. More...
| | | | static bool | DetectNativePrecision (precisionType precision, deviceType device=DEVICE_GPU) | | | Detect if a particular precision is supported natively. More...
| | |

Protected Member Functions

| | | tensorNet () | | | Constructor. More...
| | | | bool | ProcessNetwork (bool sync=true) | | | Execute processing of the network. More...
| | | | bool | ProfileModel (const std::string &deployFile, const std::string &modelFile, const std::vector< std::string > &inputs, const std::vector< Dims3 > &inputDims, const std::vector< std::string > &outputs, uint32_t maxBatchSize, precisionType precision, deviceType device, bool allowGPUFallback, nvinfer1::IInt8Calibrator *calibrator, char **engineStream, size_t *engineSize) | | | Create and output an optimized network model. More...
| | | | bool | ConfigureBuilder (nvinfer1::IBuilder *builder, uint32_t maxBatchSize, uint32_t workspaceSize, precisionType precision, deviceType device, bool allowGPUFallback, nvinfer1::IInt8Calibrator *calibrator) | | | Configure builder options. More...
| | | | bool | ValidateEngine (const char *model_path, const char *cache_path, const char *checksum_path) | | | Validate that the model already has a built TensorRT engine that exists and doesn't need updating. More...
| | | | void | PROFILER_BEGIN (profilerQuery query) | | | Begin a profiling query, before network is run. More...
| | | | void | PROFILER_END (profilerQuery query) | | | End a profiling query, after the network is run. More...
| | | | bool | PROFILER_QUERY (profilerQuery query) | | | Query the CUDA part of a profiler query. More...
| | |

Protected Attributes

Detailed Description

Abstract class for loading a tensor network with TensorRT.

For example implementations,

Constructor & Destructor Documentation

◆~tensorNet()

| virtual tensorNet::~tensorNet | ( | | ) | |

| virtual |

Destory.

◆tensorNet()

| tensorNet::tensorNet | ( | | ) | |

| protected |

Constructor.

Member Function Documentation

◆AllowGPUFallback()

| bool tensorNet::AllowGPUFallback | ( | | ) | const |

| inline |

Return true if GPU fallback is enabled.

◆ConfigureBuilder()

| protected |

Configure builder options.

◆CreateStream()

Create and use a new stream for execution.

◆DetectNativePrecision() [1/2]

| static |

Detect if a particular precision is supported natively.

◆DetectNativePrecision() [2/2]

| static |

Detect if a particular precision is supported natively.

◆DetectNativePrecisions()

| static |

Detect the precisions supported natively on a device.

◆EnableDebug()

| void tensorNet::EnableDebug | ( | | ) | |

Manually enable debug messages and synchronization.

◆EnableLayerProfiler()

| void tensorNet::EnableLayerProfiler | ( | | ) | |

Manually enable layer profiling times.

◆FindFastestPrecision()

| static |

Determine the fastest native precision on a device.

◆GenerateColor()

| static |

Procedurally generate a color for a given class index with the specified alpha value.

This function can be used to generate a range of colors when a colors.txt file isn't available.

◆GetDevice()

| deviceType tensorNet::GetDevice | ( | | ) | const |

| inline |

Retrieve the device being used for execution.

◆GetInputDims()

| inline |

Retrieve the dimensions of network input layer.

◆GetInputHeight()

| inline |

Retrieve the height of network input layer.

◆GetInputLayers()

| uint32_t tensorNet::GetInputLayers | ( | | ) | const |

| inline |

Retrieve the number of input layers to the network.

◆GetInputPtr()

| inline |

Get the CUDA pointer to the input layer's memory.

◆GetInputSize()

| inline |

Retrieve the size (in bytes) of network input layer.

◆GetInputWidth()

| inline |

Retrieve the width of network input layer.

◆GetModelFilename()

| const char* tensorNet::GetModelFilename | ( | | ) | const |

| inline |

Retrieve the filename of the file, excluding the directory.

◆GetModelPath()

| const char* tensorNet::GetModelPath | ( | | ) | const |

| inline |

Retrieve the full path to model file, including the filename.

◆GetModelType()

| modelType tensorNet::GetModelType | ( | | ) | const |

| inline |

Retrieve the format of the network model.

◆GetNetworkFPS()

| float tensorNet::GetNetworkFPS | ( | | ) | |

| inline |

Retrieve the network frames per second (FPS).

◆GetNetworkName()

| const char* tensorNet::GetNetworkName | ( | | ) | const |

| inline |

Retrieve the network name (it's filename).

◆GetNetworkTime()

| float tensorNet::GetNetworkTime | ( | | ) | |

| inline |

Retrieve the network runtime (in milliseconds).

◆GetOutputDims()

| inline |

Retrieve the dimensions of network output layer.

◆GetOutputHeight()

| inline |

Retrieve the height of network output layer.

◆GetOutputLayers()

| uint32_t tensorNet::GetOutputLayers | ( | | ) | const |

| inline |

Retrieve the number of output layers to the network.

◆GetOutputPtr()

| inline |

Get the CUDA pointer to the output memory.

◆GetOutputSize()

| inline |

Retrieve the size (in bytes) of network output layer.

◆GetOutputWidth()

| inline |

Retrieve the width of network output layer.

◆GetPrecision()

| precisionType tensorNet::GetPrecision | ( | | ) | const |

| inline |

Retrieve the type of precision being used.

◆GetProfilerTime() [1/2]

| inline |

Retrieve the profiler runtime (in milliseconds).

◆GetProfilerTime() [2/2]

| inline |

Retrieve the profiler runtime (in milliseconds).

◆GetPrototxtPath()

| const char* tensorNet::GetPrototxtPath | ( | | ) | const |

| inline |

Retrieve the path to the network prototxt file.

◆GetStream()

| cudaStream_t tensorNet::GetStream | ( | | ) | const |

| inline |

Retrieve the stream that the device is operating on.

◆IsModelType()

| inline |

Return true if the model is of the specified format.

◆IsPrecision()

| inline |

Check if a particular precision is being used.

◆LoadClassColors() [1/2]

| static |

Load class colors from a text file.

If the number of expected colors aren't parsed, they will be generated. The float4 color array will automatically be allocated in shared CPU/GPU memory by cudaAllocMapped(). If a line in the text file only has RGB, then the defaultAlpha value will be used for the alpha channel.

◆LoadClassColors() [2/2]

| static |

Load class colors from a text file.

If the number of expected colors aren't parsed, they will be generated. The float4 color array should be expectedClasses long, and would typically be in shared CPU/GPU memory. If a line in the text file only has RGB, then the defaultAlpha value will be used for the alpha channel.

◆LoadClassLabels() [1/2]

| static |

Load class descriptions from a label file.

Each line of the text file should include one class label (and optionally a synset). If the number of expected labels aren't parsed, they will be automatically generated.

◆LoadClassLabels() [2/2]

| static |

Load class descriptions and synset strings from a label file.

Each line of the text file should include one class label (and optionally a synset). If the number of expected labels aren't parsed, they will be automatically generated.

◆LoadEngine() [1/4]

Load a network instance from a serialized engine plan file.

Parameters

◆LoadEngine() [2/4]

Load a network instance from a serialized engine plan file.

Parameters

◆LoadEngine() [3/4]

Load a serialized engine plan file into memory.

◆LoadEngine() [4/4]

Load network resources from an existing TensorRT engine instance.

Parameters

◆LoadNetwork() [1/5]

Load a new network instance (this variant is used for UFF models)

Parameters

◆LoadNetwork() [2/5]

Load a new network instance with multiple output layers.

Parameters

◆LoadNetwork() [3/5]

Load a new network instance with multiple input layers (used for UFF models)

Parameters

◆LoadNetwork() [4/5]

Load a new network instance with multiple input layers.

Parameters

◆LoadNetwork() [5/5]

Load a new network instance.

Parameters

◆PrintProfilerTimes()

| void tensorNet::PrintProfilerTimes | ( | | ) | |

| inline |

Print the profiler times (in millseconds).

◆ProcessNetwork()

| protected |

Execute processing of the network.

Parameters

| sync | if true (default), the device will be synchronized after processing and the thread/function will block until processing is complete. if false, the function will return immediately after the processing has been enqueued to the CUDA stream indicated by GetStream(). |

◆ProfileModel()

| protected |

Create and output an optimized network model.

Notethis function is automatically used by LoadNetwork, but also can be used individually to perform the network operations offline. Parameters

◆PROFILER_BEGIN()

| inlineprotected |

Begin a profiling query, before network is run.

◆PROFILER_END()

| inlineprotected |

End a profiling query, after the network is run.

◆PROFILER_QUERY()

| inlineprotected |

Query the CUDA part of a profiler query.

◆SelectPrecision()

| static |

Resolve a desired precision to a specific one that's available.

◆SetStream()

Set the stream that the device is operating on.

◆ValidateEngine()

| protected |

Validate that the model already has a built TensorRT engine that exists and doesn't need updating.

Member Data Documentation

◆gLogger

| tensorNet::Logger tensorNet::gLogger |

| protected |

◆gProfiler

| tensorNet::Profiler tensorNet::gProfiler |

| protected |

◆mAllowGPUFallback

| bool tensorNet::mAllowGPUFallback |

| protected |

◆mBindings

| void** tensorNet::mBindings |

| protected |

◆mCacheCalibrationPath

| std::string tensorNet::mCacheCalibrationPath |

| protected |

◆mCacheEnginePath

| std::string tensorNet::mCacheEnginePath |

| protected |

◆mChecksumPath

| std::string tensorNet::mChecksumPath |

| protected |

◆mContext

| nvinfer1::IExecutionContext* tensorNet::mContext |

| protected |

◆mDevice

| deviceType tensorNet::mDevice |

| protected |

◆mEnableDebug

| bool tensorNet::mEnableDebug |

| protected |

◆mEnableProfiler

| bool tensorNet::mEnableProfiler |

| protected |

◆mEngine

| nvinfer1::ICudaEngine* tensorNet::mEngine |

| protected |

◆mEventsCPU

| timespec tensorNet::mEventsCPU[PROFILER_TOTAL *2] |

| protected |

◆mEventsGPU

| cudaEvent_t tensorNet::mEventsGPU[PROFILER_TOTAL *2] |

| protected |

◆mInfer

| nvinfer1::IRuntime* tensorNet::mInfer |

| protected |

◆mInputs

| std::vector<layerInfo> tensorNet::mInputs |

| protected |

◆mMaxBatchSize

| uint32_t tensorNet::mMaxBatchSize |

| protected |

◆mMeanPath

| std::string tensorNet::mMeanPath |

| protected |

◆mModelFile

| std::string tensorNet::mModelFile |

| protected |

◆mModelPath

| std::string tensorNet::mModelPath |

| protected |

◆mModelType

| modelType tensorNet::mModelType |

| protected |

◆mOutputs

| std::vector<layerInfo> tensorNet::mOutputs |

| protected |

◆mPrecision

| precisionType tensorNet::mPrecision |

| protected |

◆mProfilerQueriesDone

| uint32_t tensorNet::mProfilerQueriesDone |

| protected |

◆mProfilerQueriesUsed

| uint32_t tensorNet::mProfilerQueriesUsed |

| protected |

◆mProfilerTimes

| float2 tensorNet::mProfilerTimes[PROFILER_TOTAL+1] |

| protected |

◆mPrototxtPath

| std::string tensorNet::mPrototxtPath |

| protected |

◆mStream

| cudaStream_t tensorNet::mStream |

| protected |

◆mWorkspaceSize

| uint32_t tensorNet::mWorkspaceSize |

| protected |

The documentation for this class was generated from the following file:

jetson-inference/tensorNet.h
tensorNet
Generated on Fri Mar 17 2023 14:29:30 for Jetson Inference by 1.8.17