sampleOnnxMNIST

“Hello World” For TensorRT From ONNX

Table Of Contents

Description
How does this sample work?
Running the sample
- Sample --help options
Additional resources
License
Changelog
Known issues

Description

This sample, sampleOnnxMNIST, converts a model trained on the MNIST dataset in Open Neural Network Exchange (ONNX) format to a TensorRT network and runs inference on the network.

ONNX is a standard for representing deep learning models that enables models to be transferred between frameworks.

How does this sample work?

This sample creates and runs the TensorRT engine from an ONNX model of the MNIST network. It demonstrates how TensorRT can consume an ONNX model as input to create a network.

Specifically, this sample:

Converts the ONNX model to a TensorRT network
Builds an engine
Runs inference using the generated TensorRT network

Converting the ONNX model to a TensorRT network

The model file can be converted to a TensorRT network using the ONNX parser. The parser can be initialized with the network definition that the parser will write to and the logger object.

auto parser = nvonnxparser::createParser(*network, sample::gLogger.getTRTLogger());

The ONNX model file is then passed onto the parser along with the logging level

if (!parser->parseFromFile(model_file, static_cast<int>(sample::gLogger.getReportableSeverity())))
{
	  string msg("failed to parse onnx file");
	  sample::gLogger->log(nvinfer1::ILogger::Severity::kERROR, msg.c_str());
	  exit(EXIT_FAILURE);
}

After the TensorRT network is constructed by parsing the model, the TensorRT engine can be built to run inference.

Building the engine

To build the engine, create the builder and pass a logger created for TensorRT which is used for reporting errors, warnings and informational messages in the network: IBuilder* builder = createInferBuilder(sample::gLogger);

To build the engine from the generated TensorRT network, issue the following call: std::unique_ptr<nvinfer1::IHostMemory> plan{builder->buildSerializedNetwork(*network, *config)};

After you build the engine, verify that the engine is running properly by confirming the output is what you expected. The output format of this sample should be the same as the output of sampleMNIST.

Running inference

To run inference using the created engine, see Performing Inference In C++.

Note: It’s important to preprocess the data and convert it to the format accepted by the network. In this example, the sample input is in PGM (portable graymap) format. The model expects an input of image 1x28x28 scaled to between [0,1].

TensorRT API layers and ops

In this sample, the following layers are used. For more information about these layers, see the TensorRT Developer Guide: Layers documentation.

Activation layer The Activation layer implements element-wise activation functions. Specifically, this sample uses the Activation layer with the type kRELU.

Convolution layer The Convolution layer computes a 2D (channel, height, and width) convolution, with or without bias.

MatrixMultiplyLayer The MatrixMultiply layer implements a matrix multiplication operation. (The FullyConnected layer is deprecated since 8.4. The bias of a FullyConnected layer can be added with an ElementwiseLayer of SUM operation.)

Pooling layer The Pooling layer implements pooling within a channel. Supported pooling types are maximum, average and maximum-average blend.

Scale layer The Scale layer implements a per-tensor, per-channel, or per-element affine transformation and/or exponentiation by constant values.

Shuffle layer The Shuffle layer implements a reshape and transpose operator for tensors.

Prerequisites

Preparing sample data

See Preparing sample data in the main samples README.

Running the sample

Compile the sample by following build instructions in TensorRT README.

Run the sample to build and run the MNIST engine from the ONNX model.

./sample_onnx_mnist [-h or --help] [-d or --datadir=<path to data directory>] [--useDLACore=<int>]

Verify that the sample ran successfully. If the sample runs successfully you should see output similar to the following:

&&&& RUNNING TensorRT.sample_onnx_mnist # ./sample_onnx_mnist
----------------------------------------------------------------
Input filename: ../../../../../../data/samples/mnist/mnist.onnx
ONNX IR version: 0.0.3
Opset version: 1
Producer name: CNTK
Producer version: 2.4
Domain:
Model version: 1
Doc string:
----------------------------------------------------------------
[I] Input:
@@@@@@@@@@@@@@@@@@@@@@@@@@@@
@@@@@@@@@@@@@@@@@@@@@@@@@@@@
@@@@@@@@@@@@@@@@@@@@@@@@@@@@
@@@@@@@@@@@@@@@@@@@@@@@@@@@@
@@@@@@@@@@@*.  .*@@@@@@@@@@@
@@@@@@@@@@*.     +@@@@@@@@@@
@@@@@@@@@@. :#+   %@@@@@@@@@
@@@@@@@@@@.:@@@+  +@@@@@@@@@
@@@@@@@@@@.:@@@@:  +@@@@@@@@
@@@@@@@@@@=%@@@@:  +@@@@@@@@
@@@@@@@@@@@@@@@@#  +@@@@@@@@
@@@@@@@@@@@@@@@@*  +@@@@@@@@
@@@@@@@@@@@@@@@@:  +@@@@@@@@
@@@@@@@@@@@@@@@@:  +@@@@@@@@
@@@@@@@@@@@@@@@*  .@@@@@@@@@
@@@@@@@@@@%**%@.  *@@@@@@@@@
@@@@@@@@%+.  .:  .@@@@@@@@@@
@@@@@@@@=  ..    :@@@@@@@@@@
@@@@@@@@:  *@@:  :@@@@@@@@@@
@@@@@@@%   %@*    *@@@@@@@@@
@@@@@@@%   ++ ++  .%@@@@@@@@
@@@@@@@@-    +@@-  +@@@@@@@@
@@@@@@@@=  :*@@@#  .%@@@@@@@
@@@@@@@@@+*@@@@@%.   %@@@@@@
@@@@@@@@@@@@@@@@@@@@@@@@@@@@
@@@@@@@@@@@@@@@@@@@@@@@@@@@@
@@@@@@@@@@@@@@@@@@@@@@@@@@@@
@@@@@@@@@@@@@@@@@@@@@@@@@@@@

[I] Output:
Prob 0 0.0000 Class 0:
Prob 1 0.0000 Class 1:
Prob 2 1.0000 Class 2: **********
Prob 3 0.0000 Class 3:
Prob 4 0.0000 Class 4:
Prob 5 0.0000 Class 5:
Prob 6 0.0000 Class 6:
Prob 7 0.0000 Class 7:
Prob 8 0.0000 Class 8:
Prob 9 0.0000 Class 9:

&&&& PASSED TensorRT.sample_onnx_mnist # ./sample_onnx_mnist

This output shows that the sample ran successfully; PASSED.

Sample `--help` options

To see the full list of available options and their descriptions, use the -h or --help command line option.

Additional resources

The following resources provide a deeper understanding about the ONNX project and MNIST model:

ONNX

Models

Documentation

License

For terms and conditions for use, reproduction, and distribution, see the TensorRT Software License Agreement documentation.

Changelog

October 2025 Migrate to strongly typed APIs.

March 2019 This README.md file was recreated, updated and reviewed.

Known issues

There are no known issues in this sample.

Name		Name	Last commit message	Last commit date
parent directory ..
CMakeLists.txt		CMakeLists.txt
README.md		README.md
sampleOnnxMNIST.cpp		sampleOnnxMNIST.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

“Hello World” For TensorRT From ONNX

Description

How does this sample work?

Converting the ONNX model to a TensorRT network

Building the engine

Running inference

TensorRT API layers and ops

Prerequisites

Running the sample

Sample `--help` options

Additional resources

License

Changelog

Known issues

FilesExpand file tree

sampleOnnxMNIST

Directory actions

More options

Directory actions

More options

Latest commit

History

sampleOnnxMNIST

Folders and files

parent directory

README.md

“Hello World” For TensorRT From ONNX

Description

How does this sample work?

Converting the ONNX model to a TensorRT network

Building the engine

Running inference

TensorRT API layers and ops

Prerequisites

Running the sample

Sample --help options

Additional resources

License

Changelog

Known issues

Sample `--help` options