User Guide

Jetson AGX™ Orin Developer Kit Reviewer's Guide

NVIDIA TAO (Train-Adapt-Optimize) is a framework that lets developers create custom,
production-ready models in hours rather than months, without AI expertise or large training
datasets. The NVIDIA TAO Toolkit abstracts away the complexity of AI and deep learning frameworks,
letting you fine-tune high-quality NVIDIA pre-trained models with only a fraction of the data
needed to train from scratch. Customers can use the TAO Toolkit to fine-tune and optimize models
for a wide variety of use cases, from computer vision and automatic speech recognition to speech
synthesis and natural language understanding.

Collecting and annotating data is an expensive and laborious process. Simulation can help meet the
need for data. NVIDIA Omniverse Replicator uses simulation to generate synthetic data that is an
order of magnitude faster and cheaper to create than real data. With Omniverse Replicator, you can
quickly create diverse, massive, and accurate datasets for training AI models.

We have provided a demo that lets you use TAO to train a model in the cloud and deploy the trained
model on Jetson using DeepStream. Please refer to the NVIDIA TAO section in the Appendix for
instructions on running the demo.

NVIDIA Isaac ROS GEMs are hardware-accelerated packages that make it easier for ROS developers
to build high-performance solutions on NVIDIA hardware. NVIDIA Isaac Sim, powered by Omniverse,
is a scalable robotics simulation application. It includes Replicator, a tool for generating
diverse synthetic datasets for training perception models. Isaac Sim also powers photorealistic,
physically accurate virtual environments in which to develop, test, and manage AI-based robots.

NVIDIA RIVA is an SDK for building GPU-accelerated conversational AI applications. RIVA includes
state-of-the-art pre-trained models for Automatic Speech Recognition (ASR) and Text-To-Speech
(TTS). These pre-trained models are highly accurate and can be easily customized with the TAO
Toolkit to improve accuracy for specific domains, accents, languages, and use cases. NVIDIA RIVA
speech models are optimized for TensorRT to deliver high inference performance and low latency
on Jetson AGX Orin.

We have provided a RIVA ASR demo, a dictation application that showcases the performance of
Jetson AGX Orin and the accuracy of the pre-trained speech recognition neural networks. Please
refer to the NVIDIA RIVA section in the Appendix for more details and instructions on running
the demo.

DeepStream is an SDK for rapidly developing and deploying Vision AI applications and services.
Its hardware acceleration extends beyond inference: hardware-accelerated plugins speed up the AI
pipeline end to end, delivering state-of-the-art throughput. Developers can also bring their own
TensorFlow, PyTorch, or ONNX models and deploy them using DeepStream.
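
As an illustration of the bring-your-own-model path, the sketch below uses Python's standard
configparser module to write a minimal configuration file for the DeepStream inference plugin
(Gst-nvinfer), pointing it at an ONNX model. This is a rough starting point and not the
configuration shipped with the demos in this guide. The helper name build_nvinfer_config and the
file names model.onnx and labels.txt are hypothetical, and a real model usually needs further
settings (preprocessing, output parsing) that depend on the network.

```python
import configparser
import io


def build_nvinfer_config(onnx_path, labels_path, num_classes, batch_size=1, fp16=True):
    """Return the text of a minimal Gst-nvinfer config for an ONNX detector.

    Hypothetical helper for illustration; all paths are supplied by the caller.
    """
    config = configparser.ConfigParser()
    config["property"] = {
        "gpu-id": "0",
        # Path to the exported ONNX model (placeholder chosen by the caller).
        "onnx-file": onnx_path,
        # Text file with one class label per line.
        "labelfile-path": labels_path,
        "batch-size": str(batch_size),
        # Precision mode: 0 = FP32, 1 = INT8, 2 = FP16.
        "network-mode": "2" if fp16 else "0",
        "num-detected-classes": str(num_classes),
        "gie-unique-id": "1",
    }
    buffer = io.StringIO()
    # Write "key=value" pairs without spaces around the delimiter.
    config.write(buffer, space_around_delimiters=False)
    return buffer.getvalue()


if __name__ == "__main__":
    # Print the config text; save it to a file and reference that file
    # from the nvinfer element of your DeepStream pipeline.
    print(build_nvinfer_config("model.onnx", "labels.txt", num_classes=3))
```

Generating the file from code keeps the model path, precision, and class count in one place, which
can be convenient when comparing several exported models on the same pipeline.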