
TensorFlow is an open-source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) that flow between them. This flexible architecture lets you deploy computation to one or more CPUs or GPUs in a desktop, server, or mobile device.

TensorFlow was originally developed by researchers and engineers working on the Google Brain team within Google's Machine Intelligence research organization for the purposes of conducting machine learning and deep neural networks (DNNs) research. The system is general enough to be applicable in a wide variety of other domains as well.

For visualizing TensorFlow results, the Docker® image also contains TensorBoard. For example, you can view the training histories as well as what the model looks like.

For information about the optimizations and changes that have been made to TensorFlow, see the TensorFlow Deep Learning Frameworks Documentation.

The core of NVIDIA TensorRT is a C++ library that facilitates high-performance inference on NVIDIA graphics processing units (GPUs). TensorRT takes a trained network, which consists of a network definition and a set of trained parameters, and produces a highly optimized runtime engine that performs inference for that network.

TensorRT provides APIs in C++ and Python that let you express deep learning models via the Network Definition API or load a pre-defined model via the parsers, which allows TensorRT to optimize and run them on an NVIDIA GPU. TensorRT applies graph optimizations and layer fusion, among other optimizations, while also finding the fastest implementation of that model leveraging a diverse collection of highly optimized kernels. TensorRT also supplies a runtime that you can use to execute this network on all of NVIDIA's GPUs from the Kepler generation onwards.

TensorRT also includes high speed mixed precision and Tensor Core routines.

For information about the optimizations and changes that have been made to TensorRT, see the TensorRT Release Notes. For specific TensorRT product documentation, see TensorRT documentation.
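The data flow model described above for TensorFlow (operations as graph nodes, tensors flowing along the edges) can be illustrated with a small pure-Python sketch. This is not TensorFlow's API: the `Node` class, the `run` function, and the helper operations are hypothetical names invented for this illustration, and a "tensor" is simply a nested list.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

# Assumption: a "tensor" is a 2-D nested list; real frameworks use typed arrays.
Tensor = List[List[float]]


@dataclass
class Node:
    """One graph node: an operation plus the names of the nodes that feed it."""
    name: str
    op: Callable[..., Tensor]
    inputs: Tuple[str, ...] = ()  # incoming edges, identified by producer name


def constant(value: Tensor) -> Callable[[], Tensor]:
    """Build an operation with no inputs that always yields `value`."""
    return lambda: value


def matmul(a: Tensor, b: Tensor) -> Tensor:
    # zip(*b) iterates over the columns of b.
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def add(a: Tensor, b: Tensor) -> Tensor:
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def run(graph: Dict[str, Node], target: str,
        cache: Optional[Dict[str, Tensor]] = None) -> Tensor:
    """Evaluate `target` after evaluating every node its input edges point to."""
    if cache is None:
        cache = {}
    if target not in cache:
        node = graph[target]
        args = [run(graph, name, cache) for name in node.inputs]
        cache[target] = node.op(*args)
    return cache[target]


# Graph for y = x @ w + b: five nodes, four edges carrying tensors.
nodes = [
    Node("x", constant([[1.0, 2.0]])),
    Node("w", constant([[3.0], [4.0]])),
    Node("b", constant([[0.5]])),
    Node("xw", matmul, ("x", "w")),
    Node("y", add, ("xw", "b")),
]
graph = {n.name: n for n in nodes}

result = run(graph, "y")
print(result)  # [[11.5]]
```

Because the graph only records which operation consumes which tensor, the same description could in principle be handed to a different executor (for example one targeting a GPU) without changing the model definition, which is the flexibility the text refers to.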

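Layer fusion, one of the optimizations attributed to TensorRT above, can be illustrated in miniature. The sketch below is a toy in plain Python, not TensorRT code and not its actual fusion logic: it only shows the general idea that three consecutive elementwise layers (scale, bias, ReLU) can be computed in one pass over the data with identical results. All function names are hypothetical.

```python
from typing import List


def scale(xs: List[float], factor: float) -> List[float]:
    return [x * factor for x in xs]


def add_bias(xs: List[float], bias: float) -> List[float]:
    return [x + bias for x in xs]


def relu(xs: List[float]) -> List[float]:
    return [max(0.0, x) for x in xs]


def unfused(xs: List[float], factor: float, bias: float) -> List[float]:
    """Three separate layers: three passes and two intermediate lists."""
    return relu(add_bias(scale(xs, factor), bias))


def fused(xs: List[float], factor: float, bias: float) -> List[float]:
    """The same three layers combined: one pass and no intermediate lists."""
    return [max(0.0, x * factor + bias) for x in xs]


data = [-2.0, -0.5, 0.0, 1.5]
unfused_out = unfused(data, 2.0, 1.0)
fused_out = fused(data, 2.0, 1.0)
print(fused_out)  # [0.0, 0.0, 1.0, 4.0]
```

On a GPU the benefit of this kind of rewrite comes mainly from launching fewer kernels and avoiding round trips of intermediate results through memory; the toy above only demonstrates that the fused form computes the same values.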