NVIDIA Triton Inference Server Description

NVIDIA Triton™ Inference Server delivers fast, scalable, production-ready AI inference. Triton is open-source inference-serving software that streamlines AI inference. It lets teams deploy trained AI models from any framework (TensorFlow, NVIDIA TensorRT®, PyTorch, ONNX, XGBoost, Python, custom, and more) on any GPU- or CPU-based infrastructure (cloud, data center, or edge). Triton can run multiple models concurrently on GPUs to maximize throughput, and it also supports inference on x86 and ARM CPUs. Developers can use Triton to deliver high-performance inference. It integrates with Kubernetes for orchestration and scaling, exports Prometheus metrics, and supports live model updates. Triton helps standardize model deployment in production.
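
To make the deployment model above more concrete, here is a minimal sketch of how a client might send an inference request to a running Triton server over its HTTP/REST interface, using only the Python standard library. It assumes a server reachable at localhost on Triton's default HTTP port (8000). The model name "my_model", the tensor name "INPUT0", and the helper names build_infer_request and infer are hypothetical and chosen only for illustration; the actual input names, shapes, and datatypes depend on the deployed model's configuration.

```python
import json
import urllib.request


def build_infer_request(model_name, input_name, shape, datatype, data,
                        base_url="http://localhost:8000"):
    """Build the URL and JSON body for a v2-protocol inference request.

    All argument values are supplied by the caller; nothing here is
    specific to any real deployed model.
    """
    url = f"{base_url}/v2/models/{model_name}/infer"
    payload = {
        "inputs": [
            {
                "name": input_name,       # must match the model's input tensor name
                "shape": list(shape),     # e.g. [batch, features]
                "datatype": datatype,     # e.g. "FP32", "INT64"
                "data": list(data),       # flattened tensor contents
            }
        ]
    }
    return url, json.dumps(payload).encode("utf-8")


def infer(model_name, input_name, shape, datatype, data,
          base_url="http://localhost:8000", timeout=5.0):
    """POST the request and return the decoded JSON response.

    Requires a running server; raises urllib.error.URLError otherwise.
    """
    url, body = build_infer_request(
        model_name, input_name, shape, datatype, data, base_url
    )
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())


# Building the request needs no server, so it can be inspected offline.
# "my_model" and "INPUT0" are hypothetical names.
url, body = build_infer_request(
    "my_model", "INPUT0", [1, 4], "FP32", [1.0, 2.0, 3.0, 4.0]
)
```

Keeping request construction separate from the network call lets the payload be checked without a live server. With a server running and a matching model loaded, calling infer with the same arguments would return the model's output tensors as JSON.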

Pricing

Pricing Starts At:
Free
Free Version:
Yes

Company Details

Company:
NVIDIA
Headquarters:
United States
Website:
developer.nvidia.com/nvidia-triton-inference-server

Product Details

Platforms:
Windows, Mac, Linux
Type of Training:
Documentation, In Person, Videos
Customer Support:
Phone Support, Online

NVIDIA Triton Inference Server Features and Options

Artificial Intelligence Software

Chatbot
For Healthcare
For Sales
For eCommerce
Image Recognition
Machine Learning
Multi-Language
Natural Language Processing
Predictive Analytics
Process/Workflow Automation
Rules-Based Automation
Virtual Personal Assistant (VPA)

Machine Learning Software

Deep Learning
ML Algorithm Library
Model Training
Natural Language Processing (NLP)
Predictive Modeling
Statistical / Mathematical Tools
Templates
Visualization
