Amazon EC2 Inf1 Instances Description
Amazon EC2 Inf1 instances were designed to deliver high-performance, cost-effective machine-learning inference. Amazon EC2 Inf1 instances offer up to 2.3x higher throughput, and up to 70% less cost per inference compared with other Amazon EC2 instance. Inf1 instances are powered by up to 16 AWS inference accelerators, designed by AWS. They also feature Intel Xeon Scalable 2nd generation processors, and up to 100 Gbps of networking bandwidth, to support large-scale ML apps. These instances are perfect for deploying applications like search engines, recommendation system, computer vision and speech recognition, natural-language processing, personalization and fraud detection. Developers can deploy ML models to Inf1 instances by using the AWS Neuron SDK. This SDK integrates with popular ML Frameworks such as TensorFlow PyTorch and Apache MXNet.
Pricing
Integrations
Company Details
Product Details
Amazon EC2 Inf1 Instances Features and Options
Amazon EC2 Inf1 Instances Lists
Amazon EC2 Inf1 Instances User Reviews
Write a Review- Previous
- Next