AWS – Amazon EC2 Inf1 instances – New features, improved performance and lower prices
Amazon EC2 Inf1 instances and AWS Neuron now support YOLOv5 and ResNext deep learning models as well as the latest open-source Hugging Face Transformers. We have also optimized the Neuron compiler to enhance performance and you can now achieve an out-of-the box 12X higher throughput than comparable GPU-based instances for pre-trained BERT base models. These enhancements enable you to effectively meet your high-performance inference requirements and deploy state of the art deep learning models at low cost.
Read More for the details.