Elastic Inference PNG and SVG Icon
Amazon Elastic Inference allows you to attach low-cost GPU-powered inference acceleration to Amazon EC2 and SageMaker instances.
Last Modified: August 10, 2025

16px
32px
48px
64px
Details
Key Features
- Attach GPU-powered inference acceleration to EC2 and SageMaker.
- Reduces ML inference costs by scaling GPU capacity as needed.
- Supports TensorFlow, Apache MXNet, and PyTorch.
- Pay only for the acceleration hours used.
Common Use Cases
- Reducing inference costs by attaching partial GPUs
- Accelerating deep learning model predictions
- Optimizing real-time AI workloads
Explore More Icons
Signer
AWS Signer is a fully managed code-signing service to help ensure the integrity and trustworthiness of your code by digitally signing it before deployment.
PostgresSQL Instance
PostgreSQL Instance in Amazon RDS provides a scalable and managed PostgreSQL database with high availability and performance.
AppFlow
AWS AppFlow is a fully managed integration service that enables you to securely transfer data between SaaS applications and AWS services without writing code.
Interactive Video Service
Amazon Interactive Video Service (IVS) is a managed live streaming solution that makes it easy to build interactive video experiences.
MemoryDB
Amazon MemoryDB for Redis is a Redis-compatible, in-memory database service designed for ultra-fast performance and durability.
Fault Injection Service
AWS Fault Injection Service is a fully managed chaos engineering service that helps you improve application resilience by injecting failures into your AWS workloads.
Deadline Cloud
AWS Deadline Cloud is a fully managed service for visual content rendering in the cloud, providing scalable render farm infrastructure for creative teams.
Comprehend Medical
Amazon Comprehend Medical is a HIPAA-eligible NLP service that extracts medical information from unstructured clinical text.
MariaDB Instance
MariaDB Instance in Amazon RDS allows you to run managed MariaDB databases with automated backups, patching, and scaling.
Deep Learning AMIs
AWS Deep Learning AMIs are pre-configured Amazon Machine Images optimized for ML frameworks such as TensorFlow, PyTorch, and MXNet.
EC2 Auto Scaling
Amazon EC2 Auto Scaling helps you automatically add or remove EC2 instances based on demand to maintain application availability and optimize costs.
EventBridge
Amazon EventBridge is a serverless event bus service that enables you to connect applications using data from your own apps, SaaS apps, and AWS services.
Health Dashboard
AWS Health Dashboard provides alerts and remediation guidance for AWS events that may impact your environment.
Nova
Amazon Nova refers to internal AI infrastructure or services (if announced); details may vary as it's not yet publicly defined.
Device Farm
AWS Device Farm is an app testing service that lets you test and interact with your Android, iOS, and web apps on real, physical devices hosted by AWS.
Well Architected Tool
The AWS Well-Architected Tool helps you review and improve your cloud workloads using AWS's architectural best practices.
Fraud Detector
Amazon Fraud Detector is a service that uses machine learning to identify potentially fraudulent online activities in real time.
Lookout for Equipment
Amazon Lookout for Equipment uses machine learning to detect abnormal equipment behavior and prevent potential failures.
SageMaker Ground Truth
Amazon SageMaker Ground Truth is a data labeling service that uses human and machine learning techniques to generate high-quality labeled datasets.
CloudHSM
AWS CloudHSM provides hardware security modules in the AWS Cloud to generate and use your own encryption keys with FIPS 140-2 Level 3 compliance.
Mainframe Modernization
AWS Mainframe Modernization provides tools and infrastructure to migrate, run, and modernize mainframe applications on AWS.
Cost and Usage Report
AWS Cost and Usage Report (CUR) provides the most detailed information available about your AWS costs and usage, exported to Amazon S3 for advanced analysis.
Elemental Appliances Software
AWS Elemental Appliances and Software offer on-premises solutions for live and on-demand video workflows with seamless AWS integration.
FSx for NetApp ONTAP
Amazon FSx for NetApp ONTAP offers fully managed NetApp file systems on AWS with familiar features like snapshots, clones, and data tiering.