Elastic Inference PNG and SVG Icon
Amazon Elastic Inference allows you to attach low-cost GPU-powered inference acceleration to Amazon EC2 and SageMaker instances.
Last Modified: August 29, 2025
16px
32px
48px
64px
Details
Key Features
- Attach GPU-powered inference acceleration to EC2 and SageMaker.
- Reduces ML inference costs by scaling GPU capacity as needed.
- Supports TensorFlow, Apache MXNet, and PyTorch.
- Pay only for the acceleration hours used.
Common Use Cases
- Reducing inference costs by attaching partial GPUs
- Accelerating deep learning model predictions
- Optimizing real-time AI workloads
Explore More Icons
Simple Storage Service
Amazon Simple Storage Service (S3) is an object storage service that offers industry-leading scalability, data availability, security, and performance.
WorkSpaces Family
Amazon WorkSpaces Family includes fully managed, secure desktop-as-a-service (DaaS) solutions that allow users to access cloud desktops from anywhere.
Thinkbox Deadline
Thinkbox Deadline is a render management tool that provides flexible and scalable management of render farms for creative studios.
Snapshot
Snapshot refers to point-in-time backups of EBS volumes or file systems, allowing easy restore and replication of AWS storage resources.
Service Management Connector
AWS Service Management Connector integrates AWS services like Service Catalog with third-party ITSM tools such as ServiceNow or Jira Service Management.
OpenSearch Service
Amazon OpenSearch Service is a fully managed search and analytics service for log analytics, real-time application monitoring, and website search.
Redshift
Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud that enables fast querying and analytics across vast datasets.
Add on
AWS Add-ons are enhancements or extensions that can be applied to AWS services, often in managed environments like EKS, to provide additional functionality.
Elastic Cache for Memcached
Elastic Cache for Memcached is a managed, in-memory key-value store service by AWS for deploying Memcached clusters in the cloud.
ECS Anywhere
Amazon ECS Anywhere extends Amazon Elastic Container Service (ECS) to manage and run container workloads on customer-managed infrastructure, including on-premises servers.
S3 on Outposts
Amazon S3 on Outposts brings object storage to on-premises environments using AWS Outposts, enabling data residency and low-latency workloads.
CodeCommit
AWS CodeCommit is a secure, scalable, and managed source control service that hosts private Git repositories.
App Mesh
AWS App Mesh is a service mesh that provides application-level networking to make it easy to monitor and control microservices running on AWS.
Service Catalog
AWS Service Catalog allows organizations to create and manage approved catalogs of IT services for AWS use.
Device Farm
AWS Device Farm is an app testing service that lets you test and interact with your Android, iOS, and web apps on real, physical devices hosted by AWS.
GuardDuty
Amazon GuardDuty is a threat detection service that continuously monitors for malicious or unauthorized behavior to help protect your AWS accounts and workloads.
Step Functions
AWS Step Functions is a serverless orchestration service that helps you build and coordinate distributed applications using visual workflows.
Deep Learning Containers
AWS Deep Learning Containers are Docker images pre-installed with deep learning frameworks for training and inference on Amazon ECS and EKS.
WorkDocs
Amazon WorkDocs is a secure enterprise document storage and collaboration service that enables file sharing, feedback, and editing in real-time.
Fraud Detector
Amazon Fraud Detector is a service that uses machine learning to identify potentially fraudulent online activities in real time.
AppFlow
AWS AppFlow is a fully managed integration service that enables you to securely transfer data between SaaS applications and AWS services without writing code.
Site to Site VPN
AWS Site-to-Site VPN connects your on-premises network to AWS over an IPsec VPN tunnel for secure communication.
IoT SiteWise
AWS IoT SiteWise is a managed service to collect, organize, and analyze industrial equipment data at scale.
Q
Amazon Q is a generative AI assistant designed to help employees and developers get answers and complete tasks using internal and external data.