Elastic Inference PNG and SVG Icon
Amazon Elastic Inference allows you to attach low-cost GPU-powered inference acceleration to Amazon EC2 and SageMaker instances.
Last Modified: August 29, 2025
Sizes: 16px, 32px, 48px, 64px
Details
Key Features
- Attaches GPU-powered inference acceleration to Amazon EC2 and SageMaker instances.
- Reduces ML inference costs by letting you provision only as much GPU acceleration as a workload needs.
- Supports TensorFlow, Apache MXNet, and PyTorch.
- Bills only for the accelerator hours used.
Common Use Cases
- Reducing inference costs by attaching a fraction of a GPU instead of provisioning a full GPU instance
- Accelerating deep learning model predictions
- Optimizing real-time AI workloads
Explore More Icons
Bottlerocket
Bottlerocket is a Linux-based open-source operating system purpose-built by AWS for running containers securely and efficiently.
Interactive Video Service
Amazon Interactive Video Service (IVS) is a managed live streaming solution that makes it easy to build interactive video experiences.
GuardDuty
Amazon GuardDuty is a threat detection service that continuously monitors for malicious or unauthorized behavior to help protect your AWS accounts and workloads.
Health Dashboard
AWS Health Dashboard provides alerts and remediation guidance for AWS events that may impact your environment.
AWS STS
AWS Security Token Service (STS) enables you to request temporary, limited-privilege credentials for AWS Identity and Access Management (IAM) users or federated users.
License Manager
AWS License Manager helps you manage software licenses from vendors like Microsoft, SAP, and Oracle on AWS and on-premises.
EKS Anywhere
Amazon EKS Anywhere lets you create and operate Kubernetes clusters on your own infrastructure using the same tooling as Amazon EKS in the cloud.
Key Management Service
AWS Key Management Service (KMS) enables you to create and control cryptographic keys used to encrypt your data with integrated AWS service support.
Translate
Amazon Translate is a neural machine translation service that delivers fast, high-quality, and customizable language translation.
Chime SDK
Amazon Chime SDK is a set of real-time communications components that developers can use to build audio, video, and screen sharing capabilities into their applications.
Data Exchange
AWS Data Exchange is a service that makes it easy to find, subscribe to, and use third-party data in the cloud securely and efficiently.
DynamoDB
Amazon DynamoDB is a fully managed NoSQL database service designed to handle large-scale, high-availability workloads with low latency.
Private 5G
AWS Private 5G enables you to set up and scale your own private cellular network with plug-and-play hardware and software.
Migration Hub
AWS Migration Hub provides a central place to track the progress of application migrations across multiple AWS and partner tools.
Command Line Interface
AWS Command Line Interface (CLI) is a tool that enables you to manage AWS services and resources through commands in your terminal.
Private Certificate Authority
AWS Private Certificate Authority (CA) is a managed private CA service that helps you issue and manage private SSL/TLS certificates for internal applications.
Deep Learning Containers
AWS Deep Learning Containers are Docker images pre-installed with deep learning frameworks for training and inference on Amazon ECS and EKS.
Data Transfer Terminal
AWS Data Transfer Terminal is a secure physical location where you can bring your own storage devices and upload data to AWS over a high-throughput network connection.
Migration Evaluator
AWS Migration Evaluator (formerly TSO Logic) helps build a business case for cloud migration by analyzing on-premises resource utilization and cost projections.
Organizational Unit
An Organizational Unit in AWS Organizations is a container for organizing AWS accounts for policy-based management.
S3 on Outposts
Amazon S3 on Outposts brings object storage to on-premises environments using AWS Outposts, enabling data residency and low-latency workloads.
Neuron
AWS Neuron is a software development kit (SDK) that enables running high-performance ML models on AWS Inferentia- and Trainium-based instances.
Elastic Beanstalk
AWS Elastic Beanstalk is an easy-to-use service for deploying and scaling web applications and services using familiar programming languages and servers.
Kinesis Video Streams
Amazon Kinesis Video Streams is a service for securely ingesting, storing, and processing video streams for analytics and machine learning.