Elastic Inference PNG and SVG Icon
Amazon Elastic Inference allows you to attach low-cost GPU-powered inference acceleration to Amazon EC2 and SageMaker instances.
Last Modified: August 29, 2025
16px
32px
48px
64px
Details
Key Features
- Attach GPU-powered inference acceleration to EC2 and SageMaker.
- Reduces ML inference costs by scaling GPU capacity as needed.
- Supports TensorFlow, Apache MXNet, and PyTorch.
- Pay only for the acceleration hours used.
Common Use Cases
- Reducing inference costs by attaching partial GPUs
- Accelerating deep learning model predictions
- Optimizing real-time AI workloads
Explore More Icons
EC2 Auto Scaling
Amazon EC2 Auto Scaling helps you automatically add or remove EC2 instances based on demand to maintain application availability and optimize costs.
Account
AWS Account refers to your uniquely identified entity used to access AWS services and manage resources securely.
Elemental MediaLive
AWS Elemental MediaLive is a broadcast-grade live video processing service that lets you create live outputs for streaming and broadcasting.
IoT SiteWise
AWS IoT SiteWise is a managed service to collect, organize, and analyze industrial equipment data at scale.
Security Incident Response
AWS Security Incident Response provides tools and guidance for detecting, responding to, and recovering from security incidents within AWS environments.
Client VPN
AWS Client VPN is a managed client-based VPN service that enables secure access to AWS resources and on-premises networks.
DeepRacer
AWS DeepRacer is an autonomous 1/18th scale race car powered by reinforcement learning that helps developers learn machine learning hands-on.
Deep Learning Containers
AWS Deep Learning Containers are Docker images pre-installed with deep learning frameworks for training and inference on Amazon ECS and EKS.
Cloud Map
AWS Cloud Map lets you register custom names for your application resources, allowing your applications to discover them using DNS or API calls.
Database Migration Service
AWS Database Migration Service (DMS) helps you migrate databases quickly and securely to AWS with minimal downtime.
Bottlerocket
Bottlerocket is a Linux-based open-source operating system purpose-built by AWS for running containers securely and efficiently.
WorkDocs SDK
Amazon WorkDocs SDK allows developers to integrate WorkDocs capabilities into their own applications, such as document management and user activity tracking.
Lake Formation
AWS Lake Formation is a service that simplifies setting up a secure data lake by automating data ingestion, cleaning, cataloging, and access control.
Key Management Service
AWS Key Management Service (KMS) enables you to create and control cryptographic keys used to encrypt your data with integrated AWS service support.
DataZone
AWS DataZone is a data management service that helps you catalog, share, govern, and access data across organizational boundaries in a secure and scalable way.
AppStream 2
Amazon AppStream 2.0 is a fully managed application streaming service that allows users to stream desktop applications securely from AWS to any device.
Template
AWS CloudFormation Templates are JSON or YAML files that define the resources and configurations to be provisioned in a stack.
Interactive Video Service
Amazon Interactive Video Service (IVS) is a managed live streaming solution that makes it easy to build interactive video experiences.
Proton
AWS Proton is a fully managed application delivery service that helps platform teams standardize and automate infrastructure and deployment for microservices.
Deep Learning AMIs
AWS Deep Learning AMIs are pre-configured Amazon Machine Images optimized for ML frameworks such as TensorFlow, PyTorch, and MXNet.
Location Service
Amazon Location Service helps developers add location-based features such as maps, tracking, and geofencing to applications without compromising data security.
Elastic Transcoder
Amazon Elastic Transcoder is a media transcoding service in the cloud designed to convert media files into formats required by playback devices.
WorkMail
Amazon WorkMail is a secure, managed business email and calendar service that supports existing desktop and mobile email clients.
DocumentDB
Amazon DocumentDB is a scalable, fully managed document database service that supports MongoDB workloads.