Elastic Inference PNG and SVG Icon
Amazon Elastic Inference allows you to attach low-cost GPU-powered inference acceleration to Amazon EC2 and SageMaker instances.
Last Modified: August 29, 2025
16px
32px
48px
64px
Details
Key Features
- Attach GPU-powered inference acceleration to EC2 and SageMaker.
- Reduces ML inference costs by scaling GPU capacity as needed.
- Supports TensorFlow, Apache MXNet, and PyTorch.
- Pay only for the acceleration hours used.
Common Use Cases
- Reducing inference costs by attaching partial GPUs
- Accelerating deep learning model predictions
- Optimizing real-time AI workloads
Explore More Icons
Augmented AI A2I
Amazon Augmented AI (A2I) enables human reviews for ML predictions to improve accuracy in critical workflows like document processing and moderation.
File System
File System represents AWS-managed or integrated file storage solutions like EFS, FSx, and on-premises gateways for structured file access.
EMR
Amazon EMR (Elastic MapReduce) is a cloud big data platform for processing massive amounts of data using open-source tools like Apache Spark, Hive, and Hadoop.
Keyspaces
Amazon Keyspaces is a scalable, highly available, and managed Apache Cassandra-compatible database service.
Timestream
Amazon Timestream is a fast, scalable, serverless time series database service for IoT and operational applications.
Parallel Computing Service
AWS Parallel Computing Service enables large-scale parallel processing for scientific, engineering, and analytics workloads using EC2, Batch, or HPC tools.
Redshift
Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud that enables fast querying and analytics across vast datasets.
HealthImaging
Amazon HealthImaging is a service that stores, transforms, and analyzes medical imaging data at scale using cloud-native tools and standards.
PostgresSQL Instance
PostgreSQL Instance in Amazon RDS provides a scalable and managed PostgreSQL database with high availability and performance.
Route 53
Amazon Route 53 is a scalable and highly available Domain Name System (DNS) web service for domain registration and traffic routing.
Elastic Cache for Memcached
Elastic Cache for Memcached is a managed, in-memory key-value store service by AWS for deploying Memcached clusters in the cloud.
Kinesis Video Streams
Amazon Kinesis Video Streams is a service for securely ingesting, storing, and processing video streams for analytics and machine learning.
Detective
Amazon Detective helps you investigate potential security issues by analyzing and visualizing security data from multiple AWS sources like GuardDuty and CloudTrail.
Kinesis Data Streams
Amazon Kinesis Data Streams is a service for building real-time data streaming applications that can continuously ingest and process large volumes of data.
EKS Distro
Amazon EKS Distro (EKS-D) is the open-source distribution of the same Kubernetes components used by Amazon EKS, enabling consistent cluster operations on any infrastructure.
Marketplace Dark
AWS Marketplace is a digital catalog that makes it easy to find, test, buy, and deploy third-party software that runs on AWS.
HTTP Notification
AWS HTTP Notification typically refers to using Amazon SNS or EventBridge to send notifications via HTTP endpoints in real-time.
Forecast
Amazon Forecast is a fully managed service that uses machine learning to generate accurate time series forecasts based on historical data.
Migration Hub
AWS Migration Hub provides a central place to track the progress of application migrations across multiple AWS and partner tools.
CodePipeline
AWS CodePipeline is a fully managed continuous delivery service that automates the build, test, and deployment phases of your release process.
User Notifications
AWS User Notifications is a service that lets you centrally manage and deliver alerts and notifications from AWS services.
Lake Formation
AWS Lake Formation is a service that simplifies setting up a secure data lake by automating data ingestion, cleaning, cataloging, and access control.
Deep Learning Containers
AWS Deep Learning Containers are Docker images pre-installed with deep learning frameworks for training and inference on Amazon ECS and EKS.
Elastic Transcoder
Amazon Elastic Transcoder is a media transcoding service in the cloud designed to convert media files into formats required by playback devices.