Elastic Inference PNG and SVG Icon
Amazon Elastic Inference allows you to attach low-cost GPU-powered inference acceleration to Amazon EC2 and SageMaker instances.
Last Modified: August 29, 2025

16px
32px
48px
64px
Details
Key Features
- Attach GPU-powered inference acceleration to EC2 and SageMaker.
- Reduces ML inference costs by scaling GPU capacity as needed.
- Supports TensorFlow, Apache MXNet, and PyTorch.
- Pay only for the acceleration hours used.
Common Use Cases
- Reducing inference costs by attaching partial GPUs
- Accelerating deep learning model predictions
- Optimizing real-time AI workloads
Explore More Icons
Maintenance Windows
AWS Systems Manager Maintenance Windows lets you define a schedule for when to perform administrative tasks on your instances.
Artifact
AWS Artifact is a portal for on-demand access to AWS compliance reports, agreements, and certifications, helping customers manage audit and compliance requirements.
Organizations
AWS Organizations lets you centrally manage and govern multiple AWS accounts across your environment.
Config
AWS Config enables you to assess, audit, and evaluate configurations of your AWS resources for compliance and security.
Patch Manager
AWS Systems Manager Patch Manager automates the process of patching managed instances with security-related updates.
Supply Chain
AWS Supply Chain is a cloud-native application that provides real-time visibility, insights, and analytics to help manage and optimize global supply chains.
HealthImaging
Amazon HealthImaging is a service that stores, transforms, and analyzes medical imaging data at scale using cloud-native tools and standards.
Verified Permissions
Amazon Verified Permissions is a fine-grained authorization service for applications that need flexible and scalable access control based on policies.
Outposts family
AWS Outposts family consists of fully managed solutions that extend AWS infrastructure, services, and tools to on-premises locations for a hybrid cloud experience.
PrivateLink
AWS PrivateLink provides secure, private connectivity between VPCs, AWS services, and on-premises networks without exposing traffic to the public internet.
Thinkbox XMesh
Thinkbox XMesh is a geometry caching system that optimizes complex animated geometry workflows in 3D applications.
Compute Optimizer
AWS Compute Optimizer uses machine learning to recommend optimal AWS compute resources for your workloads to reduce cost and improve performance.
Keyspaces
Amazon Keyspaces is a scalable, highly available, and managed Apache Cassandra-compatible database service.
Elemental MediaTailor
AWS Elemental MediaTailor enables server-side ad insertion for video content, allowing personalized ad experiences across platforms.
Elemental Appliances Software
AWS Elemental Appliances and Software offer on-premises solutions for live and on-demand video workflows with seamless AWS integration.
ECS Anywhere
Amazon ECS Anywhere extends Amazon Elastic Container Service (ECS) to manage and run container workloads on customer-managed infrastructure, including on-premises servers.
Permissions
AWS Permissions are managed through IAM to control which actions users and roles can perform on which resources.
HealthLake
Amazon HealthLake is a HIPAA-eligible service that stores, transforms, and analyzes health data in the FHIR format for advanced analytics and ML.
Polly
Amazon Polly is a text-to-speech (TTS) service that uses deep learning to synthesize lifelike human speech in multiple languages.
Chime SDK
Amazon Chime SDK is a set of real-time communications components that developers can use to build audio, video, and screen sharing capabilities into their applications.
Simple Storage Service
Amazon Simple Storage Service (S3) is an object storage service that offers industry-leading scalability, data availability, security, and performance.
Glue DataBrew
AWS Glue DataBrew is a visual data preparation tool that allows data analysts and scientists to clean and normalize data without writing code.
RoboMaker
AWS RoboMaker is a service for developing, simulating, and deploying robotic applications using Robot Operating System (ROS) with cloud scalability.
CodeCatalyst
AWS CodeCatalyst is a unified software development service that helps you plan, develop, build, test, and deploy applications on AWS.