Elastic Inference PNG and SVG Icon
Amazon Elastic Inference allows you to attach low-cost GPU-powered inference acceleration to Amazon EC2 and SageMaker instances.
Last Modified: August 29, 2025
16px
32px
48px
64px
Details
Key Features
- Attach GPU-powered inference acceleration to EC2 and SageMaker.
- Reduces ML inference costs by scaling GPU capacity as needed.
- Supports TensorFlow, Apache MXNet, and PyTorch.
- Pay only for the acceleration hours used.
Common Use Cases
- Reducing inference costs by attaching partial GPUs
- Accelerating deep learning model predictions
- Optimizing real-time AI workloads
Explore More Icons
Volume
Volume refers to block storage resources like EBS volumes that can be attached to EC2 instances for durable, low-latency storage.
Corretto
Amazon Corretto is a no-cost, multiplatform, production-ready distribution of the Open Java Development Kit (OpenJDK).
EKS Cloud
Amazon EKS Cloud is the standard managed Kubernetes service provided by AWS, making it easy to run Kubernetes without managing control plane infrastructure.
AWS Email refers to solutions like Amazon SES that help developers send marketing, notification, and transactional emails using scalable and cost-effective infrastructure.
Amplify
AWS Amplify is a set of tools and services that helps developers build scalable, full-stack web and mobile applications on AWS.
Open 3D Engine
Open 3D Engine (O3DE) is an open-source, real-time 3D engine for building high-fidelity games and simulations, originally developed by AWS.
Red Hat OpenShift Service on AWS
Red Hat OpenShift Service on AWS (ROSA) is a fully managed service that enables you to run Red Hat OpenShift, a Kubernetes-based container platform, directly on AWS.
VPN Connection
A VPN Connection in AWS provides encrypted connectivity between your data center or device and your AWS VPC.
IoT Device Management
AWS IoT Device Management allows you to onboard, organize, monitor, and remotely manage IoT devices at scale throughout their lifecycle.
Amazon Aurora
Amazon Aurora is a MySQL- and PostgreSQL-compatible relational database built for the cloud, offering high performance and availability at a global scale.
API Gateway
Amazon API Gateway is a fully managed service that makes it easy to create, publish, maintain, monitor, and secure APIs at any scale.
AppFabric
AWS AppFabric is a service that connects SaaS applications across an organization, streamlining security, monitoring, and operational insights without writing custom integrations.
rePost Private
AWS re:Post Private offers a secure, private version of the re:Post community within an organization, enabling internal knowledge sharing and collaboration around AWS topics.
rePost
AWS re:Post is a cloud knowledge community where users can ask questions and get answers from AWS experts and the community to accelerate learning and problem-solving.
Hosted Zone
A Hosted Zone in Amazon Route 53 is a container for records that define how traffic is routed for a domain and its subdomains.
CloudWatch
Amazon CloudWatch monitors AWS resources and applications, providing metrics, logs, and alarms for visibility and automation.
File Cache
AWS File Cache is a high-speed cache on AWS for frequently accessed file data stored in disparate sources such as on-premises file systems or S3.
Amazon Rekognition Video
Amazon Rekognition Video is a service that detects and tracks objects, faces, and activities in live or stored video streams.
Comprehend
Amazon Comprehend is a natural language processing (NLP) service that extracts insights like sentiment, entities, and key phrases from text.
EventBridge
Amazon EventBridge is a serverless event bus service that enables you to connect applications using data from your own apps, SaaS apps, and AWS services.
Elastic Load Balancing
Elastic Load Balancing automatically distributes incoming traffic across multiple targets to ensure application scalability and fault tolerance.
Email Notification
AWS Email Notification typically refers to services like Amazon SES or SNS used for sending email alerts, confirmations, and other automated notifications.
HDFC Cluster
Amazon EMR on HDFC Cluster refers to the use of Hadoop Distributed File System (HDFS) within Amazon EMR for distributed data storage and processing.
Detective
Amazon Detective helps you investigate potential security issues by analyzing and visualizing security data from multiple AWS sources like GuardDuty and CloudTrail.