EMR Engine PNG and SVG Icon
Amazon EMR Engine is the processing engine component within Amazon EMR that enables scalable, distributed big data processing using frameworks like Spark and Hadoop.
Last Modified: August 29, 2025
16px
32px
48px
64px
Details
Key Features
- Core compute and processing layer of Amazon EMR.
- Optimized for large-scale distributed data processing.
- Integrates with AWS storage and analytics services.
- Supports high-performance workloads.
Common Use Cases
- Optimizing Spark and Hive workloads for faster execution
- Managing resource allocation for big data jobs
- Providing scalable compute for analytics pipelines
Explore More Icons
Application Auto Scaling
AWS Application Auto Scaling automatically adjusts the capacity of your scalable resources to maintain performance and cost-effectiveness.
CodeBuild
AWS CodeBuild is a fully managed continuous integration service that compiles source code, runs tests, and produces deployable artifacts.
EC2
Amazon EC2 (Elastic Compute Cloud) is a web service that provides secure, resizable compute capacity in the cloud for virtually any workload.
Infrastructure Composer
AWS Infrastructure Composer is a visual tool that helps developers create and deploy infrastructure using AWS CloudFormation templates more easily.
Private 5G
AWS Private 5G enables you to set up and scale your own private cellular network with plug-and-play hardware and software.
AWS Glue Data Catalog
AWS Glue Data Catalog is a centralized metadata repository that stores table definitions, job metadata, and other control information to enable data discovery and ETL operations across AWS services.
Neptune
Amazon Neptune is a fast, reliable, and fully managed graph database service for building applications with highly connected data.
Security Lake
Amazon Security Lake centralizes your security data from AWS and other sources into a purpose-built data lake to facilitate security analytics and investigations.
Crawler
AWS Glue Crawler is a component that automatically scans data sources, infers schemas, and creates metadata tables in the AWS Glue Data Catalog.
Cloud Extensions ROS
AWS Cloud Extensions for ROS (Robot Operating System) are tools that enable robots to communicate with AWS cloud services for data processing and analytics.
Redshift
Amazon Redshift is a fully managed, petabyte-scale data warehouse service in the cloud that enables fast querying and analytics across vast datasets.
EventBridge
Amazon EventBridge is a serverless event bus service that enables you to connect applications using data from your own apps, SaaS apps, and AWS services.
Migration Evaluator
AWS Migration Evaluator (formerly TSO Logic) helps build a business case for cloud migration by analyzing on-premises resource utilization and cost projections.
Development Environment
AWS RoboMaker Development Environment provides a cloud-based IDE for developing, testing, and simulating robotics applications using ROS.
Elemental MediaPackage
AWS Elemental MediaPackage prepares and protects video for delivery over the internet to connected devices.
Athena
Amazon Athena is an interactive query service that lets you use standard SQL to analyze data directly in Amazon S3 without the need for infrastructure setup.
Verified Permissions
Amazon Verified Permissions is a fine-grained authorization service for applications that need flexible and scalable access control based on policies.
Firewall Manager
AWS Firewall Manager is a security management service that makes it easier to centrally configure and manage firewall rules across multiple AWS accounts and resources.
Managed Service for Prometheus
Amazon Managed Service for Prometheus is a fully managed, scalable, and secure monitoring service for container metrics using Prometheus.
Personalize
Amazon Personalize is a real-time personalization and recommendation service powered by the same technology used at Amazon.com.
Elemental Conductor
AWS Elemental Conductor is software for managing multiple AWS Elemental Live encoders from a central interface.
EKS Anywhere
Amazon EKS Anywhere lets you create and operate Kubernetes clusters on your own infrastructure using the same tooling as Amazon EKS in the cloud.
TensorFlow on AWS
TensorFlow on AWS offers optimized tools and infrastructure for training and deploying TensorFlow models across AWS services.
Service Catalog
AWS Service Catalog allows organizations to create and manage approved catalogs of IT services for AWS use.