AWS Glue Data Catalog PNG and SVG Icon
AWS Glue Data Catalog is a centralized metadata repository that stores table definitions, job metadata, and other control information to enable data discovery and ETL operations across AWS services.
Last Modified: August 29, 2025

Details
Key Features
- Centralized metadata repository for all AWS Glue data assets.
- Supports schema versioning and evolution.
- Enables quick discovery and search of datasets.
- Integrates with Athena, Redshift Spectrum, and EMR.
Common Use Cases
- Centralizing metadata for all data sources in a single repository.
- Enabling schema discovery and data classification for analytics.
- Providing table definitions that query engines use to plan and optimize queries.
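As a rough illustration of the schema discovery use case above, the following Python sketch lists every table in one catalog database together with its column names and types. It assumes a boto3 Glue client is passed in; the function name `list_table_columns` and the database name in the usage note are hypothetical and chosen only for this example.

```python
from typing import Dict, List, Tuple


def list_table_columns(glue_client, database_name: str) -> Dict[str, List[Tuple[str, str]]]:
    """Return {table_name: [(column_name, column_type), ...]} for one database.

    `glue_client` is assumed to behave like boto3.client("glue").
    """
    # get_tables is paginated, so walk every page rather than a single call.
    paginator = glue_client.get_paginator("get_tables")
    schemas: Dict[str, List[Tuple[str, str]]] = {}
    for page in paginator.paginate(DatabaseName=database_name):
        for table in page["TableList"]:
            # Some tables carry no StorageDescriptor; treat them as having no columns.
            columns = table.get("StorageDescriptor", {}).get("Columns", [])
            schemas[table["Name"]] = [(c["Name"], c.get("Type", "")) for c in columns]
    return schemas
```

With AWS credentials configured, a call might look like `list_table_columns(boto3.client("glue"), "sales_db")`, where `sales_db` is a placeholder database name. Passing the client in as an argument keeps the function easy to exercise with a stub.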
Explore More Icons
IoT Core
AWS IoT Core allows connected devices to securely interact with cloud applications and other devices, enabling scalable IoT solutions.
VPC Lattice
Amazon VPC Lattice helps you securely connect, monitor, and manage service-to-service communication in a consistent way.
Red Hat OpenShift Service on AWS
Red Hat OpenShift Service on AWS (ROSA) is a fully managed service that enables you to run Red Hat OpenShift, a Kubernetes-based container platform, directly on AWS.
Serverless Application Repository
AWS Serverless Application Repository is a managed repository for discovering, deploying, and publishing serverless applications built with Lambda and other AWS services.
RoboMaker
AWS RoboMaker is a service for developing, simulating, and deploying robotic applications using Robot Operating System (ROS) with cloud scalability.
Network Firewall
AWS Network Firewall is a managed service that deploys essential network protections for your VPC, including deep packet inspection and intrusion prevention.
FSx for NetApp ONTAP
Amazon FSx for NetApp ONTAP offers fully managed NetApp file systems on AWS with familiar features like snapshots, clones, and data tiering.
Hosted Zone
A Hosted Zone in Amazon Route 53 is a container for records that define how traffic is routed for a domain and its subdomains.
Managed Workflows for Apache Airflow
Amazon Managed Workflows for Apache Airflow (MWAA) is a managed orchestration service for scheduling and monitoring data workflows using Apache Airflow.
re:Post Private
AWS re:Post Private offers a secure, private version of the re:Post community within an organization, enabling internal knowledge sharing and collaboration around AWS topics.
Polly
Amazon Polly is a text-to-speech (TTS) service that uses deep learning to synthesize lifelike human speech in multiple languages.
AWS Dark
Amazon Web Services (AWS) is a comprehensive cloud computing platform offering over 200 fully featured services including computing, storage, databases, machine learning, analytics, and more to help businesses scale and innovate faster.
Well-Architected Tool
The AWS Well-Architected Tool helps you review and improve your cloud workloads using AWS's architectural best practices.
Automation
AWS Systems Manager Automation simplifies common maintenance and deployment tasks using predefined or custom workflows.
Managed Services
AWS Managed Services (AMS) helps enterprises operate their AWS infrastructure by providing ongoing management, monitoring, patching, and operational support.
App Runner
AWS App Runner is a fully managed service that makes it easy to build, deploy, and run containerized web applications and APIs at scale without managing infrastructure.
Bedrock
Amazon Bedrock is a fully managed service that allows you to build generative AI applications using foundation models from leading AI providers via an API.
Mainframe Modernization
AWS Mainframe Modernization provides tools and infrastructure to migrate, run, and modernize mainframe applications on AWS.
Elastic Container Registry
Amazon Elastic Container Registry (ECR) is a fully managed Docker container registry that makes it easy to store, manage, and deploy container images.
HealthScribe
Amazon HealthScribe is a service that uses speech-to-text and generative AI to help clinicians automatically generate clinical notes from patient visits.
ElastiCache
Amazon ElastiCache is a fully managed in-memory caching service compatible with Valkey, Redis OSS, and Memcached that improves application performance.
Pinpoint
Amazon Pinpoint is a flexible and scalable outbound and inbound marketing communications service for sending targeted messages to customers across multiple channels.
Application Discovery Service
AWS Application Discovery Service helps you plan migration projects by collecting usage and configuration data from your on-premises servers.
RDS
Amazon RDS (Relational Database Service) simplifies the setup, operation, and scaling of relational databases in the cloud.