Crawler PNG and SVG Icon
AWS Glue Crawler is a component that automatically scans data sources, infers schemas, and creates metadata tables in the AWS Glue Data Catalog.
Last Modified: August 29, 2025
16px
32px
48px
64px
Details
Key Features
- Automatically scans data sources to detect schema and metadata.
- Populates AWS Glue Data Catalog entries.
- Supports incremental crawls for efficiency.
- Integrates with Amazon S3, RDS, Redshift, and JDBC sources.
Common Use Cases
- Automatically discovering and cataloging new datasets in S3
- Updating schema changes in AWS Glue Data Catalog
- Classifying data by file type and structure for ETL jobs
Explore More Icons
Elastic Cache for Memcached
Elastic Cache for Memcached is a managed, in-memory key-value store service by AWS for deploying Memcached clusters in the cloud.
IAM Identity Center
AWS IAM Identity Center (formerly AWS SSO) allows centralized access management to AWS accounts and business applications using single sign-on capabilities.
IoT ExpressLink
AWS IoT ExpressLink provides easy and secure connectivity to AWS IoT Core through hardware modules preloaded with AWS firmware.
Glue
AWS Glue is a serverless data integration service that simplifies discovering, preparing, moving, and integrating data from various sources for analytics and ML.
PrivateLink
AWS PrivateLink provides secure, private connectivity between VPCs, AWS services, and on-premises networks without exposing traffic to the public internet.
Billing Conductor
AWS Billing Conductor is a customizable billing service that helps service providers and resellers group accounts and define pricing models for accurate, consolidated billing.
Interactive Video Service
Amazon Interactive Video Service (IVS) is a managed live streaming solution that makes it easy to build interactive video experiences.
Lookout for Vision
Amazon Lookout for Vision uses computer vision to detect defects in manufactured products based on images of the items.
Audit Manager
AWS Audit Manager helps you continuously audit your AWS usage to simplify risk assessment and compliance with regulations and industry standards.
Lookout for Metrics
Amazon Lookout for Metrics automatically detects and diagnoses anomalies in business and operational data using ML models.
Managed Services
AWS Managed Services (AMS) helps enterprises operate their AWS infrastructure by providing ongoing management, monitoring, patching, and operational support.
Pinpoint
Amazon Pinpoint is a flexible and scalable outbound and inbound marketing communications service for sending targeted messages to customers across multiple channels.
Classic Load Balancer
Classic Load Balancer is the legacy Elastic Load Balancing option for distributing incoming traffic across EC2 instances.
Elemental MediaConnect
AWS Elemental MediaConnect is a reliable, secure, and flexible transport service for live video in the cloud.
Bedrock
Amazon Bedrock is a fully managed service that allows you to build generative AI applications using foundation models from leading AI providers via an API.
HDFC Cluster
Amazon EMR on HDFC Cluster refers to the use of Hadoop Distributed File System (HDFS) within Amazon EMR for distributed data storage and processing.
Chatbot
AWS Chatbot is an interactive agent that helps you monitor and interact with AWS resources from chat platforms like Slack or Microsoft Teams.
EC2 Image Builder
EC2 Image Builder is a service that automates the creation, maintenance, and deployment of secure, up-to-date Amazon Machine Images (AMIs).
Payment Cryptography
AWS Payment Cryptography provides cryptographic services to support secure payment processing, including PIN translation and key management.
EKS Anywhere
Amazon EKS Anywhere lets you create and operate Kubernetes clusters on your own infrastructure using the same tooling as Amazon EKS in the cloud.
Virtual Private Cloud
Amazon Virtual Private Cloud (VPC) lets you provision a logically isolated section of the AWS Cloud to launch AWS resources in a virtual network.
Device Farm
AWS Device Farm is an app testing service that lets you test and interact with your Android, iOS, and web apps on real, physical devices hosted by AWS.
CloudWatch
Amazon CloudWatch monitors AWS resources and applications, providing metrics, logs, and alarms for visibility and automation.
AMI
Amazon Machine Image (AMI) is a preconfigured template that contains the software configuration (OS, application server, applications) required to launch an EC2 instance.