Crawler PNG and SVG Icon
AWS Glue Crawler is a component that automatically scans data sources, infers schemas, and creates metadata tables in the AWS Glue Data Catalog.
Last Modified: August 29, 2025
16px
32px
48px
64px
Details
Key Features
- Automatically scans data sources to detect schema and metadata.
- Populates AWS Glue Data Catalog entries.
- Supports incremental crawls for efficiency.
- Integrates with Amazon S3, RDS, Redshift, and JDBC sources.
Common Use Cases
- Automatically discovering and cataloging new datasets in S3
- Updating schema changes in AWS Glue Data Catalog
- Classifying data by file type and structure for ETL jobs
Explore More Icons
Support
AWS Support provides a range of plans to assist customers with their AWS environments, offering 24/7 technical support, best practices, and guidance from cloud experts.
Textract
Amazon Textract is an AI service that automatically extracts text, tables, and other data from scanned documents and PDFs.
Elemental Server
AWS Elemental Server is an on-premises video processing system that converts input video for distribution to TVs, PCs, and mobile devices.
Application Migration Service
AWS Application Migration Service (MGN) simplifies and accelerates the migration of physical, virtual, or cloud-based servers to AWS with minimal downtime.
Oracle Database at AWS
Oracle Database at AWS enables you to run Oracle databases in the cloud with flexible deployment options, including Amazon RDS and custom EC2 instances.
Parallel Computing Service
AWS Parallel Computing Service enables large-scale parallel processing for scientific, engineering, and analytics workloads using EC2, Batch, or HPC tools.
Elastic Load Balancing
Elastic Load Balancing automatically distributes incoming traffic across multiple targets to ensure application scalability and fault tolerance.
ECS Task
Amazon ECS Task is the smallest deployable unit in ECS, representing a single running container or group of containers defined by a task definition.
Transit Gateway
AWS Transit Gateway connects VPCs and on-premises networks through a central hub, simplifying network architecture and routing.
Kinesis Data Streams
Amazon Kinesis Data Streams is a service for building real-time data streaming applications that can continuously ingest and process large volumes of data.
Change Set
AWS CloudFormation Change Sets let you preview changes AWS CloudFormation will make to your stack before executing them.
Nitro Enclaves
AWS Nitro Enclaves are isolated EC2 environments that enhance security by creating trusted execution environments for sensitive data processing.
RoboMaker
AWS RoboMaker is a service for developing, simulating, and deploying robotic applications using Robot Operating System (ROS) with cloud scalability.
Outposts rack
AWS Outposts rack is a part of the Outposts family that delivers AWS compute and storage racks to on-premises locations for low-latency applications.
Forecast
Amazon Forecast is a fully managed service that uses machine learning to generate accurate time series forecasts based on historical data.
Kendra
Amazon Kendra is an intelligent search service powered by machine learning that enables organizations to search unstructured data using natural language.
Migration Hub
AWS Migration Hub provides a central place to track the progress of application migrations across multiple AWS and partner tools.
DCV
NICE DCV is a high-performance remote display protocol that enables secure, low-latency access to remote desktop and application sessions, including 3D graphics workloads.
Systems Manager
AWS Systems Manager gives you visibility and control of your AWS infrastructure by unifying resource management under one interface.
EFS
Amazon Elastic File System (EFS) provides scalable, fully managed NFS file storage for use with AWS Cloud services and on-premises resources.
Notebook
Amazon SageMaker Notebook is a fully managed Jupyter notebook environment for developing, training, and deploying machine learning models.
MFA Token
An MFA Token is used in AWS Multi-Factor Authentication to provide an additional layer of security by requiring a one-time code from a hardware or virtual device.
QuickSight
Amazon QuickSight is a cloud-powered business intelligence (BI) service that enables you to visualize and share insights from your data with interactive dashboards.
GuardDuty
Amazon GuardDuty is a threat detection service that continuously monitors for malicious or unauthorized behavior to help protect your AWS accounts and workloads.