Crawler PNG and SVG Icon
AWS Glue Crawler is a component that automatically scans data sources, infers schemas, and creates metadata tables in the AWS Glue Data Catalog.
Last Modified: August 29, 2025
Available sizes: 16px, 32px, 48px, 64px
Details
Key Features
- Automatically scans data sources to detect schema and metadata.
- Populates AWS Glue Data Catalog entries.
- Supports incremental crawls for efficiency.
- Integrates with Amazon S3, Amazon RDS, Amazon Redshift, and other JDBC sources.
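As a rough sketch of how these features map onto a crawler definition, the snippet below builds the keyword arguments one might pass to boto3's `glue.create_crawler`. The helper `build_crawler_request` and all names, ARNs, and paths are hypothetical placeholders, and no AWS call is made. It also assumes that incremental crawls (`CRAWL_NEW_FOLDERS_ONLY`) require both schema change behaviors to be `LOG`; check the current Glue documentation before relying on that.

```python
def build_crawler_request(name, role_arn, database, s3_path, incremental=True):
    """Return keyword arguments shaped for boto3's glue.create_crawler.

    All argument values are supplied by the caller; nothing here talks to AWS.
    """
    request = {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,  # Data Catalog database that receives the tables
        "Targets": {"S3Targets": [{"Path": s3_path}]},
        "SchemaChangePolicy": {
            "UpdateBehavior": "UPDATE_IN_DATABASE",
            "DeleteBehavior": "LOG",
        },
    }
    if incremental:
        # Only crawl folders added since the last run.
        request["RecrawlPolicy"] = {"RecrawlBehavior": "CRAWL_NEW_FOLDERS_ONLY"}
        # Assumption: incremental crawls need LOG for both behaviors.
        request["SchemaChangePolicy"] = {
            "UpdateBehavior": "LOG",
            "DeleteBehavior": "LOG",
        }
    return request


# Hypothetical example values.
request = build_crawler_request(
    name="example-crawler",
    role_arn="arn:aws:iam::123456789012:role/ExampleGlueRole",
    database="example_db",
    s3_path="s3://example-bucket/raw/",
)
```

With boto3 installed and credentials configured, `boto3.client("glue").create_crawler(**request)` would submit this definition.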
Common Use Cases
- Automatically discovering and cataloging new datasets in S3
- Propagating schema changes to the AWS Glue Data Catalog
- Classifying data by file type and structure for ETL jobs
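To give a feel for what inferring a schema from a file involves, here is a deliberately simplified sketch that guesses column types from a small CSV sample. It is not how Glue's classifiers are implemented; the functions `infer_type` and `infer_schema`, the sample data, and the choice of type names are all illustrative assumptions.

```python
import csv
import io


def infer_type(values):
    """Pick the narrowest of bigint, double, string that fits every non-empty value."""

    def all_parse(cast):
        for value in values:
            if value == "":
                continue  # treat empty cells as missing, not as evidence
            try:
                cast(value)
            except ValueError:
                return False
        return True

    if all_parse(int):
        return "bigint"
    if all_parse(float):
        return "double"
    return "string"


def infer_schema(csv_text):
    """Map each column name in a CSV sample (with header row) to a guessed type."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    columns = list(rows[0].keys()) if rows else []
    return {col: infer_type([row[col] for row in rows]) for col in columns}


sample = "id,price,city\n1,9.99,Lisbon\n2,12,Osaka\n"
schema = infer_schema(sample)
print(schema)  # {'id': 'bigint', 'price': 'double', 'city': 'string'}
```

A real crawler also has to recognize the file format, handle nested structures, and reconcile schemas across many files, none of which this sketch attempts.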
Explore More Icons
Lightsail for Research
Amazon Lightsail for Research provides pre-configured virtual research environments with tools and computing resources for researchers and students.
Resilience Hub
AWS Resilience Hub helps you assess and improve the resilience of your applications using AWS best practices.
Aurora
Aurora is a high-performance cloud-native relational database engine developed by AWS, compatible with MySQL and PostgreSQL.
CloudHSM
AWS CloudHSM provides hardware security modules in the AWS Cloud to generate and use your own encryption keys with FIPS 140-2 Level 3 compliance.
Neptune
Amazon Neptune is a fast, reliable, and fully managed graph database service for building applications with highly connected data.
DataSync
AWS DataSync is an online data transfer service that simplifies, automates, and accelerates moving data between on-premises storage and AWS.
File System
File System represents AWS-managed or integrated file storage solutions like EFS, FSx, and on-premises gateways for structured file access.
HTTP Notification
AWS HTTP Notification typically refers to using Amazon SNS or EventBridge to send notifications to HTTP endpoints in real time.
DeepComposer
AWS DeepComposer is a creative ML tool that lets developers generate music using AI models and a physical MIDI keyboard.
Outposts servers
AWS Outposts servers are smaller form factor devices that bring AWS services to edge or branch office environments with constrained space or resources.
License Manager
AWS License Manager helps you manage software licenses from vendors like Microsoft, SAP, and Oracle on AWS and on-premises.
User Notifications
AWS User Notifications is a service that lets you centrally manage and deliver alerts and notifications from AWS services.
Simulation
AWS RoboMaker Simulation lets you test and validate robotic applications in large-scale virtual 3D environments without physical hardware.
Elemental MediaTailor
AWS Elemental MediaTailor enables server-side ad insertion for video content, allowing personalized ad experiences across platforms.
Application Discovery Service
AWS Application Discovery Service helps you plan migration projects by collecting usage and configuration data from your on-premises servers.
CloudFront
Amazon CloudFront is a fast content delivery network (CDN) service that securely delivers data, videos, applications, and APIs globally.
Q
Amazon Q is a generative AI assistant designed to help employees and developers get answers and complete tasks using internal and external data.
Thinkbox Deadline
Thinkbox Deadline is a render management tool that provides flexible and scalable management of render farms for creative studios.
SimSpace Weaver
AWS SimSpace Weaver is a managed service that enables you to run large-scale spatial simulations across multiple EC2 instances in near real time.
CodeBuild
AWS CodeBuild is a fully managed continuous integration service that compiles source code, runs tests, and produces deployable artifacts.
IAM Identity Center
AWS IAM Identity Center (formerly AWS SSO) allows centralized access management to AWS accounts and business applications using single sign-on capabilities.
Private 5G
AWS Private 5G enables you to set up and scale your own private cellular network with plug-and-play hardware and software.
Audit Manager
AWS Audit Manager helps you continuously audit your AWS usage to simplify risk assessment and compliance with regulations and industry standards.
WorkMail
Amazon WorkMail is a secure, managed business email and calendar service that supports existing desktop and mobile email clients.