Data Lake PNG and SVG Icon
AWS Data Lake is a centralized, scalable, and secure data repository that allows you to store and analyze all your structured and unstructured data.
Last Modified: August 29, 2025
16px
32px
48px
64px
Details
Key Features
- Centralized repository for storing structured and unstructured data.
- Supports multiple analytics frameworks and tools.
- Scalable storage with pay-as-you-go pricing.
- Enables advanced analytics and machine learning.
Common Use Cases
- Centralizing structured and unstructured data for analytics
- Storing historical data for compliance and auditing
- Supporting AI/ML workloads with unified data access
Explore More Icons
RoboMaker
AWS RoboMaker is a service for developing, simulating, and deploying robotic applications using Robot Operating System (ROS) with cloud scalability.
Apache MXNet on AWS
Apache MXNet on AWS is a scalable deep learning framework optimized for performance and training on AWS infrastructure.
EMR
Amazon EMR (Elastic MapReduce) is a cloud big data platform for processing massive amounts of data using open-source tools like Apache Spark, Hive, and Hadoop.
Elastic Container Registry
Amazon Elastic Container Registry (ECR) is a fully managed Docker container registry that makes it easy to store, manage, and deploy container images.
MySQL Instance
MySQL Instance in Amazon RDS offers a managed MySQL database environment with automated administration and scaling.
Red Hat OpenShift Service on AWS
Red Hat OpenShift Service on AWS (ROSA) is a fully managed service that enables you to run Red Hat OpenShift, a Kubernetes-based container platform, directly on AWS.
EKS Distro
Amazon EKS Distro (EKS-D) is the open-source distribution of the same Kubernetes components used by Amazon EKS, enabling consistent cluster operations on any infrastructure.
Resource Access Manager
AWS Resource Access Manager (RAM) enables you to securely share AWS resources with other AWS accounts or within your organization.
Aurora
Aurora is a high-performance cloud-native relational database engine developed by AWS, compatible with MySQL and PostgreSQL.
Data Firehose
Amazon Kinesis Data Firehose is a fully managed service for delivering real-time streaming data to destinations like Amazon S3, Redshift, and OpenSearch.
Transfer Family
AWS Transfer Family supports fully managed file transfers into and out of AWS using SFTP, FTPS, and FTP protocols.
Thinkbox XMesh
Thinkbox XMesh is a geometry caching system that optimizes complex animated geometry workflows in 3D applications.
SageMaker
Amazon SageMaker is a fully managed service that provides tools to build, train, and deploy machine learning models at scale.
ECS Service
Amazon ECS Service manages the running of long-lived containers on a cluster, handling tasks like scaling and load balancing automatically.
Reserved Instance Reporting
AWS Reserved Instance Reporting helps you monitor and optimize the utilization and coverage of your purchased Reserved Instances for cost savings.
Certificate Manager
AWS Certificate Manager (ACM) lets you easily provision, manage, and deploy public and private SSL/TLS certificates for secure web applications.
Wickr
Amazon Wickr is an end-to-end encrypted messaging and collaboration service designed for secure communications, file sharing, and compliance in enterprises and government.
WAF
AWS Web Application Firewall (WAF) helps protect web applications from common exploits and bots that can affect availability, security, or consume resources.
Console Mobile Application
The AWS Console Mobile Application allows you to view and manage a select set of AWS resources from your mobile device.
Simple Storage Service Glacier
Amazon S3 Glacier is a secure, durable, and low-cost storage class for data archiving and long-term backup.
Cloud Map
AWS Cloud Map lets you register custom names for your application resources, allowing your applications to discover them using DNS or API calls.
Transcribe
Amazon Transcribe is an automatic speech recognition (ASR) service that converts speech into text for applications like subtitles and transcription.
CloudShell
AWS CloudShell is a browser-based shell that provides command-line access to AWS services and tools directly from the AWS Management Console.
SimSpace Weaver
AWS SimSpace Weaver is a managed service that enables you to run large-scale spatial simulations across multiple EC2 instances in near real-time.