Data Lake PNG and SVG Icon
AWS Data Lake is a centralized, scalable, and secure data repository that allows you to store and analyze all your structured and unstructured data.
Last Modified: August 29, 2025

16px
32px
48px
64px
Details
Key Features
- Centralized repository for storing structured and unstructured data.
- Supports multiple analytics frameworks and tools.
- Scalable storage with pay-as-you-go pricing.
- Enables advanced analytics and machine learning.
Common Use Cases
- Centralizing structured and unstructured data for analytics
- Storing historical data for compliance and auditing
- Supporting AI/ML workloads with unified data access
Explore More Icons
CloudFormation
AWS CloudFormation provides infrastructure as code for provisioning AWS resources using YAML or JSON templates.
IAM Permissions
IAM Permissions define what actions users and roles can take on which resources, using policies attached to IAM identities or resources.
AWS Glue Data Catalog
AWS Glue Data Catalog is a centralized metadata repository that stores table definitions, job metadata, and other control information to enable data discovery and ETL operations across AWS services.
CloudFront
Amazon CloudFront is a fast content delivery network (CDN) service that securely delivers data, videos, applications, and APIs globally.
Interactive Video Service
Amazon Interactive Video Service (IVS) is a managed live streaming solution that makes it easy to build interactive video experiences.
MemoryDB
Amazon MemoryDB for Redis is a Redis-compatible, in-memory database service designed for ultra-fast performance and durability.
VPN Connection
A VPN Connection in AWS provides encrypted connectivity between your data center or device and your AWS VPC.
Amazon Aurora
Amazon Aurora is a MySQL- and PostgreSQL-compatible relational database built for the cloud, offering high performance and availability at a global scale.
Keyspaces
Amazon Keyspaces is a scalable, highly available, and managed Apache Cassandra-compatible database service.
RDS
Amazon RDS (Relational Database Service) simplifies the setup, operation, and scaling of relational databases in the cloud.
Lookout for Vision
Amazon Lookout for Vision uses computer vision to detect defects in manufactured products based on images of the items.
Wavelength
AWS Wavelength brings AWS services to the edge of the 5G network, minimizing latency for mobile and edge applications by deploying compute closer to users.
Timestream
Amazon Timestream is a fast, scalable, serverless time series database service for IoT and operational applications.
Elastic Cache for Memcached
Elastic Cache for Memcached is a managed, in-memory key-value store service by AWS for deploying Memcached clusters in the cloud.
Outposts servers
AWS Outposts servers are smaller form factor devices that bring AWS services to edge or branch office environments with constrained space or resources.
Train
AWS Train refers to the process of building and optimizing machine learning models using training data on services like SageMaker.
Elemental Link
AWS Elemental Link is a hardware device that connects a live video source to AWS Elemental MediaLive for high-quality, low-latency cloud-based encoding.
CodeWhisperer
Amazon CodeWhisperer is an AI coding companion that suggests real-time code snippets and helps developers write code faster and more securely.
X Ray
AWS X-Ray helps developers analyze and debug production and distributed applications, such as those built using microservices architecture.
Resource Access Manager
AWS Resource Access Manager (RAM) enables you to securely share AWS resources with other AWS accounts or within your organization.
IoT Greengrass
AWS IoT Greengrass is an open-source edge runtime and cloud service that helps build, deploy, and manage device software at the edge.
DeepComposer
AWS DeepComposer is a creative ML tool that lets developers generate music using AI models and a physical MIDI keyboard.
Fault Injection Service
AWS Fault Injection Service is a fully managed chaos engineering service that helps you improve application resilience by injecting failures into your AWS workloads.
Red Hat OpenShift Service on AWS
Red Hat OpenShift Service on AWS (ROSA) is a fully managed service that enables you to run Red Hat OpenShift, a Kubernetes-based container platform, directly on AWS.