In this course, you will learn about the raw ingredients and processes that are used to physically store data on disk and in memory. You’ll explore different storage systems, including object, block, and file storage, as well as the databases that are built on top of these raw ingredients. You’ll also get a chance to use the Cypher language to query a Neo4j graph database and to perform vector similarity search, a key feature behind generative AI and large language models. You will explore the evolution of data storage abstractions, from data warehouses to data lakes and data lakehouses, while comparing the advantages and drawbacks of each architectural paradigm. With hands-on practice, you will design a simple data lake using AWS Glue, and build a data lakehouse using AWS Lake Formation and Apache Iceberg. In the last week of this course, you’ll see how queries work behind the scenes, practice writing more advanced SQL queries, compare query performance in row-oriented vs. column-oriented storage, and perform streaming queries using Apache Flink.
Data Storage and Queries


This course is part of the DeepLearning.AI Data Engineering Professional Certificate


Instructors: Joe Reis +1 more
8,078 already enrolled
83 reviews
What you'll learn
Design storage architectures for various use cases, and select appropriate technologies to implement these architectures
Practice common query patterns and identify ways to improve query performance and enhance the value of your data systems
Skills you'll gain
- Databases
- Data Store
- Data Storage
- Data Architecture
- Data Warehousing
- Data Storage Technologies
- SQL
- File Systems
- Performance Tuning
Tools you'll learn
- Amazon Web Services
- Cloud Storage
- Data Lakes
- Query Languages
- Database Systems
- Vector Databases
Details to know

3 assignments
Build your Cloud Computing expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from DeepLearning.AI

There are 3 modules in this course
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Learner reviews
- 5 stars: 82.14%
- 4 stars: 9.52%
- 3 stars: 2.38%
- 2 stars: 2.38%
- 1 star: 3.57%
Reviewed on Apr 24, 2025
This is a really excellent course covering a number of topics that anyone going into data engineering should be familiar with.
Reviewed on May 24, 2025
Excellent course. Iceberg is still a new thing, but the way the tutor takes us from the need for a data lake to the data lakehouse and then to Iceberg is great.
Reviewed on Jan 22, 2026
Excellent course. It is like reading a large book, with the same result plus hands-on experience.