- Data Science
- SQL
- Apache Spark
- Delta Lake
Distributed Computing with Spark SQL
Completed by Wesley Silva
May 27, 2022
13 hours (approximately)
Wesley Silva's account is verified. Coursera certifies their successful completion of Distributed Computing with Spark SQL.
What you will learn
- Use the collaborative Databricks workspace to write scalable Spark SQL code that executes against a cluster of machines
- Inspect the Spark UI to analyze query performance and identify bottlenecks
- Create an end-to-end pipeline that reads data, transforms it, and saves the result
- Build a medallion (bronze, silver, gold) lakehouse architecture with Delta Lake to ensure the reliability, scalability, and performance of your data
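
The read, transform, and save flow and the bronze, silver, and gold layering named above can be sketched in a few lines. This is only an illustration, not course material: it swaps Spark SQL and Delta Lake for Python's built-in `sqlite3` module so it runs without a cluster, and the table names (`bronze_events`, `silver_events`, `gold_user_totals`), the columns, and the `build_medallion` function are all hypothetical.

```python
import sqlite3


def build_medallion(raw_rows):
    """Load raw rows, refine them in three layers, and return the gold rows.

    Assumption: each raw row is a (user_id, amount) pair of strings or None.
    SQLite stands in for Spark SQL and Delta Lake here.
    """
    conn = sqlite3.connect(":memory:")
    cur = conn.cursor()

    # Bronze: store the raw input unchanged, everything kept as text.
    cur.execute("CREATE TABLE bronze_events (user_id TEXT, amount TEXT)")
    cur.executemany("INSERT INTO bronze_events VALUES (?, ?)", raw_rows)

    # Silver: trim user ids, cast amounts to numbers, and drop rows with a
    # missing user id or an amount that does not start with a digit.
    cur.execute(
        """
        CREATE TABLE silver_events AS
        SELECT TRIM(user_id) AS user_id, CAST(amount AS REAL) AS amount
        FROM bronze_events
        WHERE user_id IS NOT NULL
          AND TRIM(user_id) <> ''
          AND amount GLOB '[0-9]*'
        """
    )

    # Gold: one aggregated row per user, ready for reporting.
    cur.execute(
        """
        CREATE TABLE gold_user_totals AS
        SELECT user_id, SUM(amount) AS total_amount, COUNT(*) AS n_events
        FROM silver_events
        GROUP BY user_id
        """
    )

    rows = cur.execute(
        "SELECT user_id, total_amount, n_events "
        "FROM gold_user_totals ORDER BY user_id"
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    sample = [("alice", "10.5"), (" bob ", "3"), ("alice", "4.5"), ("carol", "n/a")]
    for row in build_medallion(sample):
        print(row)
```

Each layer is built only from the one before it, so the raw data stays available for reprocessing if the cleaning rules change. In a Databricks setting, the same three statements would presumably be written as Spark SQL against Delta tables rather than SQLite tables.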