Data Analysis Using Pyspark
Completed by Shuangjie Zhao
November 9, 2023
2 hours (approximately)
Shuangjie Zhao's account is verified. Coursera certifies their successful completion of Data Analysis Using Pyspark.
What you will learn
Learn how to set up Google Colab for distributed data processing
Learn to apply different queries to your dataset to extract useful information
Learn how to visualize this information using Matplotlib
Skills you will gain
- Apache Spark
- Big Data
- PySpark
- Matplotlib
- Data Analysis
- Data Cleansing
- Data Processing
- Data Presentation
- Data Visualization
- Python Programming
- Query Languages
- Distributed Computing

