- PySpark
- Data Analysis Expressions (DAX)
- Data Storage Technologies
- Data Pipelines
- Performance Tuning
- Data Manipulation
- Apache Spark
- Apache Hadoop
- SQL
- Data Storage
- Data Processing
- Distributed Computing
PySpark in Action: Hands-On Data Processing
Completed by Narsin Ashritha
June 27, 2025
15 hours (approximately)
Narsin Ashritha 's account is verified. Coursera certifies their successful completion of PySpark in Action: Hands-On Data Processing
What you will learn
Explore the fundamental concepts of Big Data and the components of the Hadoop ecosystem.
Explain the architecture and key principles of Apache Spark and its role in big data processing.
Utilize RDD transformations and actions to effectively process large-scale datasets with PySpark.
Execute advanced DataFrame operations, including data manipulation and aggregation techniques.
Skills you will gain

