Machine Learning with Apache Spark
Completed by Mario Jiménez Gutiérrez
June 6, 2024
15 hours (approximately)
Mario Jiménez Gutiérrez's account is verified. Coursera certifies their successful completion of Machine Learning with Apache Spark
What you will learn
Describe ML, explain its role in data engineering, summarize generative AI, discuss Spark's uses, and analyze ML pipelines and model persistence.
Evaluate ML models, distinguish between regression, classification, and clustering models, and compare data engineering pipelines with ML pipelines.
Construct the data analysis processes using Spark SQL, and perform regression, classification, and clustering using SparkML.
Demonstrate connecting to Spark clusters, build ML pipelines, perform feature extraction and transformation, and model persistence.
Skills you will gain
- Category: Regression Analysis
- Category: Data Processing
- Category: Model Deployment
- Category: Extract, Transform, Load
- Category: Predictive Modeling
- Category: Model Evaluation
- Category: Data Transformation
- Category: Data Pipelines
- Category: Generative AI
- Category: Unsupervised Learning
- Category: Classification Algorithms
- Category: Apache Spark

