In this 1 hour long project-based course, you will learn to build a linear regression model using Pyspark ML to predict students' admission at the university. We will use the graduate admission 2 data set from Kaggle. Our goal is to use a Simple Linear Regression Machine Learning Algorithm from the Pyspark Machine learning library to predict the chances of getting admission. We will be carrying out the entire project on the Google Colab environment with the installation of Pyspark. You will need a free Gmail account to complete this project. Please be aware of the fact that the dataset and the model in this project, can not be used in the real-life. We are only using this data for the learning purposes.
New year. Big goals. Bigger savings. Unlock a year of unlimited access to learning with Coursera Plus for $199. Save now.
(32 reviews)
What you'll learn
Learn to build the Linear Regression Model using Pyspark ML to predict admission
Learn to setup Pyspark and work with Pyspark dataframes in Colab Environment
Learn to clean and prepare data for analysis.
Skills you'll practice
- Apache Spark
- Analytics
- PySpark
- Data Analysis
- Machine Learning Methods
- Predictive Modeling
- Data Science
- Business Analytics
- Machine Learning
- Artificial Intelligence and Machine Learning (AI/ML)
- Applied Machine Learning
- Statistical Machine Learning
- Probability & Statistics
- Statistical Modeling
- Statistics
- Predictive Analytics
- Mathematical Modeling
- Statistical Analysis
- Advanced Analytics
Details to know
Add to your LinkedIn profile
Only available on desktop
See how employees at top companies are mastering in-demand skills
Learn, practice, and apply job-ready skills in less than 2 hours
- Receive training from industry experts
- Gain hands-on experience solving real-world job tasks
- Build confidence using the latest tools and technologies
About this Guided Project
Learn step-by-step
In a video that plays in a split-screen with your work area, your instructor will walk you through these steps:
Introduction and Installing Dependencies
Clone and Explore the Dataset
Data Cleaning
Correlation analysis and Feature Selection
Build the Linear Regression Model
Evaluate and Test the model
4 project images
Instructor
Offered by
How you'll learn
Skill-based, hands-on learning
Practice new skills by completing job-related tasks.
Expert guidance
Follow along with pre-recorded videos from experts using a unique side-by-side interface.
No downloads or installation required
Access the tools and resources you need in a pre-configured cloud workspace.
Available only on desktop
This Guided Project is designed for laptops or desktop computers with a reliable Internet connection, not mobile devices.
Why people choose Coursera for their career
Learner reviews
32 reviews
- 5 stars
78.12%
- 4 stars
12.50%
- 3 stars
6.25%
- 2 stars
3.12%
- 1 star
0%
Showing 3 of 32
Reviewed on Aug 25, 2021
Straightforward tutorial of how to use pyspark for a simple machine learning task.
Reviewed on Aug 9, 2022
Great walkthrough w good explanations of the concepts used.
You might also like
Google Cloud
University of Washington
New to Machine Learning? Start here.
Open new doors with Coursera Plus
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy
Frequently asked questions
By purchasing a Guided Project, you'll get everything you need to complete the Guided Project including access to a cloud desktop workspace through your web browser that contains the files and software you need to get started, plus step-by-step video instruction from a subject matter expert.
Because your workspace contains a cloud desktop that is sized for a laptop or desktop computer, Guided Projects are not available on your mobile device.
Guided Project instructors are subject matter experts who have experience in the skill, tool or domain of their project and are passionate about sharing their knowledge to impact millions of learners around the world.