This course is part of multiple programs.
Instructors: Tiffany Zhu
Instructor ratings
We asked all learners to give feedback on our instructors based on the quality of their teaching style.
30,059 already enrolled
(326 reviews)
Prepare data for analysis by handling missing values, formatting and normalizing data, binning, and turning categorical values into numeric values.
Compare and contrast predictive models using simple linear, multiple linear, and polynomial regression methods.
Examine data using descriptive statistics, data grouping, analysis of variance (ANOVA), and correlation statistics.
Evaluate a model for overfitting and underfitting conditions and tune its performance using regularization and grid search.
11 assignments
The R programming language is purpose-built for data analysis. R is the key that opens the door between the problems that you want to solve with data and the answers you need to meet your objectives. This course starts with a question and then walks you through the process of answering it through data. You will first learn important techniques for preparing (or wrangling) your data for analysis. You will then learn how to gain a better understanding of your data through exploratory data analysis, helping you to summarize your data and identify relevant relationships between variables that can lead to insights. Once your data is ready to analyze, you will learn how to develop your model and evaluate and tune its performance. By following this process, you can be sure that your data analysis performs to the standards that you have set, and you can have confidence in the results.
You will build hands-on experience by playing the role of a data analyst who is analyzing airline departure and arrival data to predict flight delays. Using an Airline Reporting Carrier On-Time Performance Dataset, you will practice reading data files, preprocessing data, creating models, improving models, and evaluating them to ultimately choose the best model. Watch the videos, work through the labs, and add to your portfolio. Good luck! Note: The prerequisite for this course is basic R programming skills. For example, ensure that you have completed a course like Introduction to R Programming for Data Science from IBM.
All data analysis starts with a problem that you need to solve. Understanding your data and the types of questions you can answer about it are key aspects of that starting point. The R programming language provides you with all the tools you need to conduct powerful data analysis, providing the conduit between your data and the real-world problems you want to solve. In this module, you’ll review a type of problem that you can solve in R and the underlying data that forms the basis for your analysis. You’ll also learn about the R packages for data analysis, which provide a powerful set of tools that you’re likely to use in everyday data analyses. Finally, you’ll see how to import data and gain basic insights from the dataset.
6 videos, 1 reading, 2 assignments, 1 app item, 1 plugin
Data wrangling, or data pre-processing, is an essential first step to achieving accurate and complete analysis of your data. This process transforms your raw data into a format that can be easily categorized or mapped to other data, creating predictable relationships between them, and making it easier to build the models you need to answer questions about your data. This module provides an introduction to data pre-processing in R and then provides you with the tools you need to identify and handle missing values in your dataset, transform data formats to align them with other data you may want to compare them to, normalize your data, create categories of information through data binning, and convert categorical variables into quantitative values that can then be used in numeric-based analyses.
6 videos, 1 reading, 2 assignments, 1 app item, 1 plugin
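The four pre-processing steps named in this module (handling missing values, normalizing, binning, and converting categories to numbers) can be illustrated with a small sketch. The course itself works in R; the Python below is only a language-neutral illustration, and the delay values, bin thresholds, and bin labels are hypothetical rather than taken from the course dataset.

```python
from statistics import mean

# Hypothetical arrival delays in minutes; None marks a missing value.
delays = [5.0, None, 42.0, 0.0, 18.0]

# 1. Missing values: replace each None with the mean of the observed values.
observed = [d for d in delays if d is not None]
fill_value = mean(observed)
imputed = [fill_value if d is None else d for d in delays]

# 2. Normalization: min-max scaling maps every value into the range [0, 1].
lowest, highest = min(imputed), max(imputed)
normalized = [(d - lowest) / (highest - lowest) for d in imputed]


# 3. Binning: group continuous delays into categories.
#    The 15- and 30-minute thresholds are assumptions for illustration.
def bin_delay(minutes):
    if minutes < 15:
        return "on_time"
    if minutes < 30:
        return "minor"
    return "major"


binned = [bin_delay(d) for d in imputed]

# 4. Categorical to numeric: one indicator (0/1) column per category.
categories = sorted(set(binned))
one_hot = [[1 if b == c else 0 for c in categories] for b in binned]

print(imputed)
print(normalized)
print(binned)
print(categories, one_hot)
```

Mean imputation and min-max scaling are only one choice each; dropping incomplete rows or z-score scaling are equally common alternatives.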
Exploratory data analysis, or EDA, is an approach to analyzing data that summarizes its main characteristics and helps you gain a better understanding of the dataset, uncover relationships between different variables, and extract important variables for the problem you are trying to solve. The main question you are trying to answer in this module is: "What causes flight delays?" In this module, you’ll learn some useful exploratory data analysis techniques that will help answer this question.
5 videos, 1 reading, 2 assignments, 1 app item, 1 plugin
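As a rough illustration of the exploratory techniques this module covers, the sketch below computes descriptive statistics, group means, and a Pearson correlation. It is written in Python only to show the idea (the course labs use R), and the carrier codes and delay values are hypothetical.

```python
from math import sqrt
from statistics import mean, median, stdev


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mx) ** 2 for x in xs))
    spread_y = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def group_means(keys, values):
    """Average of values for each distinct key (a simple group-by)."""
    groups = {}
    for key, value in zip(keys, values):
        groups.setdefault(key, []).append(value)
    return {key: mean(vals) for key, vals in groups.items()}


# Hypothetical sample: carrier code, departure delay, arrival delay (minutes).
carriers = ["AA", "AA", "DL", "DL", "DL"]
dep_delay = [0.0, 10.0, 20.0, 30.0, 40.0]
arr_delay = [2.0, 9.0, 25.0, 28.0, 41.0]

# Descriptive statistics for one variable.
print(mean(arr_delay), median(arr_delay), stdev(arr_delay))

# Grouping: average arrival delay per carrier.
print(group_means(carriers, arr_delay))

# Correlation between departure delay and arrival delay.
print(pearson(dep_delay, arr_delay))
```

A strong correlation shows that two variables move together; on its own it does not establish which one causes the other.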
You have identified the problem that you’re trying to solve and have pre-processed the dataset you’ll use in your analysis, and you have conducted some exploratory data analysis to answer some of your initial questions. Now, it’s time to develop your model and assess the strength of your assumptions. In this module, you will examine model development by trying to predict the arrival delay of a flight using the Airline dataset. You’ll learn regression techniques for determining the correlation between variables in your dataset, and evaluate the result both visually and through the calculation of metrics.
7 videos, 1 reading, 2 assignments, 1 app item, 1 plugin
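To make the model-development step concrete, here is a minimal sketch of simple linear regression fitted by ordinary least squares and evaluated with two common metrics, mean squared error and R-squared. It is a Python illustration of the idea, not the R code used in the course, and the delay values are hypothetical.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    mx, my = mean(xs), mean(ys)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum(
        (x - mx) ** 2 for x in xs
    )
    intercept = my - slope * mx
    return slope, intercept


def mse(ys, preds):
    """Mean squared error between observed and predicted values."""
    return mean((y - p) ** 2 for y, p in zip(ys, preds))


def r_squared(ys, preds):
    """Share of the variance in ys that the predictions explain."""
    my = mean(ys)
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - my) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Hypothetical data: predict arrival delay from departure delay (minutes).
dep_delay = [0.0, 10.0, 20.0, 30.0, 40.0]
arr_delay = [2.0, 9.0, 25.0, 28.0, 41.0]

slope, intercept = fit_line(dep_delay, arr_delay)
predictions = [slope * x + intercept for x in dep_delay]

print(slope, intercept)
print(mse(arr_delay, predictions), r_squared(arr_delay, predictions))
```

Multiple linear and polynomial regression follow the same pattern with more predictor columns, which is why comparing models on the same metrics is meaningful.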
You have a firm understanding of your data and have pre-processed it to ensure the best possible outcomes. You have also conducted exploratory data analysis and developed your model. Everything looks good so far, but how can you be certain your model works in the real world and performs optimally? In this module, you’ll learn how to use the tidymodels framework to evaluate your model. Tidymodels is a collection of packages for modeling and machine learning using tidyverse principles. Using these packages, you’ll learn how to cross-validate your models, identify potential problems, like overfitting and underfitting, and handle overfitting problems using a technique called regularization. You’ll also learn how to tune your models using grid search.
4 videos, 1 reading, 2 assignments, 1 app item, 1 plugin
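The evaluation ideas in this module (cross-validation, regularization, and grid search) can be sketched without any framework. The Python below is not the tidymodels API and does not reproduce the course labs; it is a minimal illustration that assumes a single predictor, a ridge (L2) penalty on the slope only, and hypothetical data and penalty values.

```python
from statistics import mean


def fit_ridge(xs, ys, lam):
    """One-predictor ridge regression; the intercept is not penalized."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / (sxx + lam)  # lam = 0 gives ordinary least squares
    return slope, my - slope * mx


def cv_mse(xs, ys, lam, k):
    """Mean held-out squared error over k folds (every k-th row per fold)."""
    n = len(xs)
    fold_errors = []
    for fold in range(k):
        held_out = set(range(fold, n, k))
        train_x = [x for i, x in enumerate(xs) if i not in held_out]
        train_y = [y for i, y in enumerate(ys) if i not in held_out]
        slope, intercept = fit_ridge(train_x, train_y, lam)
        errors = [(ys[i] - (slope * xs[i] + intercept)) ** 2 for i in held_out]
        fold_errors.append(mean(errors))
    return mean(fold_errors)


def grid_search(xs, ys, lambdas, k):
    """Score every candidate penalty and return the best one with all scores."""
    scores = {lam: cv_mse(xs, ys, lam, k) for lam in lambdas}
    best = min(scores, key=scores.get)
    return best, scores


# Hypothetical data: departure delay vs. arrival delay (minutes).
dep_delay = [0.0, 5.0, 10.0, 15.0, 20.0, 25.0, 30.0, 35.0, 40.0, 45.0]
arr_delay = [3.0, 4.0, 13.0, 12.0, 24.0, 22.0, 33.0, 31.0, 44.0, 43.0]

best_lam, scores = grid_search(dep_delay, arr_delay, [0.0, 1.0, 10.0, 100.0], k=5)
print(best_lam)
print(scores)
```

Scoring each candidate on rows the model never saw during fitting is what lets the search detect overfitting; a penalty that is too large shows up as underfitting in the same scores.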
4 readings, 1 assignment, 1 peer review, 2 app items, 3 plugins
At IBM, we know how rapidly tech evolves and recognize the crucial need for businesses and professionals to build job-ready, hands-on skills quickly. As a market-leading tech innovator, we’re committed to helping you thrive in this dynamic landscape. Through IBM Skills Network, our expertly designed training programs in AI, software development, cybersecurity, data science, business management, and more, provide the essential skills you need to secure your first job, advance your career, or drive business success. Whether you’re upskilling yourself or your team, our courses, Specializations, and Professional Certificates build the technical expertise that ensures you, and your organization, excel in a competitive world.
326 reviews. Rating breakdown: 82.26%, 11.31%, 2.75%, 1.22%, 2.44%
Reviewed on Dec 2, 2022
Demanding for beginners but rewarding. A lot of extra-curricular study required
Reviewed on Sep 23, 2022
This course is not for the weak. It's not challenging, but you have to be a little dedicated...
Reviewed on Jul 24, 2022
One of the best courses for learning R programming and data analysis.
Access to lectures and assignments depends on your type of enrollment. If you take a course in audit mode, you will be able to see most course materials for free. To access graded assignments and to earn a Certificate, you will need to purchase the Certificate experience, during or after your audit. If you don't see the audit option:
The course may not offer an audit option. You can try a Free Trial instead, or apply for Financial Aid.
The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile. If you only want to read and view the course content, you can audit the course for free.
If you subscribed, you get a 7-day free trial during which you can cancel at no penalty. After that, we don’t give refunds, but you can cancel your subscription at any time. See our full refund policy.