This course delves into regression analysis using R, covering key concepts, software tools, and differences between statistical analysis and machine learning.
Regression Analysis for Statistics & Machine Learning in R
Instructor: Packt - Course Instructors
Sponsored by Louisiana Workforce Commission
Recommended experience
What you'll learn
Understand the principles of Ordinary Least Square (OLS) regression and its application in R.
Analyze and evaluate statistical and ML-based regression models to address issues like multicollinearity.
Apply techniques for variable selection and evaluate model accuracy using cross-validation methods.
Create and interpret Generalized Linear Models (GLMs), utilizing logistic regression as a binary classifier.
Details to know
Add to your LinkedIn profile
8 assignments
August 2024
See how employees at top companies are mastering in-demand skills
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV
Share it on social media and in your performance review
There are 7 modules in this course
In this module, we will introduce you to the essential concepts and tools for regression analysis in R. You'll learn the differences between statistical analysis and machine learning, get familiar with R and R Studio, and start working with data. We'll guide you through the steps of data cleaning and perform some initial exploratory data analysis to set a solid foundation for your future learning.
What's included
8 videos1 reading1 assignment
In this module, we will delve into Ordinary Least Squares (OLS) regression, covering both theory and practical implementation in R. You will learn how to interpret OLS results, calculate and apply confidence intervals, and explore various OLS regression techniques, including models without intercepts, ANOVA, and multiple linear regression with interaction and dummy variables. Additionally, we will discuss the essential conditions that OLS models must satisfy to ensure accurate and reliable results.
What's included
12 videos1 assignment
In this module, we will address the challenge of multicollinearity in OLS regression models. You will learn how to detect multicollinearity and manage regression analyses with correlated predictors. The module covers advanced regression techniques such as Principal Component Regression, Partial Least Square Regression, Ridge Regression, and LASSO Regression, providing you with a comprehensive toolkit to handle multicollinearity effectively in R.
What's included
7 videos1 assignment
In this module, we will explore the critical aspects of variable and model selection in regression analysis. You will understand why selection is essential, learn how to choose the most appropriate OLS regression model, and identify model subsets. We'll cover evaluating regression model accuracy from a machine learning perspective and assessing performance using diverse metrics. Additionally, you will implement LASSO Regression for variable selection and analyze the contribution of predictors in explaining the variation in the outcome variable.
What's included
8 videos1 assignment
In this module, we will tackle common violations of OLS regression model assumptions. You will learn how to apply data transformations to correct issues, use robust regression methods to manage outliers, and address heteroscedasticity to ensure the reliability and validity of your regression models. This module equips you with essential techniques to refine your analysis and improve model performance.
What's included
4 videos1 assignment
In this module, we will introduce you to Generalized Linear Models (GLMs) and their various applications. You will learn the fundamentals of GLMs, including logistic regression for binary response variables, multinomial logistic regression, and regression techniques for count data. Additionally, we will cover methods to evaluate the goodness of fit for these models. This module will enhance your understanding of how GLMs extend traditional linear regression models to handle a wider range of data types and distributions.
What's included
7 videos1 assignment
In this module, we will explore advanced methods for working with non-parametric and non-linear data. You will learn to implement polynomial and non-linear regression techniques, use Generalized Additive Models (GAMs) and their boosted versions, and develop Multivariate Adaptive Regression Splines (MARS) models. We will also cover CART regression trees, Conditional Inference Trees, Random Forests, and Gradient Boosting Regression. Additionally, you will gain insights into selecting suitable machine learning models for complex data scenarios, enhancing your ability to handle diverse data structures in R.
What's included
10 videos2 assignments
Instructor
Offered by
Why people choose Coursera for their career
Recommended if you're interested in Data Science
Duke University
Coursera Project Network
Open new doors with Coursera Plus
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy