Preprocessing Data for Machine Learning - Deep Dive
Offered By: CodeEmporium via YouTube
Course Description
Overview
Dive into a comprehensive 20-minute video tutorial on preprocessing data for machine learning, focusing on logistic regression. Explore the Snape artificial data generator and examine the effects of standardization, encoding, data imbalance, and correlation on your models. Learn about the Variance Inflation Factor and strategies for dealing with multicollinearity. Discover how to handle missing data effectively. Follow along with code examples available on GitHub to enhance your understanding of these crucial preprocessing techniques for logistic regression and improve your machine learning workflows.
Syllabus
Introduction
Snape – Artificial Data Generator
Effects of Standardization
Effects of Encoding
Effects of Data Imbalance
Effects of Correlation
Variance Inflation Factor Explained
Dealing with Multicollinearity
Effects of Missing Data
Summary
Taught by
CodeEmporium
Related Courses
Regression AnalysisIndian Institute of Science Education and Research, Pune via Swayam Econometric Methods
CEC via Swayam Basic Econometrics
CEC via Swayam Inference for Linear Regression in R
DataCamp HR Analytics: Predicting Employee Churn in R
DataCamp