Preprocessing Data for Machine Learning - Deep Dive
Offered By: CodeEmporium via YouTube
Course Description
Overview
Dive into a comprehensive 20-minute video tutorial on preprocessing data for machine learning, focusing on logistic regression. Explore the Snape artificial data generator and examine the effects of standardization, encoding, data imbalance, and correlation on your models. Learn about the Variance Inflation Factor and strategies for dealing with multicollinearity. Discover how to handle missing data effectively. Follow along with code examples available on GitHub to enhance your understanding of these crucial preprocessing techniques for logistic regression and improve your machine learning workflows.
Syllabus
Introduction
Snape – Artificial Data Generator
Effects of Standardization
Effects of Encoding
Effects of Data Imbalance
Effects of Correlation
Variance Inflation Factor Explained
Dealing with Multicollinearity
Effects of Missing Data
Summary
Taught by
CodeEmporium
Related Courses
Genomic Data Science and Clustering (Bioinformatics V)University of California, San Diego via Coursera 用Python玩转数据 Data Processing Using Python
Nanjing University via Coursera Data Mining Project
University of Illinois at Urbana-Champaign via Coursera Advanced Business Analytics Capstone
University of Colorado Boulder via Coursera Data Mining: Theories and Algorithms for Tackling Big Data | 数据挖掘:理论与算法
Tsinghua University via edX