YoVDO

Techniques to Work with Imbalanced Data for Machine Learning in Python

Offered By: DigitalSreeni via YouTube

Tags

Machine Learning Courses Deep Learning Courses Python Courses Data Collection Courses Decision Trees Courses

Course Description

Overview

Learn seven effective techniques for handling imbalanced data in machine learning using Python. Explore methods such as upsampling minority classes, downsampling majority classes, combining over and under sampling, penalizing algorithms for misclassification of minority classes, generating synthetic data with SMOTE and ADASYN, and adding appropriate weights to deep learning models. Understand the importance of selecting proper metrics, analyzing confusion matrices, and using ROC_AUC scores. Gain practical insights through examples in image segmentation, feature generation, and label creation. Download accompanying code from the provided GitHub repository to apply these techniques in your own projects.

Syllabus

Intro
What is imbalance
Top 7 techniques
Image segmentation
Generate features
Create labels
Unique and counts
Accuracy
ROCAUC Score
Upsampling
moti
smote
results
Deep learning
Class weights
Adding weights
Manual class weights
Summary


Taught by

DigitalSreeni

Related Courses

Statistical Learning with R
Stanford University via edX
The Analytics Edge
Massachusetts Institute of Technology via edX
Machine Learning 1—Supervised Learning
Brown University via Udacity
The Caltech-JPL Summer School on Big Data Analytics
California Institute of Technology via Coursera
機器學習技法 (Machine Learning Techniques)
National Taiwan University via Coursera