Data Science with Python: Munging, Outliers, and Feature Engineering
Offered By: Pluralsight
Course Description
Overview
Learn to preprocess and engineer features from raw data using Python. This course will teach you to handle missing values, detect outliers, create relevant features, and optimize datasets for effective data science projects.
Raw data often contains missing values, outliers, and irrelevant features that can hinder the success of data science projects. In this course, Data Science with Python: Munging, Outliers, and Feature Engineering, you'll gain the ability to preprocess and engineer features from raw data effectively using Python. First, you'll explore techniques for handling missing data and imputing missing values to ensure your datasets remain informative and reliable for analysis. Next, you'll discover methods for detecting and treating outliers using statistical and machine learning approaches, developing strategies to handle them appropriately. Finally, you'll learn how to create meaningful features from raw data, transform categorical variables, and optimize datasets for efficient analysis by dropping irrelevant columns and reordering them for better readability and processing. When you're finished with this course, you'll have the skills and knowledge of data munging and feature engineering needed to tackle real-world data science problems, transform raw data into clean, informative datasets, and enhance the performance of your machine learning models.
Raw data often contains missing values, outliers, and irrelevant features that can hinder the success of data science projects. In this course, Data Science with Python: Munging, Outliers, and Feature Engineering, you'll gain the ability to preprocess and engineer features from raw data effectively using Python. First, you'll explore techniques for handling missing data and imputing missing values to ensure your datasets remain informative and reliable for analysis. Next, you'll discover methods for detecting and treating outliers using statistical and machine learning approaches, developing strategies to handle them appropriately. Finally, you'll learn how to create meaningful features from raw data, transform categorical variables, and optimize datasets for efficient analysis by dropping irrelevant columns and reordering them for better readability and processing. When you're finished with this course, you'll have the skills and knowledge of data munging and feature engineering needed to tackle real-world data science problems, transform raw data into clean, informative datasets, and enhance the performance of your machine learning models.
Syllabus
- Course Overview 2mins
- Data Munging 26mins
- Feature Engineering 18mins
Taught by
Dr. Ali Feizollah
Related Courses
Data Science at Scale - Capstone ProjectUniversity of Washington via Coursera Feature Engineering for Improving Learning Environments
University of Texas Arlington via edX How to Win a Data Science Competition: Learn from Top Kagglers
Higher School of Economics via Coursera Advanced Machine Learning
The Open University via FutureLearn Feature Engineering
Google Cloud via Coursera