Python for AI: Exploring and Cleaning Data with Pandas
Offered By: AssemblyAI via YouTube
Course Description
Overview
Learn how to explore, clean, and prepare data for machine learning model training in this 39-minute video tutorial. Dive into various data types, documentation practices, and notebook setup before examining the dataset. Master techniques for handling missing values, identifying outliers, and addressing issues with categorical variables. Prepare the target feature and finalize data preparation for model training. Access the provided code and flight delays dataset to follow along. Explore essential Python libraries and tools for AI development, including links to Python, Miniconda, and VS Code. Gain practical skills in data preprocessing, a crucial step in the machine learning pipeline.
Syllabus
- Intro
- Types of data
- Data documentation
- Settings up the notebook
- First look at the data
- Missing values
- Outliers
- Issues with categorical values
- Preparing the target feature
- Final prep for model training
Taught by
AssemblyAI
Related Courses
Data Wrangling with MongoDBMongoDB via Udacity Getting and Cleaning Data
Johns Hopkins University via Coursera 软件包在流行病学研究中的应用 Using software apps in epidemiological research
Peking University via Coursera Creating an Analytical Dataset
Udacity Implementing ETL with SQL Server Integration Services
Microsoft via edX