Flight Delay Dataset Creation
Offered By: Rob Mulla via YouTube
Course Description
Overview
Embark on a comprehensive live coding session that guides you through the process of creating a flight delay dataset using Python and Pandas on Kaggle. Learn how to pull airline flight data, explore existing datasets, and navigate public flight information sources. Gain hands-on experience in data cleaning, feature understanding, and visualization techniques using Plotly Express. Follow along as the instructor encounters and resolves real-time challenges, including dealing with partial data and downloading additional information. By the end of this tutorial, you'll have created a Kaggle dataset, generated visualizations, and gained valuable insights into the intricacies of working with flight delay data.
Syllabus
Intro
Existing Kaggle Datasets
Finding Public Flight Data
Checking the CSV in IPython
Creating a Kaggle Dataset
Dalle2 Dataset Image
Data Exploration
Understanding Features
More Data Cleaning
Plotly Express Plot
Plotting Cancellation Rates
Realizing it's Partial Data
Downloading More Data
Data Filtering
Final Plot Works
Bye Bye
Taught by
Rob Mulla
Related Courses
Data Wrangling with MongoDBMongoDB via Udacity Getting and Cleaning Data
Johns Hopkins University via Coursera 软件包在流行病学研究中的应用 Using software apps in epidemiological research
Peking University via Coursera Creating an Analytical Dataset
Udacity Implementing ETL with SQL Server Integration Services
Microsoft via edX