Generating Mock Data with Python - NumPy, Pandas, and Datetime Libraries
Offered By: Keith Galli via YouTube
Course Description
Overview
Syllabus
- Intro & Background Info
- What we're creating in this video!
- Start writing code: generating a simple DataFrame & CSV
- Task: Making our data more realistic, selecting some products with higher probability than others
- Task: Generate 12 months' worth of data in 12 CSVs (calendar library, f-strings)
- Make some months have more purchases than others
- Normal distributions in NumPy
- Improving the speed of our code (making testing easier)
- Task: Generate random addresses for our data
- Task: Generate order times for purchases (datetime library overview)
- Using timedelta objects to add & subtract time from dates
- Generate a realistic quantity ordered for each product using NumPy's geometric distribution
- Make multiple items more likely to be sold together, and clean up the code a bit
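
The syllabus topics above can be tied together in a rough sketch. This is not the code from the video: the product catalog, weights, column names, file-name pattern, and distribution parameters below are all hypothetical values chosen for illustration. It shows weighted product selection, geometric quantities, timedelta-based order times, normally distributed monthly volumes, and f-string file names for the monthly CSVs.

```python
import calendar
import datetime as dt

import numpy as np
import pandas as pd

# Hypothetical catalog (not from the video): name -> (price, relative weight).
PRODUCTS = {
    "USB-C Cable": (11.95, 30),
    "Wired Headphones": (11.99, 25),
    "Monitor": (149.99, 6),
    "Laptop": (999.99, 2),
}


def generate_month(year, month, n_orders, rng):
    """Build one month of mock orders as a DataFrame."""
    names = list(PRODUCTS)
    weights = np.array([PRODUCTS[n][1] for n in names], dtype=float)
    probs = weights / weights.sum()  # normalize weights into probabilities

    # Products with larger weights are selected more often.
    chosen = rng.choice(names, size=n_orders, p=probs)

    # Geometric draws start at 1, so a quantity of 1 is the most common.
    quantities = rng.geometric(p=0.9, size=n_orders)

    # Add a random number of minutes to the first instant of the month.
    days_in_month = calendar.monthrange(year, month)[1]
    start = dt.datetime(year, month, 1)
    offsets = rng.integers(0, days_in_month * 24 * 60, size=n_orders)
    order_times = [start + dt.timedelta(minutes=int(m)) for m in offsets]

    return pd.DataFrame({
        "Product": chosen,
        "Quantity Ordered": quantities,
        "Price Each": [PRODUCTS[p][0] for p in chosen],
        "Order Date": order_times,
    })


def month_filename(year, month):
    """Hypothetical naming scheme for the monthly CSV, built with an f-string."""
    return f"mock_sales_{calendar.month_name[month]}_{year}.csv"


def write_year(year, seed=0):
    """Write 12 CSVs, one per month, with normally distributed order counts."""
    rng = np.random.default_rng(seed)
    for month in range(1, 13):
        # Assumed mean and spread; clamp so every month has at least one order.
        n_orders = max(1, int(rng.normal(loc=200, scale=40)))
        df = generate_month(year, month, n_orders, rng)
        df.to_csv(month_filename(year, month), index=False)
```

Calling `write_year(2019)` would write twelve files to the current directory. Passing a seeded `np.random.default_rng` keeps runs reproducible, which makes it easier to test changes on a small `n_orders` before generating a full year.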
Taught by
Keith Galli
Related Courses
- Computational Investing, Part I (Georgia Institute of Technology via Coursera)
- Introduction to Machine Learning (Higher School of Economics via Coursera)
- Mathematics and Python for Data Analysis (Moscow Institute of Physics and Technology via Coursera)
- Introduction to Python for Data Science (Microsoft via edX)
- Using Python for Research (Harvard University via edX)