Generating Mock Data with Python - NumPy, Pandas, and Datetime Libraries
Offered By: Keith Galli via YouTube
Course Description
Overview
Syllabus
- Intro & Background Info
- What we're creating in this video!
- Start writing code: generating a simple DataFrame & CSV
- Task: Making our data more realistic, selecting some products with higher probability than others
- Task: Generate 12 months' worth of data in 12 CSVs (calendar library, f-strings)
- Make some months have more purchases than others
- Normal distributions in NumPy
- Improving the speed of our code & making testing easier
- Task: Generate random addresses for our data
- Task: Generate order times for purchases (datetime library overview)
- Using timedelta objects to add & subtract time from dates
- Generate a realistic quantity ordered for each product using NumPy's geometric distribution
- Make certain items more likely to be sold together & clean up the code a bit
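
The syllabus topics above can be sketched roughly as follows. This is a minimal illustration, not the code written in the video: the product catalog, prices, weights, column names, file names, and the helpers generate_month and write_year are all hypothetical. It assumes NumPy and pandas are installed and shows weighted product selection, a normally distributed order time, timedelta arithmetic, geometric quantities, and one CSV per month named with an f-string.

```python
import calendar
import datetime as dt

import numpy as np
import pandas as pd

# Hypothetical catalog: product -> (price, relative weight). All values are made up.
PRODUCTS = {
    "USB-C Cable": (11.95, 30),
    "Wired Headphones": (11.99, 20),
    "27in Monitor": (149.99, 5),
    "Laptop": (999.99, 1),
}


def generate_month(year, month, n_orders, rng):
    """Return a DataFrame of n_orders mock purchases for one calendar month."""
    names = list(PRODUCTS)
    weights = np.array([PRODUCTS[name][1] for name in names], dtype=float)
    probs = weights / weights.sum()  # normalize weights into probabilities

    # Some products are selected with higher probability than others.
    chosen = rng.choice(names, size=n_orders, p=probs)

    # Geometric distribution: values start at 1, and 1 is by far the most common.
    quantities = rng.geometric(p=0.9, size=n_orders)

    # calendar.monthrange returns (weekday of the 1st, number of days in the month).
    days_in_month = calendar.monthrange(year, month)[1]
    day_offsets = rng.integers(0, days_in_month, size=n_orders)

    # Order time: minutes after midnight, normally distributed around 19:00
    # (an assumed peak), clipped so every timestamp stays within a single day.
    minutes = np.clip(rng.normal(loc=19 * 60, scale=180, size=n_orders), 0, 24 * 60 - 1)

    # timedelta objects add the day and minute offsets to the first of the month.
    start = dt.datetime(year, month, 1)
    order_dates = [
        start + dt.timedelta(days=int(d), minutes=int(m))
        for d, m in zip(day_offsets, minutes)
    ]

    return pd.DataFrame(
        {
            "Product": chosen,
            "Quantity Ordered": quantities,
            "Price Each": [PRODUCTS[name][0] for name in chosen],
            "Order Date": order_dates,
        }
    )


def write_year(year, out_dir=".", seed=0):
    """Write 12 CSV files, one per month, with a made-up bump in December."""
    rng = np.random.default_rng(seed)
    for month in range(1, 13):
        base = 150 if month == 12 else 100  # assumed seasonal effect
        n_orders = max(1, int(rng.normal(loc=base, scale=10)))
        df = generate_month(year, month, n_orders, rng)
        month_name = calendar.month_name[month]
        df.to_csv(f"{out_dir}/Sales_{month_name}_{year}.csv", index=False)
```

Calling write_year(2019) would write files such as Sales_January_2019.csv to the current directory. Passing a seeded generator keeps the output reproducible, which makes the data easier to test.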
Taught by
Keith Galli
Related Courses
- Computational Investing, Part I: Georgia Institute of Technology via Coursera
- Introduction to Machine Learning (Введение в машинное обучение): Higher School of Economics via Coursera
- Mathematics and Python for Data Analysis (Математика и Python для анализа данных): Moscow Institute of Physics and Technology via Coursera
- Introduction to Python for Data Science: Microsoft via edX
- Python for Data Science: University of California, San Diego via edX