YoVDO

Simulating Time Series Data by Parallel Computing in Python

Offered By: Coursera Project Network via Coursera

Tags

Time Series Analysis Courses Data Science Courses Big Data Courses Python Courses Correlation Courses Parallel Computing Courses

Course Description

Overview

By the end of this project, you will learn how to simulate large datasets from a small original dataset using parallel computing in Python, a free, open-source program that you can download. Sometimes large datasets are not readily available when a project has just started or when a proof of concept prototype is required. In this project, you will learn how to find the rate of change of a time dependent parameter. Next, you will learn how to simulate large number of values using the starmap function. Lastly, you will learn how to simulate large datasets while maintaining the original correlation between columns using a custom function passed to parallel processes.

In this project, you will generate 10000 time dependent samples from an initial dataset containing just 20 samples. In reality, you can use several parallel processes and can generate millions of new time dependent samples which can be used to experiment a new big data product or solution.

Note: You will need a Gmail account which you will use to sign into Google Colab.

Note: This course works best for learners who are based in the North America region. We’re currently working on providing the same experience in other regions.

Taught by

Dr. Nikunj Maheshwari

Related Courses

Policy Analysis Using Interrupted Time Series
The University of British Columbia via edX
Quantitative Finance
Indian Institute of Technology Kanpur via Swayam
Macroeconometric Forecasting
International Monetary Fund via edX
Explaining Your Data Using Tableau
University of California, Davis via Coursera
Time Series Forecasting
Udacity