Scaling Python Data Applications with Dask
Offered By: Pluralsight
Course Description
Overview
Learn how to work with very large datasets without leaving familiar and rich Python data ecosystem. This course will teach you how to leverage power of Dask library in order to handle data that is too big for regular tools like Pandas or NumPy.
Working with so-called ‘Big Data’ can be a daunting task and many tools that solve this problem have a very steep learning curve. Also, developers familiar with Python may not want to resort to solutions built on another technology stack. In this course, Scaling Python Data Applications with Dask 1, you will gain the ability to work with very large datasets using a Python-native and approachable tool. First, you will learn how to use Dask when your application written using standard Python stops working because of the growing size of the data. Next, you will discover how Dask works underneath and what techniques it uses to make processing large datasets in various scenarios possible and accessible. Finally, you will explore how to exchange Pandas and NumPy for their Big Data variants, with practically no changes to the code. When you’re finished with this course, you will have the skills and knowledge of Dask needed to confidently write data applications that scale, using exclusively Python stack.
Working with so-called ‘Big Data’ can be a daunting task and many tools that solve this problem have a very steep learning curve. Also, developers familiar with Python may not want to resort to solutions built on another technology stack. In this course, Scaling Python Data Applications with Dask 1, you will gain the ability to work with very large datasets using a Python-native and approachable tool. First, you will learn how to use Dask when your application written using standard Python stops working because of the growing size of the data. Next, you will discover how Dask works underneath and what techniques it uses to make processing large datasets in various scenarios possible and accessible. Finally, you will explore how to exchange Pandas and NumPy for their Big Data variants, with practically no changes to the code. When you’re finished with this course, you will have the skills and knowledge of Dask needed to confidently write data applications that scale, using exclusively Python stack.
Syllabus
- Course Overview 1min
- Understanding Dask 17mins
- Scaling Simple Python Data Apps 19mins
- Dask Internals and Dashboard 17mins
- Scaling NumPy and Pandas 18mins
- Beyond Single Machine 6mins
Taught by
Paweł Kordek
Related Courses
Amazon SageMaker: Build an Object Detection Model Using Images Labeled with Ground Truth (Vietnamese)Amazon Web Services via AWS Skill Builder Amazon SageMaker: Build an Object Detection Model Using Images Labeled with Ground Truth (German)
Amazon Web Services via AWS Skill Builder Amazon SageMaker: Build an Object Detection Model Using Images Labeled with Ground Truth (Korean)
Amazon Web Services via AWS Skill Builder Amazon SageMaker: Build an Object Detection Model Using Images Labeled with Ground Truth (French)
Amazon Web Services via AWS Skill Builder Amazon SageMaker: Build an Object Detection Model Using Images Labeled with Ground Truth (Indonesian)
Amazon Web Services via AWS Skill Builder