YoVDO

Uncluster Your Data Science Using Vaex

Offered By: GOTO Conferences via YouTube

Tags

GOTO Conferences Courses Data Science Courses Python Courses Data Manipulation Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore how to build snappy dashboards visualizing hundreds of millions of data points and interactively explore hundreds of gigabytes of data on a single machine using Vaex, an out-of-core DataFrame library in Python. Learn about memory mapping, column-based storage, and the compute and expression system that enables Vaex to perform typical data manipulations, filtering, and aggregations on a billion rows in real-time. Discover how this approach can empower your team by removing the DevOps overhead of configuring and maintaining a cluster. Watch a comprehensive demo showcasing Vaex's capabilities, and gain insights into its production use cases, including examples with Dash. Understand how Vaex evolved from an academic project to a consultancy, and explore its potential applications in data science and machine learning workflows.

Syllabus

Intro
Motivation
Vaex
Concepts: Memory mapping
Concepts: Column based storage
Concepts: No memory copies
Concepts: Compute & expression system
Vaex.io: From academic project to consultancy
Demo
In production
In the wild
In production: Dash example
Vaex.io: Consultancy
Summary
Outro


Taught by

GOTO Conferences

Related Courses

Data Analysis
Johns Hopkins University via Coursera
Computing for Data Analysis
Johns Hopkins University via Coursera
Scientific Computing
University of Washington via Coursera
Introduction to Data Science
University of Washington via Coursera
Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera