Uncluster Your Data Science Using Vaex
Offered By: GOTO Conferences via YouTube
Course Description
Overview
Learn how to build snappy dashboards that visualize hundreds of millions of data points, and how to interactively explore hundreds of gigabytes of data on a single machine, using Vaex, an out-of-core DataFrame library for Python. Learn about memory mapping, column-based storage, and the compute and expression system that enables Vaex to perform typical data manipulations, filtering, and aggregations on a billion rows in real time. Discover how this approach can empower your team by removing the DevOps overhead of configuring and maintaining a cluster. Watch a demo showcasing Vaex's capabilities, and gain insight into its production use cases, including an example with Dash. Understand how Vaex evolved from an academic project into a consultancy, and explore its potential applications in data science and machine learning workflows.
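To make the memory-mapping and column-based storage ideas concrete, here is a minimal sketch using only the Python standard library. It is not Vaex's implementation and does not use the Vaex API; the function names (`write_column`, `column_mean`) and the raw float64 file layout are assumptions chosen for illustration. It stores a single column contiguously on disk, maps the file into memory, and aggregates it chunk by chunk without copying the whole column into Python objects at once.

```python
import mmap
import os
import struct
import tempfile

ITEM_SIZE = struct.calcsize("d")  # bytes per float64 value


def write_column(path, n_rows, chunk_rows=10_000):
    """Write one column of float64 values (0.0, 1.0, 2.0, ...) contiguously to disk."""
    with open(path, "wb") as f:
        for start in range(0, n_rows, chunk_rows):
            stop = min(start + chunk_rows, n_rows)
            values = [float(i) for i in range(start, stop)]
            f.write(struct.pack(f"{len(values)}d", *values))


def column_mean(path, chunk_rows=10_000):
    """Compute the mean of a memory-mapped column, one chunk at a time."""
    total = 0.0
    with open(path, "rb") as f:
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mapped:
            n_rows = len(mapped) // ITEM_SIZE
            # cast() gives a typed view over the mapping without copying the file.
            with memoryview(mapped) as raw, raw.cast("d") as column:
                for start in range(0, n_rows, chunk_rows):
                    # Only the pages backing this slice need to be read from disk.
                    total += sum(column[start:start + chunk_rows])
    return total / n_rows


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "x.f64")  # hypothetical file name
        write_column(path, 100_000)
        print(column_mean(path))
```

Because each column lives in its own contiguous region, an aggregation over one column touches only that column's bytes. The operating system pages data in on demand, which is the general idea that lets out-of-core libraries work with files larger than RAM.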
Syllabus
Intro
Motivation
Vaex
Concepts: Memory mapping
Concepts: Column based storage
Concepts: No memory copies
Concepts: Compute & expression system
Vaex.io: From academic project to consultancy
Demo
In production
In the wild
In production: Dash example
Vaex.io: Consultancy
Summary
Outro
Taught by
GOTO Conferences
Related Courses
Addressing Algorithmic Bias
GOTO Conferences via YouTube
Empowering Consumers - Evolution of Software in the Future
GOTO Conferences via YouTube
Why Static Typing Came Back
GOTO Conferences via YouTube
Higher Kinded Types in a Lower Kinded Language - Functional Programming in Kotlin
GOTO Conferences via YouTube
It's Not Hard to Test Smart - Delivering Customer Value Faster
GOTO Conferences via YouTube