YoVDO

No, Maybe and Close Enough - Using Probabilistic Data Structures in Python

Offered By: PyCon US via YouTube

Tags

PyCon US Courses
Python Courses
Data Processing Courses
Bloom Filters Courses
Probabilistic Data Structures Courses

Course Description

Overview

Explore probabilistic data structures in Python for handling large-scale data efficiently in this PyCon US talk. Discover how to count the distinct items in a data firehose and how to determine whether an item has been seen before, trading a small, bounded loss of accuracy for speed and low memory use. Learn how HyperLogLog and the Bloom filter work at a high level and how to apply them in Python. Gain insight into scenarios where exact answers are impractical and see how these structures provide fast, scalable solutions for problems such as counting social media likes or tracking user interactions on websites. Access the accompanying GitHub repository and slides for hands-on examples and further study.
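To make the "no, maybe, and close enough" idea concrete, here is a minimal sketch of both structures using only the Python standard library. It is not the code from the talk or its repository; the class names, the default parameters, and the choice of SHA-256 as the hash function are all assumptions made for illustration. The Bloom filter answers "definitely not seen" or "probably seen", and the HyperLogLog returns an approximate distinct count.

```python
import hashlib
import math


def _hash64(item: str) -> tuple[int, int]:
    """Return two 64-bit integers derived from one SHA-256 digest (an illustrative choice)."""
    digest = hashlib.sha256(item.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big"), int.from_bytes(digest[8:16], "big")


class BloomFilter:
    """Hypothetical minimal Bloom filter: no false negatives, some false positives."""

    def __init__(self, expected_items: int, false_positive_rate: float = 0.01):
        # Standard sizing: m = -n ln(p) / (ln 2)^2 bits, k = (m / n) ln 2 hash functions.
        ln2 = math.log(2)
        self.num_bits = max(1, math.ceil(-expected_items * math.log(false_positive_rate) / ln2**2))
        self.num_hashes = max(1, round(self.num_bits / expected_items * ln2))
        self.bits = bytearray((self.num_bits + 7) // 8)

    def _positions(self, item: str):
        # Double hashing: derive k bit positions from two base hashes.
        h1, h2 = _hash64(item)
        h2 |= 1  # keep the step odd so it is never zero
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item: str) -> bool:
        # False means "definitely not seen"; True means "probably seen".
        return all(self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(item))


class HyperLogLog:
    """Hypothetical minimal HyperLogLog: approximate distinct count in fixed memory."""

    def __init__(self, precision: int = 12):
        self.p = precision
        self.m = 1 << precision          # number of registers
        self.registers = [0] * self.m

    def add(self, item: str) -> None:
        h, _ = _hash64(item)
        index = h >> (64 - self.p)                   # first p bits pick a register
        rest = h & ((1 << (64 - self.p)) - 1)        # remaining bits
        # Rank = position of the leftmost 1 bit in the remaining bits (1-based).
        rank = (64 - self.p) - rest.bit_length() + 1
        self.registers[index] = max(self.registers[index], rank)

    def count(self) -> int:
        alpha = 0.7213 / (1 + 1.079 / self.m)        # bias constant, valid for m >= 128
        estimate = alpha * self.m**2 / sum(2.0**-r for r in self.registers)
        zeros = self.registers.count(0)
        if estimate <= 2.5 * self.m and zeros:
            # Small-range correction: fall back to linear counting.
            estimate = self.m * math.log(self.m / zeros)
        return round(estimate)
```

A short usage sketch: create `BloomFilter(expected_items=1000)`, call `add("user-1")`, and test membership with `"user-1" in bf`. For counting, call `add` on a `HyperLogLog()` for every item in the stream and read `count()` at any time. With the assumed precision of 12 the counter keeps 4,096 small registers regardless of stream size, and its typical relative error is on the order of a couple of percent.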

Syllabus

Introduction
The Problem
Probabilistic Data Structures
HyperLogLog
HyperLogLog Algorithm
HyperLogLog Example
Bloom Filter
Python Code
When to Use


Taught by

PyCon US

Related Courses

Bootiful Multi Model Applications with Redis Stack
Devoxx via YouTube
Understanding Probabilistic Data Structures with UFO Sightings
NDC Conferences via YouTube
Hyper Speed! When Big Data Blooms
NDC Conferences via YouTube
Count-min Sketch to Infinity - Using Probabilistic Data Structures to Solve Counting Problems in .NET
NDC Conferences via YouTube