Processing Trillions of Records with Mini Serverless Databases at Okta

Offered By: Data Council via YouTube

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Explore serverless DuckDBs and their role in processing trillions of records at Okta in this 27-minute Data Council video. Delve into data preprocessing intricacies, operational metadata harvesting, and the optimization of batch ETL, streaming, SQL flexibility, S3 durability, and serverless scalability for cost-efficient environments. Learn how Okta's next-gen security platform leverages these technologies and gain insights into pressing questions about Kafka's necessity, batch ETL latencies, and ELT complexity. Discover the capabilities and potential applications of serverless DuckDB as Jake Thomas, Manager of Data Foundations at Okta, shares his expertise on this cutting-edge approach to handling massive data volumes.

Syllabus

Introduction
Solution
Results
Future Challenges

Taught by

Data Council

Related Courses

DuckDB - High-Performance SQL Queries on Pandas Dataframe - Python
Samuel Chan via YouTube New Feature - What the Fibers Extension Can Do for You
International PHP Conference via YouTube Getting Started with PHP-FFI - Introduction to Foreign Function Interface
International PHP Conference via YouTube DuckDB - Bringing Analytical SQL Directly to Your Python Shell
EuroPython Conference via YouTube OLAP on Cassandra Data with Arrow, Flight SQL, ADBC, and DuckDB
Linux Foundation via YouTube