Processing Trillions of Records with Mini Serverless Databases at Okta
Offered By: Data Council via YouTube
Course Description
Overview
Explore serverless DuckDBs and their role in processing trillions of records at Okta in this 27-minute Data Council video. Delve into data preprocessing intricacies, operational metadata harvesting, and the optimization of batch ETL, streaming, SQL flexibility, S3 durability, and serverless scalability for cost-efficient environments. Learn how Okta's next-gen security platform leverages these technologies and gain insights into pressing questions about Kafka's necessity, batch ETL latencies, and ELT complexity. Discover the capabilities and potential applications of serverless DuckDB as Jake Thomas, Manager of Data Foundations at Okta, shares his expertise on this cutting-edge approach to handling massive data volumes.
Syllabus
Introduction
Solution
Results
Future Challenges
Taught by
Data Council
Related Courses
Building Batch Data Pipelines on GCP auf DeutschGoogle Cloud via Coursera Building Batch Data Pipelines on GCP en Français
Google Cloud via Coursera Mastering Azure Data Factory: From Basics to Advanced Level
Udemy Data Science de A a Z - Extraçao e Exibição dos Dados
Udemy Building Batch Data Processing Solutions in Microsoft Azure
Pluralsight