YoVDO

Processing Data at Scale with Rails

Offered By: Ruby Central via YouTube

Tags

Conference Talks Courses Ruby on Rails Courses PostgreSQL Courses Data Storage Courses Data Transformation Courses Data Processing Courses Data Pipelines Courses Data Ingestion Courses ActiveRecord Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore strategies for processing large-scale data using Ruby on Rails in this 35-minute conference talk. Learn how to handle massive datasets from various sources, including public records, shared resources, and IoT data. Discover techniques for ingesting, transforming, and storing data efficiently using familiar tools like ActiveRecord, Sidekiq, and Postgres. Gain insights into breaking down large data jobs into manageable tasks, turning data firehoses into structured pipelines. Dive into topics such as ETL processes, pipeline options, and practical examples using lobby data. Understand the importance of scraping, Redis usage, fanning out processes, load jobs, unique indexing, and data normalization. Address scaling concerns and explore opportunities for optimizing data processing in Rails applications.

Syllabus

Introduction
Data Pipelines
Use Cases
What is ETL
Pipeline options
Example
Lobby Focus
Familiar tools
Scrape
Redis
Fanning out
Fanning out view
Load job
Unique index
Normalizing
Stuff Changes
Will this scale
Options
Opportunities


Taught by

Ruby Central

Related Courses

Software Engineering for SaaS
University of California, Berkeley via Coursera
Android. Programación de Aplicaciones
Miríadax
Informatik für Ökonomen
University of Zurich via Coursera
Aprendizaje en la Nube, Herramientas web en el Aula
Galileo University via Independent
Desarrollo de Aplicaciones para Android
Galileo University via Independent