YoVDO

Architecting Big Data Applications: Batch Mode Application Engineering

Offered By: LinkedIn Learning

Tags

Big Data Courses Data Engineering Courses Distributed Computing Courses Batch Processing Courses Horizontal Scaling Courses

Course Description

Overview

Develop the skills required to architect and manage batch processing applications to generate consistent data-driven results.

Syllabus

Introduction
  • Architecting big data applications
1. Introduction to Big Data Applications
  • Characteristics of batch processing
  • Challenges building batch applications
  • Technologies for batch big data engineering
  • Use cases for batch big data
  • Architecture process for data engineering
2. Big Data Architecture Principles
  • Making the choice: Real-time vs. batch
  • Horizontal scaling
  • Distributed processing
  • Technology selection
  • Technology integrations
3. Batch Application Architecture Principles
  • Schedule selection
  • Minimizing data volumes
  • Uniform load distribution
  • Using caches
  • Reprocessing
4. Use Case 1: Audit Trail Data Archive
  • Audit trail: Define the problem
  • Audit trail: Study requirements
  • Audit trail: Create a workflow
  • Audit trail: Scale the workflow
  • Audit trail: Select technologies
  • Audit trail: Review final architecture
5. Use Case 2: Advertising Analytics
  • Advertising analytics: Define the problem
  • Advertising analytics: Study requirements
  • Advertising analytics: Create a workflow
  • Advertising analytics: Scale the workflow
  • Advertising analytics: Select technologies
  • Advertising analytics: Review final architecture
6. Use Case 3: Product Recommendations
  • Product recommendations: Define the problem
  • Product recommendations: Study requirements
  • Product recommendations: Create a workflow
  • Product recommendations: Scale the workflow
  • Product recommendations: Select technologies
  • Product recommendations: Review the final architecture
Conclusion
  • Continuing to architect big data applications

Taught by

Kumaran Ponnambalam

Related Courses

内存数据库管理
openHPI
CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Processing Big Data with Azure Data Lake Analytics
Microsoft via edX
Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera
Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera