Tips and Tricks for Robust Big Data Applications
Offered By: NDC Conferences via YouTube
Course Description
Overview
Discover essential strategies for building robust big data applications in this conference talk from NDC Sydney 2020. Learn how to optimize data storage using columnar files to reduce costs and enhance performance. Explore techniques for ensuring data quality through unit testing and quality assurance measures. Understand the importance of data governance and tracking, including how to maintain dataset lineage for auditing and compliance purposes. Gain insights into useful tools for automation and collaboration in big data projects. Master key concepts such as metadata management, skew prevention, and the significance of thorough data understanding. Equip yourself with practical knowledge to tackle the challenges of modern big data applications and stay relevant in an ever-evolving technological landscape.
Syllabus
Intro
Brief History
Agenda
Know your data and storage format
Metadata matters with slow file system
Watch out for skewness
Guard against bad data
Governance
Invest in tooling
Taught by
NDC Conferences
Related Courses
内存数据库管理openHPI CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX Processing Big Data with Azure Data Lake Analytics
Microsoft via edX Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera