ByteDance's 100 Billion File HDFS Clustering Practice
Offered By: The ASF via YouTube
Course Description
Overview
Explore ByteDance's innovative approach to managing a massive 100 billion file HDFS cluster in this insightful conference talk. Delve into the challenges faced by Apache HDFS in the era of expanding big data technology and increasing data scale complexity. Learn how ByteDance's big data storage team has evolved HDFS to support diverse application scenarios, including traditional Hadoop warehouse services, storage separation architecture for computing engines, and machine learning model training. Discover the implementation of cross-regional storage scheduling capabilities, integrated user-side caching, regular triple copy mechanisms, and intelligent hot and cold data management. Gain valuable insights into ByteDance's understanding of emerging storage requirements and their technical solutions for enhancing stability, efficiency, and adaptability in large-scale data environments.
Syllabus
Bytedance 100 Billion File Hdfs Clustering Practice
Taught by
The ASF
Related Courses
Big Data Essentials: HDFS, MapReduce and Spark RDDYandex via Coursera Créez votre Data Lake
CentraleSupélec via OpenClassrooms Big data Internship Program - Foundation
Udemy Learning Hadoop
LinkedIn Learning Azure Synapse SQL Pool - Implement Polybase
Coursera Project Network via Coursera