ByteDance's 100 Billion File HDFS Clustering Practice
Offered By: The ASF via YouTube
Course Description
Overview
Explore how ByteDance manages an HDFS cluster holding 100 billion files in this conference talk. Examine the challenges Apache HDFS faces as big data technology expands and data grows in both scale and complexity. Learn how ByteDance's big data storage team has evolved HDFS to support diverse application scenarios, including traditional Hadoop data warehouse services, storage-compute separation architectures for compute engines, and machine learning model training. Discover the implementation of cross-regional storage scheduling, integrated user-side caching, conventional three-replica storage, and intelligent hot and cold data management. Gain insight into ByteDance's understanding of emerging storage requirements and its technical solutions for improving stability, efficiency, and adaptability in large-scale data environments.
Syllabus
ByteDance's 100 Billion File HDFS Clustering Practice
Taught by
The ASF
Related Courses
Cloud Computing Concepts: Part 2 (University of Illinois at Urbana-Champaign via Coursera)
Big Data Essentials: HDFS, MapReduce and Spark RDD (Yandex via Coursera)
Advanced Big Data Systems | 高级大数据系统 (Tsinghua University via edX)
数据科学 | Data Science (Tsinghua University via edX)
Windows Server Administration Concepts: Storage (Pluralsight)