YoVDO

ByteDance's 100 Billion File HDFS Clustering Practice

Offered By: The ASF via YouTube

Tags

HDFS Courses Distributed File Systems Courses Data Replication Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore ByteDance's innovative approach to managing a massive 100 billion file HDFS cluster in this insightful conference talk. Delve into the challenges faced by Apache HDFS in the era of expanding big data technology and increasing data scale complexity. Learn how ByteDance's big data storage team has evolved HDFS to support diverse application scenarios, including traditional Hadoop warehouse services, storage separation architecture for computing engines, and machine learning model training. Discover the implementation of cross-regional storage scheduling capabilities, integrated user-side caching, regular triple copy mechanisms, and intelligent hot and cold data management. Gain valuable insights into ByteDance's understanding of emerging storage requirements and their technical solutions for enhancing stability, efficiency, and adaptability in large-scale data environments.

Syllabus

Bytedance 100 Billion File Hdfs Clustering Practice


Taught by

The ASF

Related Courses

Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera
Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Advanced Big Data Systems | 高级大数据系统
Tsinghua University via edX
数据科学 | Data Science
Tsinghua University via edX
Windows Server Administration Concepts: Storage
Pluralsight