YoVDO

ByteDance's 100 Billion File HDFS Clustering Practice

Offered By: The ASF via YouTube

Tags

HDFS Courses Distributed File Systems Courses Data Replication Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore ByteDance's innovative approach to managing a massive 100 billion file HDFS cluster in this insightful conference talk. Delve into the challenges faced by Apache HDFS in the era of expanding big data technology and increasing data scale complexity. Learn how ByteDance's big data storage team has evolved HDFS to support diverse application scenarios, including traditional Hadoop warehouse services, storage separation architecture for computing engines, and machine learning model training. Discover the implementation of cross-regional storage scheduling capabilities, integrated user-side caching, regular triple copy mechanisms, and intelligent hot and cold data management. Gain valuable insights into ByteDance's understanding of emerging storage requirements and their technical solutions for enhancing stability, efficiency, and adaptability in large-scale data environments.

Syllabus

Bytedance 100 Billion File Hdfs Clustering Practice


Taught by

The ASF

Related Courses

Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera
Créez votre Data Lake
CentraleSupélec via OpenClassrooms
Big data Internship Program - Foundation
Udemy
Learning Hadoop
LinkedIn Learning
Azure Synapse SQL Pool - Implement Polybase
Coursera Project Network via Coursera