How Companies Are Using Tachyon, a Memory Centric Distributed Storage
Offered By: Open Data Science via YouTube
Course Description
Overview
Explore the Tachyon distributed storage system in this ODSC West 2015 conference talk by Haoyuan Li. Learn how memory-centric storage addresses big data processing bottlenecks and enables reliable file sharing at memory speed across cluster frameworks like Apache Spark, MapReduce, and Flink. Discover Tachyon's key features, including Hadoop compatibility, fault tolerance, and its role as the default off-heap option in Spark. Gain insights into real-world use cases from companies running Tachyon in production environments. Delve into topics such as star use cases, SAS and Spark implementations, SSD integration, new features, common misconceptions, configuration options, policies, transparent naming, and unified namespace. Understand how Tachyon fits into the Berkeley Data Analytics Stack and how widely it has been adopted across institutions. Conclude with information on how to get involved in this open-source project for distributed storage in big data processing.
Syllabus
Introduction
Star Use Case
SAS Use Case
Spark Use Case
SSD Use Case
New Features
Common Misconceptions
Configuration Options
Policies
Transparent Naming
Unified Namespace
Additional Features
How to get involved
Taught by
Open Data Science
Related Courses
Intro to Data Science (Udacity)
Mining Massive Datasets (Stanford University via edX)
Cloud Computing Concepts, Part 1 (University of Illinois at Urbana-Champaign via Coursera)
Data Manipulation at Scale: Systems and Algorithms (University of Washington via Coursera)
Big Data Analytics in Healthcare (Georgia Institute of Technology via edX)