Hadoop Essentials for the SQL Server Professional
Offered By: PASS Data Community Summit via YouTube
Course Description
Overview
Explore Hadoop essentials tailored for SQL Server professionals in this 59-minute conference talk from PASS Data Community Summit. Delve into the history and definition of Hadoop, understanding key components like HDFS, MapReduce, and the Hadoop Ecosystem. Learn about on-premises and cloud options, and discover Hive and HiveQL, comparing them to SQL Server. Watch a practical demo, gain insights into Hadoop ecosystem tools, and address common questions about report development, indexing, and available resources. Equip yourself with the knowledge to bridge the gap between traditional SQL Server environments and the world of big data processing with Hadoop.
Syllabus
Introduction
Questions
Sponsors
Pass Summit
Pass Community
Speaker Introduction
Agenda
History of Hadoop
Definition of Hadoop
HDFS
Replication
Fault Tolerance
Name Node
MapReduce
Application Master
Microprocessing
MapReduce analogy
Hadoop Ecosystem
Onpremises Options
Cloud Options
Hive
What is Hive
HiveQL
Hive vs Sequel Server
HiveQL compatibility
MapReduce and Tez
Demo Overview
Demo Start
Hadoop Ecosystem Tool
Summary
One box vs cluster
Can I develop reports in SSRS
Is there a notion of a unique index
What resources are available
Wrap up
Taught by
PASS Data Community Summit
Related Courses
Intro to Hadoop and MapReduceCloudera via Udacity Processing Big Data with Hadoop in Azure HDInsight
Microsoft via edX Implementing Real-Time Analytics with Hadoop in Azure HDInsight
Microsoft via edX Hadoop Platform and Application Framework
University of California, San Diego via Coursera Data Manipulation at Scale: Systems and Algorithms
University of Washington via Coursera