YoVDO

Exploring Google Ngrams with Amazon EMR and Hive

Offered By: Amazon Web Services via AWS Skill Builder

Tags

Amazon EMR Courses Big Data Courses Data Analysis Courses Amazon S3 Courses Data Normalization Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Languages Available: Español (Latinoamérica) | Español (España) | Français | Bahasa Indonesia | Italiano | 日本語 | 한국어 | Português (Brasil) | 中文(简体)

This lab demonstrates how to launch an Amazon Elastic MapReduce (EMR) cluster for Big Data processing and use Hive with SQL-style queries to analyze data. You will create a Hadoop cluster using Amazon EMR which will allow to run interactive Hive queries against data stored in Amazon S3. You will use Hive to normalize the data in a more useful way, and you will run queries to analyze the data.


Level

Advanced


Duration

1 Hours 15 Minutes


Course Objectives

In this course, you will learn how to:

  • Create an Amazon EMR cluster running Hive
  • Use Hive statements to create tables from Google Ngram input data stored in Amazon S3
  • Run Hive queries to drill-down and analyze data


Intended Audience

This course is intended for:

  • Architects
  • Data Engineers


Prerequisites

We recommend that attendees of this course have the following prerequisites:

  • None


Course Outline

  • Task 1: Launch an Amazon EMR cluster
  • Task 2: Connect to Your Cluster
  • Task 3: Analyze Data

Tags

Related Courses

Introduction to Amazon Elastic MapReduce (EMR)
Pluralsight
Creando tu plataforma de BIG DATA con EMR
Coursera Project Network via Coursera
Data Analytics Learning Plan
Amazon Web Services via AWS Skill Builder
Getting Started with Amazon EMR
Amazon Web Services via AWS Skill Builder
Introduction to Amazon Elastic MapReduce (EMR) (Italian)
Amazon Web Services via AWS Skill Builder