Scale R to Big Data with Hadoop & Spark
Offered By: Data Science Dojo via YouTube
Course Description
Overview
Learn how to scale R for big data processing using Hadoop and Spark in this 1-hour 10-minute tutorial. Set up a Spark cluster with R installed, wrangle data stored in HDFS using R, and build and deploy machine learning models on large datasets. Discover how to utilize Microsoft R Server to enable distributed computing in R, run native R code via SSH, and set up RStudio server on a cluster. Explore techniques for data manipulation in HDFS, model building on large-scale data, and deploying models to elastically scaled web services for predictions and insights. Gain practical skills to overcome R's traditional limitations with big data and leverage its capabilities throughout the entire data science workflow.
Syllabus
Scale R to Big Data with Hadoop & Spark
Taught by
Data Science Dojo
Related Courses
Excel 2010Miríadax Intro to Data Science
Udacity Data Manipulation at Scale: Systems and Algorithms
University of Washington via Coursera Statistical Computing with R - a gentle introduction
University College London via Independent Introducción a Data Science: Programación Estadística con R
Universidad Nacional Autónoma de México via Coursera