YoVDO

Baseball Data Wrangling with Vagrant, R, and Retrosheet

Offered By: Udemy

Tags

Data Analysis Courses, R Programming Courses, Vagrant Courses

Course Description

Overview

Analytics with the Chadwick tools, dplyr, and ggplot.

What you'll learn:
  • install VirtualBox and Vagrant
  • run a virtual Linux machine
  • install the Chadwick software tools
  • extract game and play-by-play baseball data from Retrosheet files
  • produce graphs with ggplot

This course is for anyone interested in doing baseball analytics with Retrosheet's game-by-game and play-by-play data. The main tools for working with this data are provided by the Chadwick software. We set up a virtual Linux machine and install the Chadwick software on it. We then learn how to extract baseball data with Chadwick, how to filter the data further with dplyr in R, and how to plot the results with ggplot.
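
To give a feel for the filter-then-summarize step described above, here is a minimal sketch. It is written in Python rather than the course's R and dplyr, and it is not taken from the course materials. The sample rows are made up, and the column names (GAME_ID, BAT_ID, EVENT_CD) and the use of event code 23 for a home run are assumptions about what extracted play-by-play output might look like.

```python
import csv
import io
from collections import Counter

# Made-up sample rows standing in for extracted play-by-play data.
# The column names and the home-run event code are assumptions for
# illustration, not output copied from the Chadwick tools.
SAMPLE_CSV = """GAME_ID,BAT_ID,EVENT_CD
GAME001,player01,23
GAME001,player02,2
GAME001,player01,20
GAME002,player01,23
GAME002,player02,23
"""

HOME_RUN_CODE = "23"  # assumed event code for a home run


def home_runs_by_batter(csv_text):
    """Keep only home-run rows, then count them per batter."""
    rows = csv.DictReader(io.StringIO(csv_text))
    return Counter(
        row["BAT_ID"] for row in rows if row["EVENT_CD"] == HOME_RUN_CODE
    )


counts = home_runs_by_batter(SAMPLE_CSV)
for batter, n in counts.most_common():
    print(batter, n)
```

In dplyr the same idea would be a filter on the event code followed by a grouped count, and the resulting table is the kind of summary one would then hand to ggplot.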

The first part of the course, in which we install the virtual Linux machine and learn how to work with the Chadwick software, has no prerequisites. The second part requires knowledge of dplyr, which can be obtained through my course "Baseball Database Queries with SQL and dplyr".

At a relaxed pace, the course should take two to three weeks to complete.


Taught by

Charles Redmond
