SQL for Exploratory Data Analysis Essential Training
Offered By: LinkedIn Learning
Course Description
Overview
Learn how to use SQL to understand the characteristics of data sets destined for data science and machine learning.
Syllabus
Introduction
- Welcome
- What you should know
- Why explore data?
- Exploring data with statistics
- Testing hypothesis with statistics
- Why check data?
- Types of quality checks
- Imputing missing values
- Identifying business logic checks
- Why learn about the distribution of data?
- Minimum, maximum, and median values
- Ordering and counting
- Calculating quartiles
- Introduction to box plots
- Introduction to histograms
- Partitioning data
- Calculating histograms
- Simple histogram visualization
- Introduction to correlation
- Calculating correlation with SQL
- Next steps
Taught by
Dan Sullivan
Related Courses
Introduction to DatabasesMeta via Coursera Web Development
Udacity Introduction to Data Science
University of Washington via Coursera Datenmanagement mit SQL
openHPI Sabermetrics 101: Introduction to Baseball Analytics
Boston University via edX