Data Cleansing With SQL And R
Offered By: NDC Conferences via YouTube
Course Description
Overview
Learn essential data cleansing and preparation techniques using SQL Server and R in this comprehensive conference talk. Explore the concept of tidy data and discover how to simplify research and analysis of a small but realistic data set. Dive into various aspects of dirty data, including consistency, incompleteness, accuracy, and duplicate results. Gain insights into normalization, Boyce Codd normal form, data types, and key constraints. Follow along with practical demonstrations on mapping tables, notebooks, and entity-attribute values. Acquire valuable skills to efficiently handle the time-consuming task of data preparation, which often consumes up to 80% of a data scientist's project time.
Syllabus
Intro
About Kevin
What is dirty data
General philosophy
Data quality services
Bonus
Dirty Data
Data Consistency
Incomplete Data
Data Accuracy
Duplicate Results
Rules Of Thumb
Normalization
Boyce Codd
Data Types
Key Constraints
Demo
Mapping Tables
Notebooks
Entity Attribute Values
Analysis
Taught by
NDC Conferences
Related Courses
Getting and Cleaning DataJohns Hopkins University via Coursera Reshaping Data with tidyr
DataCamp Cleaning Bad Data in R
LinkedIn Learning Data Wrangling in R (2017)
LinkedIn Learning R Data Pre-Processing & Data Management - Shape your Data!
Udemy