Data Quality Round-trip in the MS Dataplatform
Offered By: PASS Data Community Summit via YouTube
Course Description
Overview
Explore a comprehensive data quality solution using Microsoft's Data Platform in this conference talk from PASS Summit 2019. Learn how to leverage Machine Learning Services, Common Data Model, Azure Data Factory, SQL Server 2019, and more to address data quality challenges. Discover the costs associated with poor data quality and understand the reasons behind it. Dive into the Data Quality Process and explore Microsoft's portfolio for data governance and ingestion. Gain insights into profiling techniques using Excel, Machine Learning SDK, Azure ML Data Prep, and Python. Examine fuzzy matching concepts and explore tools like Azure Data Cutter and Common Data Model to enhance your data quality initiatives.
Syllabus
Intro
About us
Bad data
Theory
Cost of Data Quality
Do things 100
Reasons for bad data
Data Quality Process
Data Platform Landscape
Governance Layer
Ingest Layer
Microsofts portfolio
DQS
Data Quality Matrix
Profiling
Profiling in Excel
Machine Learning SDK
Azure ML Data Prep
Python Profiling
FuzzyWuzzy
FuzzyGrouping
Console
Azure Data Cutter
Tags
Common Data Model
Taught by
PASS Data Community Summit
Related Courses
Business Intelligence for ConsultantsLinkedIn Learning Profile Your Data with Power BI
Pluralsight Data Science Foundations: Data Engineering
LinkedIn Learning Machine Learning for Data Analysis: Data Profiling & QA
Udemy Beginners Introduction to SQL and Database Part II
Udemy