YoVDO

Entity Resolution at Scale

Offered By: GOTO Conferences via YouTube

Tags

GOTO Conferences Courses
Apache Spark Courses
Data Cleaning Courses
Software Engineering Courses
Data Engineering Courses

Course Description

Overview

Explore entity resolution techniques for large-scale data cleaning in this 23-minute conference talk from YOW! 2019. Software Engineer Huon Wilson of CSIRO's Data61 describes the challenge of connecting duplicate and corrupted records to the single underlying entity they represent, and shares the solutions and lessons learned from implementing entity resolution on Apache Spark to process billions of records. The talk covers overcoming real-world data problems, scaling data cleaning processes, and improving data quality for more effective analysis and decision-making.

Syllabus

Entity Resolution at Scale • Huon Wilson • YOW! 2019


Taught by

GOTO Conferences

Related Courses

Data Wrangling with MongoDB
MongoDB via Udacity
Getting and Cleaning Data
Johns Hopkins University via Coursera
软件包在流行病学研究中的应用 Using software apps in epidemiological research
Peking University via Coursera
Creating an Analytical Dataset
Udacity
Implementing ETL with SQL Server Integration Services
Microsoft via edX