Entity Resolution at Scale
Offered By: GOTO Conferences via YouTube
Course Description
Overview
Explore entity resolution techniques for large-scale data cleaning in this 23-minute conference talk from YOW! 2019. Discover how Software Engineer Huon Wilson from CSIRO's Data61 approaches the challenge of connecting duplicate and corrupted records to their single underlying entity. Learn about the solutions and lessons gained from implementing entity resolution on Apache Spark to process billions of records. Gain insights into overcoming real-world data challenges, scaling data cleaning processes, and improving data quality for more effective analysis and decision-making.
Syllabus
Entity Resolution at Scale • Huon Wilson • YOW! 2019
Taught by
GOTO Conferences
Related Courses
Data Wrangling with MongoDBMongoDB via Udacity Getting and Cleaning Data
Johns Hopkins University via Coursera 软件包在流行病学研究中的应用 Using software apps in epidemiological research
Peking University via Coursera Creating an Analytical Dataset
Udacity Implementing ETL with SQL Server Integration Services
Microsoft via edX