Do We Know Our Data, as Good as We Know Our Tools?
Offered By: Devoxx via YouTube
Course Description
Overview
Explore the critical importance of understanding and preparing data before model training in this 51-minute conference talk from Devoxx. Delve into common problems encountered during data analysis and preparation, including dirty data, disparate datasets requiring normalization, and information overload. Learn various techniques to address these issues, such as detecting misleading data and outliers, handling missing or ambiguous values, and applying dimensionality reduction. Discover how to use statistical and physics functions, feature selection, and resampling methods to enhance data quality. Gain insights into utilizing different types of plots at various stages of the data preparation process. Walk away with valuable resources to further explore data analysis and preparation techniques at your own pace, equipping yourself with essential skills for transitioning from developer to data scientist.
Syllabus
Do we know our data, as good as we know our tools? by Mani Sarkar & Jeremie Charlet
Taught by
Devoxx
Related Courses
Data Wrangling with MongoDB: MongoDB via Udacity
Getting and Cleaning Data: Johns Hopkins University via Coursera
Using software apps in epidemiological research (软件包在流行病学研究中的应用): Peking University via Coursera
Creating an Analytical Dataset: Udacity
Implementing ETL with SQL Server Integration Services: Microsoft via edX