Do We Know Our Data, as Good as We Know Our Tools?
Offered By: Devoxx via YouTube
Course Description
Overview
Explore the critical importance of understanding and preparing data before model training in this 51-minute conference talk from Devoxx. Delve into common problems encountered during data analysis and preparation, including dirty data, disparate datasets requiring normalization, and information overload. Learn various techniques to address these issues, such as detecting misleading data and outliers, handling missing or ambiguous values, and applying dimensionality reduction. Discover how to use statistical and physics functions, feature selection, and resampling methods to enhance data quality. Gain insights into utilizing different types of plots at various stages of the data preparation process. Walk away with valuable resources to further explore data analysis and preparation techniques at your own pace, equipping yourself with essential skills for transitioning from developer to data scientist.
Syllabus
Do we know our data, as good as we know our tools? by Mani Sarkar & Jeremie Charlet
Taught by
Devoxx
Related Courses
Social Network AnalysisUniversity of Michigan via Coursera Intro to Algorithms
Udacity Data Analysis
Johns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Health in Numbers: Quantitative Methods in Clinical & Public Health Research
Harvard University via edX