Feature Engineering & Interpretability for XGBoost with Board Game Ratings
Offered By: Julia Silge via YouTube
Course Description
Overview
Explore advanced modeling techniques using #TidyTuesday data on board game ratings in this 48-minute screencast. Dive into custom feature engineering, xgboost tuning, and explainability methods. Learn about data overview, average distribution, preprocessing, custom tokenizing, and string squishing. Delve into regression, tuning, and result analysis. Discover how to plot function games, find the best game, and work with testing sets. Gain insights into explainability tools, including parsnip fit, model importance, and dependency partial plots. Examine min age plots and create summaries to enhance your understanding of feature engineering and interpretability in xgboost models.
Syllabus
Introduction
Data overview
Average distribution
Modeling
Preprocessing
Custom Tokenizing
String Squish
Regression
Tuning
Results
Plotting function game
Finding the best game
Last fit
Testing set
Explainability tools
parsnip fit
model importance
shop
model
other arguments
matrix
making plots
dependency partial plot
min age plot
summary
Taught by
Julia Silge
Related Courses
Genomic Data Science and Clustering (Bioinformatics V)University of California, San Diego via Coursera 用Python玩转数据 Data Processing Using Python
Nanjing University via Coursera Data Mining Project
University of Illinois at Urbana-Champaign via Coursera Advanced Business Analytics Capstone
University of Colorado Boulder via Coursera Data Mining: Theories and Algorithms for Tackling Big Data | 数据挖掘:理论与算法
Tsinghua University via edX