YoVDO

Machine Learning in R - Repurpose Machine Learning Code for New Data

Offered By: Data Professor via YouTube

Tags

Machine Learning Courses Data Visualization Courses R Programming Courses RStudio Courses Data Exploration Courses Classification Models Courses

Course Description

Overview

Learn how to repurpose machine learning code in R for new datasets in this comprehensive tutorial video. Explore the process of adapting existing R code to model a new dataset, specifically focusing on the DHFR (dihydrofolate reductase) data. Follow along as the instructor guides you through launching RStudio, loading and exploring the DHFR dataset, performing summary statistics, creating visualizations, and building a classification model. Gain practical skills in data understanding, feature analysis, and model building while learning how to effectively reuse and modify R code for different datasets.

Syllabus

Launch RStudio or RStudio.cloud
Open iris-data-understanding.R file
Create a copy of iris-data-understanding.R
Save as dhfr-data-understanding.R
What is DHFR?
Load in DHFR data, type: librarydatasets and then datadhfr
Perform summary statistics
Use skimr package to explore the data
Make a scatter plot
Make a histogram
Make feature plots
Let's build the DHFR classification model
Load in the libraries
Set the seed for reproducibility
Build the training and CV models
Let's look at prediction results
Let's make Feature importance plots


Taught by

Data Professor

Related Courses

The Data Scientist’s Toolbox
Johns Hopkins University via Coursera
Linear Regression and Modeling
Duke University via Coursera
Reproducible Templates for Analysis and Dissemination
Emory University via Coursera
Data Science: Productivity Tools
Harvard University via edX
Анализ данных в R
Bioinformatics Institute via Stepik