Survival Analysis with TCGA Data in R - Create Kaplan-Meier Curves
Offered By: Bioinformagician via YouTube
Course Description
Overview
Syllabus
Intro
Intuition behind survival analysis
Why do we perform survival analysis?
What is Censoring and why is it important?
What is considered as an event?
Methods for survival analysis
How to read a Kaplan-Meier curve?
Question to answer using survival analysis
3 things required for survival analysis
Download clinical data from GDC portal
Getting status information and censoring data
Set up an “overall survival” i.e. time for each patient in the cohort
For event/strata information for each patient, fetch gene expression data from GDC portal
Build query using GDCquery
Download data using GDCdownload
Extract counts using GDCprepare
Perform Variance Stabilization Transformation vst on counts before further analysis
Wrangle data to get the relevant data and data in the right shape
Approaches to divide cohort into 2 groups based on expression
Bifurcating patients into low and high TP53 expression groups
Define strata for each patient
Compute a survival curve using survfit and creating a Kaplan-Meier curve using ggsruvplot
survfit vs survdiff
Taught by
bioinformagician
Related Courses
Social Network AnalysisUniversity of Michigan via Coursera Intro to Algorithms
Udacity Data Analysis
Johns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Health in Numbers: Quantitative Methods in Clinical & Public Health Research
Harvard University via edX