Survival Analysis with TCGA Data in R - Create Kaplan-Meier Curves
Offered By: Bioinformagician via YouTube
Course Description
Overview
Syllabus
Intro
Intuition behind survival analysis
Why do we perform survival analysis?
What is Censoring and why is it important?
What is considered as an event?
Methods for survival analysis
How to read a Kaplan-Meier curve?
Question to answer using survival analysis
3 things required for survival analysis
Download clinical data from GDC portal
Getting status information and censoring data
Set up an “overall survival” i.e. time for each patient in the cohort
For event/strata information for each patient, fetch gene expression data from GDC portal
Build query using GDCquery
Download data using GDCdownload
Extract counts using GDCprepare
Perform Variance Stabilization Transformation vst on counts before further analysis
Wrangle data to get the relevant data and data in the right shape
Approaches to divide cohort into 2 groups based on expression
Bifurcating patients into low and high TP53 expression groups
Define strata for each patient
Compute a survival curve using survfit and creating a Kaplan-Meier curve using ggsruvplot
survfit vs survdiff
Taught by
bioinformagician
Related Courses
Network Analysis in Systems BiologyIcahn School of Medicine at Mount Sinai via Coursera Molecular Dynamics for Computational Discoveries in Science
University of Massachusetts Boston via Independent Biology Meets Programming: Bioinformatics for Beginners
University of California, San Diego via Coursera Python for Informatics: Exploring Information
Open Education by Blackboard Genomic Medicine Gets Personal
Georgetown University via edX