Large Language Models for Computational Biology - A Primer
Offered By: Computational Genomics Summer Institute CGSI via YouTube
Course Description
Overview
Explore the intersection of large language models and computational biology in this 49-minute lecture by Jian Ma at the Computational Genomics Summer Institute (CGSI). Delve into the recent history of large language models and foundation models, understanding their architecture and applications in genomics. Learn about the Transformer model, self-attention mechanisms, and various tokenization techniques specific to biological sequences. Discover specialized models like DNA Bird 2, nucleotide Transformer, SD Bert Model, and SCGPT, designed for genomic data analysis. Examine the concept of generative pretraining and its relevance to computational biology. Conclude with a discussion on open questions and a summary of the potential impact of these technologies on genomic research.
Syllabus
Introduction
Antonio Van Lovin Hook
CGSI YouTube
Presentation Preparation
Recent History
Large Language Models
Foundation Models
Transformer
Selfattention
Attention
Transformer Architecture
Pretraining
Applications
Models
Tokenization
Masking
DNA Bird 2
nucleotide Transformer
SD Bert Model
SCGPT
generative pretraining
SC Foundation model
Open question
Summary
Taught by
Computational Genomics Summer Institute CGSI
Related Courses
Network Analysis in Systems BiologyIcahn School of Medicine at Mount Sinai via Coursera Molecular Dynamics for Computational Discoveries in Science
University of Massachusetts Boston via Independent Biology Meets Programming: Bioinformatics for Beginners
University of California, San Diego via Coursera Python for Informatics: Exploring Information
Open Education by Blackboard Genomic Medicine Gets Personal
Georgetown University via edX