Ocelot: A System for Summarizing Web Pages
Offered By: Center for Language & Speech Processing(CLSP), JHU via YouTube
Course Description
Overview
Explore a prototype system called OCELOT designed to automatically generate concise summaries or "gists" of web pages. Learn how this innovative approach tackles the unique challenges of summarizing web content, which often lacks the coherent structure found in traditional text documents like news articles. Discover how OCELOT employs non-extractive summarization techniques, using probabilistic models to select and order words into a concise representation rather than extracting verbatim text spans. Examine the process of training these models using a collection of human-summarized web pages. Gain insights into the complexities of summarizing web content, which frequently consists of a mix of phrases, links, graphics, and formatting commands. This hour-long lecture, presented by Adam Berger from the Center for Language & Speech Processing at Johns Hopkins University, offers a deep dive into cutting-edge text summarization technology specifically tailored for the web.
Syllabus
Ocelot: A system for summarizing web pages - Adam Berger
Taught by
Center for Language & Speech Processing(CLSP), JHU
Related Courses
Fundamentals of Quantitative ModelingUniversity of Pennsylvania via Coursera Теория вероятностей – наука о случайности
Tomsk State University via Stepik Statistics and Data Science
Massachusetts Institute of Technology via edX Natural Language Processing with Probabilistic Models
DeepLearning.AI via Coursera Natural Language Processing
DeepLearning.AI via Coursera