Ocelot: A System for Summarizing Web Pages
Offered By: Center for Language & Speech Processing (CLSP), JHU via YouTube
Course Description
Overview
Explore OCELOT, a prototype system that automatically generates concise summaries, or "gists," of web pages. Web content is harder to summarize than traditional text documents such as news articles: a page often lacks coherent structure and frequently consists of a mix of phrases, links, graphics, and formatting commands. Rather than extracting verbatim text spans, OCELOT takes a non-extractive approach, using probabilistic models to select and order words into a concise representation. The lecture also examines how these models are trained on a collection of human-summarized web pages. This hour-long lecture, presented by Adam Berger from the Center for Language & Speech Processing at Johns Hopkins University, covers text summarization technology tailored specifically to the web.
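To make the select-and-order idea concrete, here is a minimal toy sketch in Python. It is not OCELOT's actual model: the function names, the add-one smoothing, the bigram ordering step, and the tiny training set are all assumptions made purely for illustration. It estimates from page/summary pairs how likely a page word is to appear in a summary, picks the highest-scoring words from a new page, and then orders them greedily with bigram counts taken from the training summaries.

```python
from collections import Counter, defaultdict

def tokenize(text):
    # Naive whitespace tokenizer; real web pages would need markup stripping first.
    return text.lower().split()

def train(pairs):
    """Estimate toy selection and ordering statistics from (page, summary) pairs."""
    in_page = Counter()             # number of pages that contain the word
    in_both = Counter()             # pages where the word is also in the summary
    bigrams = defaultdict(Counter)  # word-to-next-word counts over summaries
    for page, summary in pairs:
        summary_tokens = tokenize(summary)
        summary_words = set(summary_tokens)
        for w in set(tokenize(page)):
            in_page[w] += 1
            if w in summary_words:
                in_both[w] += 1
        for prev, cur in zip(["<s>"] + summary_tokens, summary_tokens):
            bigrams[prev][cur] += 1
    return in_page, in_both, bigrams

def select_words(page, in_page, in_both, k):
    """Pick the k page words most likely to belong in a gist."""
    counts = Counter(tokenize(page))
    def score(w):
        # Add-one smoothed P(word in summary | word in page), weighted by page frequency.
        return (in_both[w] + 1) / (in_page[w] + 2) * counts[w]
    # Ties are broken alphabetically so the result is deterministic.
    return sorted(counts, key=lambda w: (-score(w), w))[:k]

def order_words(words, bigrams):
    """Greedily order the words using summary bigram counts."""
    remaining, prev, ordered = list(words), "<s>", []
    while remaining:
        # max() returns the first best candidate, so ties keep selection rank.
        nxt = max(remaining, key=lambda w: bigrams[prev][w])
        ordered.append(nxt)
        remaining.remove(nxt)
        prev = nxt
    return ordered

def gist(page, model, k=3):
    in_page, in_both, bigrams = model
    return " ".join(order_words(select_words(page, in_page, in_both, k), bigrams))

# Hypothetical training data: (web page text, human-written summary).
PAIRS = [
    ("home page of the jazz club with concert dates and tickets click here",
     "jazz club concert listings"),
    ("welcome click here for the chess club meeting schedule and news",
     "chess club news"),
    ("the city jazz festival tickets on sale click here for dates",
     "jazz festival tickets"),
]
PAGE = "click here for jazz club tickets and news the jazz club home"

model = train(PAIRS)
print(gist(PAGE, model, k=3))  # prints "jazz club news"
```

Note that the output words are chosen and reordered individually rather than copied as a contiguous span from the page, which is the sense in which this sketch is non-extractive.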
Syllabus
Ocelot: A system for summarizing web pages - Adam Berger
Taught by
Center for Language & Speech Processing (CLSP), JHU
Related Courses
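Semantic Web Technologies - openHPI
أساسيات استرجاع المعلومات (Fundamentals of Information Retrieval) - Rwaq (رواق)
《gacco特別企画》Evernoteで広がるgaccoの学びスタイル (ga038) (gacco Special Project: Expanding the gacco Learning Style with Evernote) - University of Tokyo via gacco
La Web Semántica: Herramientas para la publicación y extracción efectiva de información en la Web (The Semantic Web: Tools for Effective Publication and Extraction of Information on the Web) - Pontificia Universidad Católica de Chile via Coursera
快速学习 (Fast Learning) - University of Science and Technology of China via Coursera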