Improving Statistical Parsers Using Cross-Corpus Data - 2002
Offered By: Center for Language & Speech Processing(CLSP), JHU via YouTube
Course Description
Overview
Explore techniques for improving statistical parsers using cross-corpus data in this 55-minute lecture by Xiaoqiang Luo from IBM. Learn how to leverage data annotated for other purposes to enhance parser performance without the need for expensive, labor-intensive labeling. Discover how the EM algorithm can be employed to infer missing information from partial constraints provided by label information from other domains or corpora. Examine results demonstrating improvements to a maximum entropy parser using cross-domain and cross-corpus data. Gain insights into statistical modeling for natural language processing, language modeling, speech recognition, and spoken dialog systems from Luo's experience developing semantic parsers and interpreters for IBM's DARPA Communicator project.
Syllabus
Improving Statistical Parsers Using Cross-Corpus Data – Xiaoqiang Luo (IBM) - 2002
Taught by
Center for Language & Speech Processing(CLSP), JHU
Related Courses
Miracles of Human Language: An Introduction to LinguisticsLeiden University via Coursera Language and Mind
Indian Institute of Technology Madras via Swayam Text Analytics with Python
University of Canterbury via edX Playing With Language
TED-Ed via YouTube Computational Language: A New Kind of Science
World Science U