Zipf's Law Suggests a Three-Pronged Approach to Inclusive Speech Recognition
Offered By: Center for Language & Speech Processing (CLSP), JHU via YouTube
Course Description
Overview
Explore Zipf's law and its implications for inclusive speech recognition in this 55-minute lecture by Mark Hasegawa-Johnson from the Center for Language & Speech Processing at JHU. Delve into the three types of words - frequent, infrequent, and out-of-vocabulary - and how speech recognition technology has evolved to address each category. Examine the power-law distribution in language demographics and its impact on speech recognition approaches. Learn about monolingual pre-training, multilingual knowledge transfer, and unsupervised ASR methods for languages with varying amounts of data. Discuss the challenges of speech recognition for individuals with disabilities and the importance of collaboration between researchers and affected communities. Gain insights from Hasegawa-Johnson's extensive research in speech production, perception, source separation, voice conversion, and low-resource automatic speech recognition.
Syllabus
Zipf's Law Suggests a Three-Pronged Approach to Inclusive Speech Recognition - Mark Hasegawa-Johnson
Taught by
Center for Language & Speech Processing (CLSP), JHU
Related Courses
Natural Language Processing - Columbia University via Coursera
Bioinformatics Algorithms (Part 2) - University of California, San Diego via Coursera
Finding Mutations in DNA and Proteins (Bioinformatics VI) - University of California, San Diego via Coursera
Elaborazione del linguaggio naturale - University of Naples Federico II via Federica
Dynamic Programming: Applications In Machine Learning and Genomics - University of California, San Diego via edX