Digital Speech Processing
Offered By: Indian Institute of Technology, Kharagpur via Swayam
Course Description
Overview
ABOUT THE COURSE: Oral Speech may be the most natural, common and direct mode of human communication. Since the middle of the last century, Speech has become an area of intense and active research and development (R&D) to become a prime means of direct Human-Computer Interactions (HCI). The pace of such R&D has farther got boosted with the general abundance of cheap computing power in the form of PC, PDA or Mobile Handset. While man to machine in speech mode is yet to reach the minimum threshold level for wide-spread deployment, spoken messages directly by machine. This need research in speech science and development of speech technology. The course provides the foundation knowledge on speech production and perception along with processing of speech signal in digital domain.INTENDED AUDIENCE: ECE, CS, EE, IEPREREQUISITES: Digital Signal Processing or Signals and SystemINDUSTRY SUPPORT: Companies, Industry like Microsoft, Google , IBM who are working in the area of speech technology development
Syllabus
Week 1: Introduction to speech processing, Digitization and Recording of speech signal, Review of Digital Signal Processing Concepts
Week 2:Human Speech production, Acoustic Phonetics and Articulatory Phonetics, Different categories speech sounds and Location of sounds in the acoustic waveform and spectrograms
Week 3:Uniform Tube Modeling of Speech Production, Speech Perception
Week 4:Time Domain Methods in Speech Processing, Analysis and Synthesis of Pole-Zero Speech Models
Week 5:Short-Time Fourier Transform, Analysis:- FT view and Filtering view, Synthesis:-Filter bank summation (FBS) Method and OLA Method
Week 6:Features Extraction, Extraction of Fundamental frequency
Week 7:Speech Prosody, Speech Prosody Modeling (Fujisaki Model)
Week 8:Speech based Applications (TTS, ASR and spoken language acquisition)
Tags
Related Courses
Fundamentals of Electrical EngineeringRice University via Coursera Digital Signal Processing
École Polytechnique Fédérale de Lausanne via Coursera Fundamentals of Electrical Engineering Laboratory
Rice University via Coursera Processamento Digital de Sinais - Amostragem
Universidade Estadual de Campinas via Coursera Physics-Based Sound Synthesis for Games and Interactive Systems
Stanford University via Kadenze