Gemino: Practical and Robust Neural Compression for Video Conferencing
Offered By: USENIX via YouTube
Course Description
Overview
Watch a conference talk from NSDI '24 that introduces Gemino, a neural compression system for video conferencing. Learn why current video codecs struggle under poor network conditions and how Gemino addresses this with a high-frequency-conditional super-resolution pipeline, which upsamples low-resolution frames while enhancing high-frequency details using information from a single high-resolution reference image. Understand the multi-scale architecture that lets Gemino scale to resolutions comparable to 720p, and the personalization that improves fidelity at low bitrates. See how Gemino is implemented using aiortc, an open-source Python implementation of WebRTC, and how it runs in real time on a Titan X GPU. Gemino achieves 2.2–5x lower bitrate than traditional video codecs while maintaining perceptual quality.
Syllabus
NSDI '24 - Gemino: Practical and Robust Neural Compression for Video Conferencing
Taught by
USENIX
Related Courses
Neural Networks for Machine Learning
University of Toronto via Coursera
機器學習技法 (Machine Learning Techniques)
National Taiwan University via Coursera
Machine Learning Capstone: An Intelligent Application with Deep Learning
University of Washington via Coursera
Прикладные задачи анализа данных (Applied Problems in Data Analysis)
Moscow Institute of Physics and Technology via Coursera
Leading Ambitious Teaching and Learning
Microsoft via edX