YoVDO

Multimodal Generative AI Demystified

Offered By: GAIA via YouTube

Tags

Deep Learning Courses
Computer Vision Courses
3D Reconstruction Courses
Image Generation Courses
Audio Generation Courses

Course Description

Overview

Explore the intricacies of multimodal generative AI in this 28-minute conference talk delivered by Ekaterina Sirazitdinova, Senior Data Scientist at NVIDIA. Gain insights into the inner workings of these complex models and the key concepts and techniques used in their development. Discover various applications and use cases of multimodal generative AI, which enables the creation of realistic images, videos, and audio from textual or other inputs. Learn from Ekaterina's expertise in leveraging AI techniques for computer vision and language processing challenges, as well as her experience in end-to-end AI productization. Benefit from her background in medical image analysis and her academic achievements, including a Ph.D. in Computer Science and publications on image-based 3D reconstruction, localization, and tracking. Recorded at the 2024 GAIA Conference, this talk is suitable for anyone interested in the current state of AI and its potential to produce realistic and immersive multimedia experiences.

Syllabus

Multimodal Generative AI Demystified by Ekaterina Sirazitdinova


Taught by

GAIA

Related Courses

Getting Started with Generative AI Using Amazon SageMaker JumpStart (Japanese ONLY) (Na) Japanese Live-Action Version
Amazon Web Services via AWS Skill Builder
Amazon SageMaker JumpStart Foundations
Amazon Web Services via AWS Skill Builder
Amazon SageMaker JumpStart Foundations (Japanese)
Amazon Web Services via AWS Skill Builder
Apply Generative Adversarial Networks (GANs)
DeepLearning.AI via Coursera
AWS Flash - Getting Started with Generative AI Using Amazon SageMaker JumpStart (Japanese ONLY) (Na) Japanese Live-Action Version
Amazon Web Services via AWS Skill Builder