Stable Diffusion and Friends - High-Resolution Image Synthesis via Two-Stage Generative Models
Offered By: HuggingFace via YouTube
Course Description
Overview
Explore the evolution of generative image models in this insightful talk by Robin Rombach, co-creator of Stable Diffusion. Delve into the progression from GANs to Transformers and latent Diffusion models, gaining a comprehensive understanding of high-resolution image synthesis techniques. Learn about two-stage generative models, the QCVAE architecture, Vision Transformers, and the groundbreaking Stable Diffusion model. Discover applications in text-to-image generation, semantic synthesis, upscaling, and creative endeavors like text-to-color palette conversion and video stylization. Gain valuable insights from Rombach's extensive research experience and his pivotal role in developing widely-used projects such as VQGAN, Taming Transformers, and Latent Diffusion Models.
Syllabus
Introduction
Diffusion
TwoStage Generative Models
Leon Model
Why domain knowledge
QCVAE architecture
QCVAE reconstruction
VisionTransformers
VQan
HighResolution Image Synthesis
Text to Image Generation
Stable Diffusion
Classifier Free Diffusion Guidance
Stereo Fusion in Painting
Semantic Synthesis
Upscaling
SBEdit
Diffusion Model
Creative Applications
Text to Color Palette
Video stylization
Lexi Carlile
Credits
Questions
One Direction
Adding Numerology
Conclusion
Taught by
Hugging Face
Related Courses
Vision Transformers Explained + Fine-Tuning in PythonJames Briggs via YouTube ConvNeXt- A ConvNet for the 2020s - Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube Do Vision Transformers See Like Convolutional Neural Networks - Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube Intro to Dense Vectors for NLP and Vision
James Briggs via YouTube Geo-localization Framework for Real-world Scenarios - Defense Presentation
University of Central Florida via YouTube