Stable Diffusion and Friends - High-Resolution Image Synthesis via Two-Stage Generative Models

Offered By: Hugging Face via YouTube

Course Description

Overview

Explore the evolution of generative image models in this talk by Robin Rombach, co-creator of Stable Diffusion. The talk traces the progression from GANs to Transformers to latent diffusion models and explains the techniques behind high-resolution image synthesis. Learn about two-stage generative models, the VQ-VAE architecture, Vision Transformers, and the Stable Diffusion model. Discover applications in text-to-image generation, inpainting, semantic synthesis, and upscaling, along with creative uses such as text-to-color-palette conversion and video stylization. Rombach draws on his research experience and his role in developing widely used projects such as VQGAN, Taming Transformers, and Latent Diffusion Models.

Syllabus

Introduction
Diffusion
Two-Stage Generative Models
Leon Model
Why domain knowledge
VQ-VAE architecture
VQ-VAE reconstruction
Vision Transformers
VQGAN
High-Resolution Image Synthesis
Text to Image Generation
Stable Diffusion
Classifier-Free Diffusion Guidance
Stable Diffusion Inpainting
Semantic Synthesis
Upscaling
SDEdit
Diffusion Model
Creative Applications
Text to Color Palette
Video stylization
Lexi Carlile
Credits
Questions
One Direction
Adding Numerology
Conclusion

Taught by

Hugging Face
