OpenAI DALL·E - Creating Images from Text - Blog Post Explained
Offered By: Yannic Kilcher via YouTube
Course Description
Overview
Syllabus
- Introduction
- Overview
- Dataset
- Comparison to GPT-3
- Model Architecture
- VQ-VAE
- Combining VQ-VAE with GPT-3
- Pre-Training with Relaxation
- Experimental Results
- My Hypothesis about DALL·E's inner workings
- Sparse Attention Patterns
- DALL·E can't count
- DALL·E can't global order
- DALL·E renders different views
- DALL·E is very good at texture
- DALL·E can complete a bust
- DALL·E can do some reflections, but not others
- DALL·E can do cross-sections of some objects
- DALL·E is amazing at style
- DALL·E can generate logos
- DALL·E can generate bedrooms
- DALL·E can combine unusual concepts
- DALL·E can generate illustrations
- DALL·E sometimes understands complicated prompts
- DALL·E can pass part of an IQ test
- DALL·E probably does not have geographical / temporal knowledge
- Reranking dramatically improves quality
- Conclusions & Comments
Taught by
Yannic Kilcher
Related Courses
6.S191: Introduction to Deep LearningMassachusetts Institute of Technology via Independent Generate Synthetic Images with DCGANs in Keras
Coursera Project Network via Coursera Image Compression and Generation using Variational Autoencoders in Python
Coursera Project Network via Coursera Build Basic Generative Adversarial Networks (GANs)
DeepLearning.AI via Coursera Apply Generative Adversarial Networks (GANs)
DeepLearning.AI via Coursera