OpenAI DALL·E - Creating Images from Text - Blog Post Explained
Offered By: Yannic Kilcher via YouTube
Course Description
Overview
Syllabus
- Introduction
- Overview
- Dataset
- Comparison to GPT-3
- Model Architecture
- VQ-VAE
- Combining VQ-VAE with GPT-3
- Pre-Training with Relaxation
- Experimental Results
- My Hypothesis about DALL·E's inner workings
- Sparse Attention Patterns
- DALL·E can't count
- DALL·E can't global order
- DALL·E renders different views
- DALL·E is very good at texture
- DALL·E can complete a bust
- DALL·E can do some reflections, but not others
- DALL·E can do cross-sections of some objects
- DALL·E is amazing at style
- DALL·E can generate logos
- DALL·E can generate bedrooms
- DALL·E can combine unusual concepts
- DALL·E can generate illustrations
- DALL·E sometimes understands complicated prompts
- DALL·E can pass part of an IQ test
- DALL·E probably does not have geographical / temporal knowledge
- Reranking dramatically improves quality
- Conclusions & Comments
Taught by
Yannic Kilcher
Related Courses
How to Build Codex SolutionsMicrosoft via YouTube Unlocking the Power of OpenAI for Startups - Microsoft for Startups
Microsoft via YouTube Building Intelligent Applications with World-Class AI
Microsoft via YouTube Stanford Seminar - Transformers in Language: The Development of GPT Models Including GPT-3
Stanford University via YouTube ChatGPT: GPT-3, GPT-4 Turbo: Unleash the Power of LLM's
Udemy