Exploring Gemini 1.5 Pro: Large Context Window and Multimodal Capabilities
Offered By: Sam Witteveen via YouTube
Course Description
Overview
Explore the capabilities of Google's latest AI model, Gemini 1.5, in this informative video demonstration. Dive into the expanded 1 million token context window and witness its impressive performance across various tasks. Learn about the model's ability to query documents, write code, and analyze video and image content. Compare Gemini 1.5's context window to other leading AI models and gain insights into its potential applications. Watch as the presenter showcases real-time examples, including document analysis, code generation, and multimodal understanding of video and image inputs. Discover how this advanced AI technology pushes the boundaries of natural language processing and multimodal comprehension.
Syllabus
Intro
Google Gemini 1.5 Pro Blog
Context Window Comparison
Demo
Demo: Querying Documents
Demo: Writing some code
Demo: Using Video Sample 01
Demo: Using Vide Sample 02
Demo: Using Video + Images Sample
Taught by
Sam Witteveen
Related Courses
CompilersStanford University via Coursera Build a Modern Computer from First Principles: Nand to Tetris Part II (project-centered course)
Hebrew University of Jerusalem via Coursera Разработка веб-сервисов на Go - основы языка
Moscow Institute of Physics and Technology via Coursera Complete Guide to Protocol Buffers 3 [Java, Golang, Python]
Udemy Angular tooling: Generating code with schematics
Coursera Project Network via Coursera