Mathematical Structure Computed by Large Language Models - A First Approximation
Offered By: Institut des Hautes Etudes Scientifiques (IHES) via YouTube
Course Description
Overview
Explore the mathematical structure behind Large Language Models in this comprehensive lecture. Delve into the conditional probability distributions of text extensions and their representation as a directed metric structure on the space of texts. Discover how this structure is encoded in a directed metric polyhedron, with texts isometrically embedded as generators of special extremal rays. Learn about the tropical generation of the polyhedron and its relation to a duality theorem connecting text extensions and restrictions. Examine the approximation of text generators using Boltzmann weighted linear combinations of word generators. Gain insights into the categorical interpretations of these constructions, including the Yoneda embedding and generalizations of language as a monoid or poset. This joint work with Stéphane Gaubert offers a deep dive into the mathematical foundations of LLMs, presented by Yiannis Vlassopoulos from the Athena Research Center.
Syllabus
Yiannis Vlassopoulos - A First Approximation to the Mathematical Structure Computed by LLMs
Taught by
Institut des Hautes Etudes Scientifiques (IHES)
Related Courses
Unleashing Algebraic Metaprogramming in Julia with Metatheory.jlThe Julia Programming Language via YouTube COSC250 - Functional and Reactive Programming
Independent Free as in Monads - Understanding and Applying Free Monads - Lecture 44
ChariotSolutions via YouTube Generalised Integrated Information Theories
Models of Consciousness Conferences via YouTube Reasoning About Conscious Experience With Axiomatic and Graphical Mathematics
Models of Consciousness Conferences via YouTube