Nugget: Neural Agglomerative Embeddings of Text
Offered By: Center for Language & Speech Processing (CLSP), JHU via YouTube
Course Description
Overview
Explore a novel approach to text embedding called Nugget in this 37-minute conference talk by Guanghui Qin from the Center for Language & Speech Processing at Johns Hopkins University. Learn how Nugget addresses the limitations of constant-size representations by dynamically encoding language into meaningful units based on a subset of input tokens. Discover how this method outperforms existing approaches in semantic comparison tasks and offers potential for expanding the contextual window of language models. Gain insights into the training process of Nugget through tasks like autoencoding and machine translation, and understand its implications for future language models that can process significantly larger amounts of content.
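To make the idea of "a subset of input tokens" concrete, here is a minimal sketch of selecting a variable number of token vectors by score. It is an illustration only, not the talk's implementation: the function name `select_nuggets`, the `ratio` parameter, and the hand-written scores and vectors are all assumptions made for this example. In a real system, the vectors and scores would come from a trained model.

```python
import math


def select_nuggets(token_vectors, scores, ratio=0.25):
    """Keep the vectors of the highest-scoring tokens, in original order.

    token_vectors: one vector (list of floats) per input token.
    scores: one importance score per token (hypothetical values here).
    ratio: assumed fraction of tokens to keep, so longer inputs
           yield more vectors instead of a constant-size representation.
    """
    if len(token_vectors) != len(scores):
        raise ValueError("need exactly one score per token")
    if not token_vectors:
        return [], []
    k = max(1, math.ceil(len(token_vectors) * ratio))
    # Indices of the k highest scores, then restored to input order.
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    kept = sorted(top)
    return kept, [token_vectors[i] for i in kept]


# Toy input: the vectors and scores are made up for illustration.
tokens = ["the", "cat", "sat", "on", "the", "mat", "today", "."]
vectors = [[float(i), float(i) * 0.5] for i in range(len(tokens))]
scores = [0.1, 0.9, 0.7, 0.2, 0.1, 0.8, 0.3, 0.05]

kept, nuggets = select_nuggets(vectors, scores, ratio=0.25)
print([tokens[i] for i in kept])
```

With eight tokens and an assumed ratio of 0.25, two vectors are kept. Doubling the input length doubles the number of kept vectors, which is the sense in which the representation is not constant-size.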
Syllabus
Nugget: Neural Agglomerative Embeddings of Text - Guanghui Qin
Taught by
Center for Language & Speech Processing (CLSP), JHU
Related Courses
Create Text Embeddings for a Vector Store using LangChain
Google Cloud via Coursera
Introduction to Vertex AI Embeddings: Text and Multimodal
Google Cloud via Coursera
Product Recommender System: OpenAI Text Embedding
Coursera Project Network via Coursera
Vector Search and Embeddings - Bahasa Indonesia
Google Cloud via Coursera
Vector Search and Embeddings - Deutsch
Google Cloud via Coursera