YoVDO

Deconstructing Text Embedding Models - Understanding Tokenizers and Model Selection

Offered By: EuroPython Conference via YouTube

Tags

Data Analysis Courses
Machine Learning Courses
Model Selection Courses
Fine-Tuning Courses

Course Description

Overview

Explore the intricacies of text embedding models in this 44-minute EuroPython Conference talk. Delve into the critical role of tokenizers in model selection, moving beyond reliance on benchmarks like the Massive Text Embedding Benchmark (MTEB). Learn to assess model suitability for specific datasets based on tokenizer performance, and discover strategies for optimizing tokenizers during the fine-tuning process of embedding models. Gain insights into making informed decisions when choosing text embedding models for unique data characteristics.
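
As a rough illustration of what assessing tokenizer performance on a specific dataset can look like, the sketch below measures two common signals: how many tokens each word is split into, and how often a word falls back to the unknown token. It is not taken from the talk. The greedy WordPiece-style splitter, the toy vocabulary, and the names wordpiece_tokenize, tokenizer_fit, and TOY_VOCAB are all assumptions made for this example; in practice you would load the tokenizer that ships with the candidate embedding model instead.

```python
from __future__ import annotations

from typing import Iterable


def wordpiece_tokenize(word: str, vocab: set[str], unk: str = "[UNK]") -> list[str]:
    """Greedy longest-match-first split of a single word (WordPiece style).

    Hypothetical helper for illustration only; continuation pieces are
    marked with a leading "##".
    """
    pieces: list[str] = []
    start = 0
    while start < len(word):
        end = len(word)
        piece = None
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate
            if candidate in vocab:
                piece = candidate
                break
            end -= 1
        if piece is None:
            # No vocabulary entry matches here: the whole word is unknown.
            return [unk]
        pieces.append(piece)
        start = end
    return pieces


def tokenizer_fit(texts: Iterable[str], vocab: set[str]) -> dict[str, float]:
    """Return tokens-per-word and unknown-token rate for a sample of texts."""
    words = [w for text in texts for w in text.lower().split()]
    if not words:
        raise ValueError("no words to analyse")
    tokens = [t for w in words for t in wordpiece_tokenize(w, vocab)]
    return {
        "tokens_per_word": len(tokens) / len(words),
        "unk_rate": tokens.count("[UNK]") / len(tokens),
    }


# Toy vocabulary and sample texts, both made up for this sketch.
TOY_VOCAB = {"token", "##izer", "##s", "embed", "##ding", "model", "the", "a"}
stats = tokenizer_fit(["the tokenizers", "embedding model", "xyzzy"], TOY_VOCAB)
print(stats)
```

A high tokens-per-word value or a high unknown rate on your own data suggests the tokenizer's vocabulary fits that data poorly, which is one reason to compare candidate models on your data instead of relying only on benchmark rankings.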

Syllabus

Deconstructing the text embedding models — Kacper Łukawski


Taught by

EuroPython Conference

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Natural Language Processing
Columbia University via Coursera
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent