Query Embeddings - Web Scale Search Powered by Deep Learning and Python
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Explore the world of query embeddings and web-scale search powered by deep learning and Python in this EuroPython Conference talk. Dive into an unsupervised deep learning system built using Python and open-source libraries like Annoy and keyvi, designed to recognize similarities between queries and their vector representations. Learn how this technology improves recall for previously unseen queries and integrates into the Cliqz browser's search stack. Discover the transition from traditional keyword-based search to deep learning and NLP techniques that represent sentences and documents as fixed-dimensional vectors in high-dimensional space. Gain insights into the architecture of query embeddings, including vector indexing, approximate nearest neighbor models, and the use of Word2Vec. Explore real-world applications, latency issues in real-time search systems, and the potential for this framework to be utilized in other low-latency systems involving vector representations.
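As a rough illustration of the pipeline the overview describes (word vectors combined into a query vector, then a nearest-neighbor lookup over indexed queries), here is a minimal sketch. It is not the speakers' implementation. The 3-dimensional word vectors are invented for the example, the query vector is a plain unweighted average, and an exact cosine-similarity scan stands in for the approximate nearest-neighbor index that a library such as Annoy would provide. All function and variable names are hypothetical.

```python
import math

# Hypothetical toy word vectors; a real system would learn these with Word2Vec
# and use far more dimensions.
WORD_VECTORS = {
    "sims": [0.9, 0.1, 0.0],
    "game": [0.8, 0.2, 0.1],
    "pc": [0.5, 0.5, 0.0],
    "download": [0.1, 0.9, 0.0],
    "weather": [0.0, 0.1, 0.9],
    "berlin": [0.1, 0.0, 0.8],
}


def query_vector(query):
    """Average the vectors of the known words in a query (unweighted, an assumption)."""
    vectors = [WORD_VECTORS[w] for w in query.lower().split() if w in WORD_VECTORS]
    if not vectors:
        return None  # no known words, so no vector can be formed
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def closest_queries(query, indexed_queries, k=2):
    """Return the k indexed queries most similar to `query`.

    This is an exact linear scan; an approximate index would replace it at scale.
    """
    target = query_vector(query)
    if target is None:
        return []
    scored = []
    for candidate in indexed_queries:
        vec = query_vector(candidate)
        if vec is not None:
            scored.append((cosine(target, vec), candidate))
    scored.sort(reverse=True)  # highest similarity first
    return [candidate for _, candidate in scored[:k]]


INDEXED = ["sims pc download", "weather berlin", "game download"]
print(closest_queries("sims game pc download", INDEXED, k=2))
```

With these toy vectors, the unseen query "sims game pc download" is matched to the two game-related indexed queries rather than to "weather berlin", which is the recall benefit the overview refers to.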
Syllabus
Intro
TRADITIONAL SEARCH
OUR SEARCH STORY
SEARCH PROBLEM - OVERVIEW
ENTERS DEEP LEARNING
EXAMPLE QUERY: SIMS GAME PC DOWNLOAD
LEARNING DISTRIBUTED REPRESENTATION OF WORDS
WORD2VEC DEMYSTIFIED
NEURAL PROBABILISTIC LANGUAGE MODELS
EXAMPLE: SKIP-GRAM MODEL
WORD VECTORS CAPTURING SEMANTIC INFORMATION
WORD VECTORS IN 2D
QUERY VECTOR FORMATION - SIMS GAME PC DOWNLOAD
TERMS RELEVANCE
QUERY VECTOR INDEX
FINDING CLOSEST QUERIES
ANNOY (APPROXIMATE NEAREST NEIGHBOR MODEL)
ANATOMY OF ANNOY
STORING WORD EMBEDDINGS & QUERY-INTEGER MAPPINGS
RESULTS
CONCLUSION
Taught by
EuroPython Conference
Related Courses
A Brief History of Data Storage - EuroPython Conference via YouTube
Breaking the Stereotype - Evolution & Persistence of Gender Bias in Tech - EuroPython Conference via YouTube
We Can Get More from Spatial, GIS, and Public Domain Datasets - EuroPython Conference via YouTube
Using NLP to Detect Knots in Protein Structures - EuroPython Conference via YouTube
The Challenges of Doing Infra-As-Code Without "The Cloud" - EuroPython Conference via YouTube