Query Embeddings - Web Scale Search Powered by Deep Learning and Python
Offered By: EuroPython Conference via YouTube
Course Description
Overview
Explore query embeddings and web-scale search powered by deep learning and Python in this EuroPython Conference talk. Dive into an unsupervised deep learning system, built with Python and open-source libraries such as Annoy and keyvi, that learns vector representations of queries and uses them to recognize similar queries. Learn how this technology improves recall for previously unseen queries and how it integrates into the Cliqz browser's search stack. Discover the transition from traditional keyword-based search to deep learning and NLP techniques that represent sentences and documents as fixed-dimensional vectors in a high-dimensional space. Gain insights into the architecture of query embeddings, including vector indexing, approximate nearest neighbor models, and the use of Word2Vec. Explore real-world applications, latency issues in real-time search systems, and the potential for this framework to be used in other low-latency systems that work with vector representations.
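To make the idea in the overview concrete, here is a minimal sketch of forming a query vector from word vectors and finding the closest known queries. It is an illustration under stated assumptions, not the system presented in the talk: the three-dimensional word vectors are made-up toy values standing in for trained Word2Vec embeddings, the helper names (`query_vector`, `closest_queries`) and the optional term weights are hypothetical, and a brute-force cosine search stands in for an approximate nearest neighbor index such as Annoy.

```python
import math

# Toy 3-dimensional word vectors (made-up values for illustration only;
# a real system would load embeddings trained with Word2Vec).
WORD_VECTORS = {
    "sims": [0.9, 0.1, 0.0],
    "game": [0.8, 0.2, 0.1],
    "pc": [0.5, 0.5, 0.0],
    "download": [0.1, 0.9, 0.1],
    "weather": [0.0, 0.1, 0.9],
    "berlin": [0.1, 0.0, 0.8],
}


def query_vector(query, weights=None):
    """Return the weighted average of the word vectors of known query terms.

    `weights` is a hypothetical per-term relevance map; missing terms get 1.0.
    Returns None if no term is known or the total weight is zero.
    """
    weights = weights or {}
    terms = [t for t in query.lower().split() if t in WORD_VECTORS]
    total = sum(weights.get(t, 1.0) for t in terms)
    if not terms or total == 0:
        return None
    dim = len(next(iter(WORD_VECTORS.values())))
    vec = [0.0] * dim
    for term in terms:
        w = weights.get(term, 1.0)
        for i, x in enumerate(WORD_VECTORS[term]):
            vec[i] += w * x
    return [x / total for x in vec]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def closest_queries(query, indexed_queries, k=2):
    """Brute-force search for the k indexed queries most similar to `query`.

    An approximate nearest neighbor index would replace this linear scan
    at web scale.
    """
    qv = query_vector(query)
    if qv is None:
        return []
    scored = []
    for other in indexed_queries:
        ov = query_vector(other)
        if ov is not None:
            scored.append((cosine(qv, ov), other))
    scored.sort(reverse=True)
    return [q for _, q in scored[:k]]


INDEXED = ["sims pc download", "game download", "weather berlin"]
print(closest_queries("sims game pc download", INDEXED, k=2))
```

With these toy vectors, the unrelated query "weather berlin" scores far lower than the two download-related queries, which is the behavior that lets a search system map an unseen query onto similar queries it already has results for.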
Syllabus
Intro
TRADITIONAL SEARCH
OUR SEARCH STORY
SEARCH PROBLEM - OVERVIEW
ENTERS DEEP LEARNING
EXAMPLE QUERY: SIMS GAME PC DOWNLOAD
LEARNING DISTRIBUTED REPRESENTATION OF WORDS
WORD2VEC DEMYSTIFIED
NEURAL PROBABILISTIC LANGUAGE MODELS
EXAMPLE: SKIP-GRAM MODEL
WORD VECTORS CAPTURING SEMANTIC INFORMATION
WORD VECTORS IN 2D
QUERY VECTOR FORMATION - SIMS GAME PC DOWNLOAD
TERMS RELEVANCE
QUERY VECTOR INDEX
FINDING CLOSEST QUERIES
ANNOY (APPROXIMATE NEAREST NEIGHBOR MODEL)
ANATOMY OF ANNOY
STORING WORD EMBEDDINGS & QUERY-INTEGER MAPPINGS
RESULTS
CONCLUSION
Taught by
EuroPython Conference
Related Courses
Neural Networks for Machine Learning
University of Toronto via Coursera
機器學習技法 (Machine Learning Techniques)
National Taiwan University via Coursera
Machine Learning Capstone: An Intelligent Application with Deep Learning
University of Washington via Coursera
Прикладные задачи анализа данных (Applied Problems of Data Analysis)
Moscow Institute of Physics and Technology via Coursera
Leading Ambitious Teaching and Learning
Microsoft via edX