Data Extraction via Semantic Regular Expression Synthesis
Offered By: ACM SIGPLAN via YouTube
Course Description
Overview
Explore a groundbreaking approach to data extraction in this 18-minute conference talk from OOPSLA2 2023. Delve into the concept of semantic regexes, an innovative extension of regular expressions that combines syntactic pattern matching with semantic reasoning for more effective textual data analysis. Learn about a novel learning algorithm that synthesizes semantic regexes from minimal examples, utilizing neural sketch generation and compositional type-directed synthesis. Discover how the new tool Smore implements these ideas and outperforms existing solutions, including state-of-the-art neural networks and program synthesis tools, in complex data extraction tasks. Gain insights into the practical applications and superior performance of semantic regexes compared to standard regular expressions across various textual datasets.
Syllabus
[OOPSLA23] Data Extraction via Semantic Regular Expression Synthesis
Taught by
ACM SIGPLAN
Related Courses
Introduction to Artificial IntelligenceStanford University via Udacity Natural Language Processing
Columbia University via Coursera Probabilistic Graphical Models 1: Representation
Stanford University via Coursera Computer Vision: The Fundamentals
University of California, Berkeley via Coursera Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent