YoVDO

Apache Tika 2.0 - New Features and Improvements

Offered By: Linux Foundation via YouTube

Tags

Laravel Courses, Web Development Courses

Course Description

Overview

Explore the latest features and improvements in Apache Tika 2.0 in this conference talk. Tika detects and extracts metadata and text from a wide range of file formats, serving applications from search engines to big data processing. Learn how Tika has evolved over the past decade, including expanded format support, new ways of using it, and a refined philosophy for handling different file types. Gain insights into using Tika from multiple programming languages and at big-data scale. Whether you're an experienced Tika user or new to the technology, the talk covers detection, OCR, databases, Tika Config XML, batch processing, geo entity lookup, image object recognition, and text-searchable video. It also explains the changes to metadata storage, logging, configuration, and content enhancement introduced in Tika 2.0. Presented by Nick Burch, a long-time Apache contributor and CTO at Quanticate, this talk offers valuable knowledge for anyone interested in efficient content extraction and analysis.

Syllabus

Intro
Tika in the news
Tika's History in brief
Detection
Supported Formats
OCR
Databases
Tika Config XML example
Tika Batch
Geo Entity Lookup
Image Object Recognition
Text Searchable Video
Tika 1.14
Metadata Storage
Metadata for Video etc
Logging, Config, Defaults
Content Handler Reset Add
Content Enhancement
Metadata Standards


Taught by

Linux Foundation

Related Courses

Create an eCommerce Website Using Laravel (PHP & MySQL)
Udemy
PHP with Laravel for beginners - Become a Master in Laravel
Udemy
Beginning Laravel 10 - From Novice to Professional (2023)
Udemy
Laravel PHP Framework - Beginners
Udemy
Learn Laravel 7 along with REST API & Livewire
Udemy