YoVDO

Jlama: A Native Java LLM Inference Engine

Offered By: Devoxx via YouTube

Tags

Java Courses Artificial Intelligence Courses Machine Learning Courses LLaMA (Large Language Model Meta AI) Courses Vector API Courses Gemma Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore JLama, a cutting-edge inference engine designed to empower Java developers with native AI capabilities. Discover how this innovative tool brings the power of large language models directly into the Java ecosystem without the need for GPUs. Learn about JLama's support for popular open models like Llama, Gemma, and Mixtral, and its utilization of the new Vector API in Java 21 for enhanced performance. Delve into key features such as advanced model support, tokenizer compatibility, and implementation of state-of-the-art techniques including Flash Attention, Mixture of Experts, and Group Query Attention. Understand how JLama integrates with the LangChain4j project and complements JVector's Java native vector search capabilities to create a comprehensive AI stack for Java. Gain insights into JLama's technical intricacies, practical applications, and witness a live demonstration showcasing its potential to revolutionize Java-AI integration.

Syllabus

Jlama: A Native Java LLM inference engine by Jake Luciani


Taught by

Devoxx

Related Courses

Running Gemma Using HuggingFace Transformers and Ollama
Sam Witteveen via YouTube
Machine Learning News: Gemma, Gemini, Groq, Sora, and AI Developments
Yannic Kilcher via YouTube
AI News Roundup: Grok-1, Nvidia GTC, OpenAI Leaks, and EU AI Act
Yannic Kilcher via YouTube
Creating, Building, and Releasing Gemma - Google's Open Model Family
TensorFlow via YouTube
Claude 3 vs ChatGPT in Street Fighter - AI Model Tournament
All About AI via YouTube