YoVDO

Building a Custom AI Agent with Llama 3 70B and Runpod - Compatible with Hugging Face LLMs

Offered By: Data Centric via YouTube

Tags

GitHub Courses, LLM (Large Language Model) Courses, AI Engineering Courses, Hugging Face Courses, RunPod Courses, Retrieval Augmented Generation (RAG) Courses, vLLM Courses

Course Description

Overview

Learn to build a custom AI agent using the Llama 3 70B model, deployed on Runpod with vLLM, in this 24-minute tutorial. The project is designed to work with any Hugging Face LLM. The video covers the inference server schema, how to determine memory requirements, deploying the server on Runpod, connecting the server to the agent, and a live demonstration of the custom agent. It offers practical insight into AI engineering, large language models, and custom web search agent development, with hands-on explanations and accompanying resources.
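As a rough illustration of the memory-requirements step, the weights of a model need about (number of parameters) x (bytes per parameter) of GPU memory. The sketch below applies that rule of thumb to a 70B-parameter model. It is not taken from the video: the function names are hypothetical, and the 20% allowance for KV cache and runtime overhead is an assumption that depends on context length and batch size.

```python
import math


def estimate_weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory for the model weights alone, in GB (10^9 bytes)."""
    return num_params * bytes_per_param / 1e9


def estimate_serving_memory_gb(
    num_params: float,
    bytes_per_param: float,
    overhead_fraction: float = 0.2,  # ASSUMPTION: headroom for KV cache, activations
) -> float:
    """Weights plus an assumed fractional allowance for serving overhead."""
    weights_gb = estimate_weight_memory_gb(num_params, bytes_per_param)
    return weights_gb * (1.0 + overhead_fraction)


def min_gpus_needed(total_gb: float, gpu_memory_gb: float) -> int:
    """Lower bound on GPU count if memory is split evenly across devices."""
    return math.ceil(total_gb / gpu_memory_gb)


if __name__ == "__main__":
    params = 70e9  # Llama 3 70B

    fp16_weights = estimate_weight_memory_gb(params, 2.0)   # 16-bit weights
    int4_weights = estimate_weight_memory_gb(params, 0.5)   # 4-bit quantized weights
    print(f"FP16 weights: {fp16_weights:.0f} GB")
    print(f"4-bit weights: {int4_weights:.0f} GB")

    total = estimate_serving_memory_gb(params, 2.0)
    print(f"FP16 with assumed overhead: {total:.0f} GB")
    print(f"Minimum 80 GB GPUs: {min_gpus_needed(total, 80.0)}")
```

The GPU count is only a lower bound. Multi-GPU serving setups often require a device count that fits the model's architecture, so the practical number may be higher.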

Syllabus

Introduction
Inference server schema
Determining memory requirements
Deploying the server on Runpod
Using the inference server with the agent
Demoing the custom agent


Taught by

Data Centric

Related Courses

Finetuning, Serving, and Evaluating Large Language Models in the Wild
Open Data Science via YouTube
Cloud Native Sustainable LLM Inference in Action
CNCF [Cloud Native Computing Foundation] via YouTube
Optimizing Kubernetes Cluster Scaling for Advanced Generative Models
Linux Foundation via YouTube
LLaMa for Developers
LinkedIn Learning
Scaling Video Ad Classification Across Millions of Classes with GenAI
Databricks via YouTube