YoVDO

Building a Custom AI Agent with Llama 3 70B and Runpod - Compatible with Hugging Face LLMs

Offered By: Data Centric via YouTube

Tags

GitHub Courses, LLM (Large Language Model) Courses, AI Engineering Courses, Hugging Face Courses, RunPod Courses, Retrieval Augmented Generation (RAG) Courses, vLLM Courses

Course Description

Overview

Learn to build a custom AI agent using the Llama 3 70B model, deployed on Runpod with vLLM, in this 24-minute tutorial. The project is designed to be flexible and scalable, and it works with any Hugging Face LLM. The video covers the inference server schema, how to determine the model's memory requirements, deploying the server on Runpod, connecting the server to the agent, and a live demonstration of the custom agent. Along the way it offers practical insights into AI engineering, large language models, and custom web search agent development, with hands-on explanations and provided resources.
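
The memory requirement step can be illustrated with a rough back-of-the-envelope calculation. The sketch below is not taken from the video: it assumes the common rule of thumb that weight memory is roughly parameter count times bytes per parameter, and it adds a hypothetical 20% allowance for the KV cache and runtime overhead. The function names, the overhead fraction, and the 80 GB GPU size are all illustrative assumptions.

```python
import math


def estimate_weight_memory_gb(num_params_billion: float, bytes_per_param: float) -> float:
    """Approximate memory for the model weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params_billion * 1e9 * bytes_per_param / 1e9


def estimate_serving_memory_gb(
    num_params_billion: float,
    bytes_per_param: float,
    overhead_fraction: float = 0.2,  # assumed allowance for KV cache and runtime buffers
) -> float:
    """Weights plus a flat, assumed overhead fraction."""
    weights = estimate_weight_memory_gb(num_params_billion, bytes_per_param)
    return weights * (1 + overhead_fraction)


def gpus_needed(total_memory_gb: float, gpu_memory_gb: float) -> int:
    """Smallest number of equally sized GPUs whose combined memory covers the estimate."""
    return math.ceil(total_memory_gb / gpu_memory_gb)


if __name__ == "__main__":
    # A 70B-parameter model stored in 16-bit precision (2 bytes per parameter).
    weights_gb = estimate_weight_memory_gb(70, 2)
    total_gb = estimate_serving_memory_gb(70, 2)
    print(f"weights: {weights_gb:.0f} GB, with overhead: {total_gb:.0f} GB")
    print(f"80 GB GPUs needed (by memory alone): {gpus_needed(total_gb, 80)}")
```

This counts memory only. Real deployments may need more GPUs than the estimate suggests, since context length, batch size, and how the serving framework splits the model across devices all affect the final figure.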

Syllabus

Introduction
Inference server schema
Determining memory requirements
Deploying the server on Runpod
Using the inference server with the agent
Demoing the custom agent


Taught by

Data Centric

Related Courses

Better Llama with Retrieval Augmented Generation - RAG
James Briggs via YouTube
Live Code Review - Pinecone Vercel Starter Template and Retrieval Augmented Generation
Pinecone via YouTube
Nvidia's NeMo Guardrails - Full Walkthrough for Chatbots - AI
James Briggs via YouTube
Hugging Face LLMs with SageMaker - RAG with Pinecone
James Briggs via YouTube
Supercharge Your LLM Applications with RAG
Data Science Dojo via YouTube