YoVDO

Rust for Large Language Model Operations (LLMOps)

Offered By: Pragmatic AI Labs via edX

Tags

Rust Courses GPU Acceleration Courses Systems Programming Courses ONNX Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

This advanced course trains you for the cutting-edge of AI development by combining the power of Rust with Large Language Model Operations

  • Learn to build scalable LLM solutions using the performance of Rust
  • Master integrating Rust with LLM frameworks like HuggingFace Transformers
  • Integrate Rust with LLM frameworks like HuggingFace, Candle, ONNX

Get trained in the latest AI/ML innovations while mastering systems programming with Rust - your pathway to building state-of-the-art LLM applications.

  • Optimize LLM training/inference by leveraging Rust's parallelism and GPU acceleration
  • Build Rust bindings for seamless integration with HuggingFace Transformers
  • Convert and deploy BERT models to Rust apps via ONNX runtime
  • Utilize Candle for streamlined ML model building and training in Rust
  • Host and scale LLM solutions on AWS cloud infrastructure
  • Hands-on labs: Build chatbots, text summarizers, machine translation
  • Apply LLMOps DevOps practices - CI/CD, monitoring, security
  • Techniques for memory safety, multithreading, lock-free concurrency
  • Best practices for LLMOps reliability, scalability, cost optimization
  • Real-world projects demonstrating production-ready LLMOps expertise

Syllabus

Module 1: DevOps Concepts for MLOps (6 hours)

\- Instructor Intro (Video - 1 minute)

\- A Function, the Essence of Programming (Video - 6 minutes)

\- Operationalize Microservices (Video - 1 minute)

\- Continuous Integration for Microservices (Video - 6 minutes)

\- What is Makefile and how do you use it? (Video - 2 minutes)

\- What is DevOps? (Video - 2 minutes)

\- Kaizen methodology (Video - 4 minutes)

\- Infrastructure as Code for Continuous Delivery (Video - 2 minutes)

\- Responding to Compromised Resources and Workloads (Video - 4 minutes)

\- Designing and Implementing Monitoring and Alerting (Video - 1 minute)

\- Audit Network Security (Video - 1 minute)

\- Rust Secure by Design (Video - 4 minutes)

\- Preventing Data Races with Rust Compiler (Video - 3 minutes)

\- Using AWS Config for Security (Video - 4 minutes)

\- AWS Security Hub Demo (Video - 3 minutes)

\- Explain How to Secure Your Account with 2FA (Video - 3 minutes)

\- Understanding Access Permissions (Video - 4 minutes)

\- Repository Permission Levels Explained (Video - 2 minutes)

\- Repository Privacy Settings and Options (Video - 2 minutes)

\- Unveiling Key Concepts of the GitHub Ecosystem (Video - 3 minutes)

\- Demo: Implementing GitHub Actions (Video - 3 minutes)

\- Demo: GitHub Codespaces (Video - 6 minutes)

\- Demo: GitHub Copilot (Video - 8 minutes)

\- Source Code Resources (Reading - 10 minutes)

\- Infrastructure as code (Reading - 10 minutes)

\- Continuous integration (Reading - 10 minutes)

\- Continuous delivery (Reading - 10 minutes)

\- Automation and tooling (Reading - 10 minutes)

\- Shared responsibility (Reading - 10 minutes)

\- Identity and access management (Reading - 10 minutes)

\- Infrastructure protection (Reading - 10 minutes)

\- Incident response (Reading - 10 minutes)

\- External Lab: Use GitHub Actions and Codespaces (Reading - 10 minutes)

\- About two-factor authentication (Reading - 10 minutes)

\- Access permissions on GitHub (Reading - 10 minutes)

\- About Continuous Integration (Reading - 10 minutes)

\- About continuous deployment (Reading - 10 minutes)

\- Final Week-Reflections (Reading - 10 minutes)

\- DevOps Concepts for MLOps (Quiz - 30 minutes)

\- Lab: Using a Makefile with Rust (Ungraded Lab - 60 minutes)

\- Lab: Preventing Data Races in Rust (Ungraded Lab - 60 minutes)

Module 2: Rust Hugging Face Candle (4 hours)

\- Candle: A Minimalist ML Framework for Rust (Video - 2 minutes)

\- Using GitHub Codespaces for GPU Inference with Rust Candle (Video - 5 minutes)

\- VSCode Remote SSH development AWS Accelerated Compute (Video - 5 minutes)

\- Building Hello World Candle (Video - 2 minutes)

\- Exploring StarCoder: A State-of-the-Art LLM (Video - 5 minutes)

\- Using Whisper with Candle to Transcribe (Video - 5 minutes)

\- Exploring Remote Dev Architectures on AWS (Video - 2 minutes)

\- Advantages of Rust for LLMs (Video - 1 minute)

\- Serverless Inference (Video - 1 minute)

\- Rust CLI Inference (Video - 2 minutes)

\- Rust Chat Inference (Video - 1 minute)

\- Continuous Build of Binaries for LLMOps (Video - 2 minutes)

\- Chat Loop for StarCoder (Video - 2 minutes)

\- Invoking Rust Candle on AWS G5-Part One (Video - 4 minutes)

\- Invoking BigCode on AWS G5-Part Two (Video - 3 minutes)

\- rust-candle-demos (Reading - 10 minutes)

\- Configuring NVIDIA CUDA for your codespace (Reading - 10 minutes)

\- Getting Started Candle (Reading - 10 minutes)

\- Candle Examples (Reading - 10 minutes)

\- External Lab: Candle Hello World (Reading - 10 minutes)

\- External Lab: Run an LLM with Candle (Reading - 10 minutes)

\- Developer Guide cuDNN (Reading - 10 minutes)

\- cuDNN Webinar (Reading - 10 minutes)

\- Programming Tensor Cores in CUDA 9 (Reading - 10 minutes)

\- Tensor Ops Made Easier in cuDNN (Reading - 10 minutes)

\- External Lab: Using BigCode to Assist With Coding (Reading - 10 minutes)

\- StarCoder: A State-of-the-Art LLM for Code (Reading - 10 minutes)

\- Falcon LLM (Reading - 10 minutes)

\- Whisper LLM (Reading - 10 minutes)

\- Candle Structure (Reading - 10 minutes)

\- Final Week Reflection (Reading - 10 minutes)

\- Rust Hugging Face Candle (Quiz - 30 minutes)

Module 3: Key LLMOps Technologies (3 hours)

\- Introduction to Rust Bert (Video - 1 minute)

\- Installation and Setup (Video - 5 minutes)

\- Basic Syntax and Model Loading (Video - 2 minutes)

\- Building a sentiment analysis CLI (Video - 4 minutes)

\- Introduction to Rust PyTorch (Video - 1 minute)

\- Running a PyTorch Hello World (Video - 2 minutes)

\- PyTorch Pretrained (Video - 3 minutes)

\- Running PyTorch Pretrained (Video - 6 minutes)

\- Introduction to ONNX (Video - 1 minute)

\- ONNX Conversions (Video - 2 minutes)

\- Getting Started Rust Bert (Reading - 10 minutes)

\- External Lab: Translate a Spanish song to English (Reading - 10 minutes)

\- Rust Bert pipelines (Reading - 10 minutes)

\- ONNX Support Rust Bert (Reading - 10 minutes)

\- Loading pretrained and custom model weights (Reading - 10 minutes)

\- External Lab: Run a Pretrained model (Reading - 10 minutes)

\- Rust bindings for PyTorch (Reading - 10 minutes)

\- ONNX Concepts (Reading - 10 minutes)

\- ONNX with Python (Reading - 10 minutes)

\- Converters (Reading - 10 minutes)

\- ONNX Model Hub (Reading - 10 minutes)

\- Final Week-Reflections (Reading - 10 minutes)

\- External Lab: Use ONNX (Reading - 10 minutes)

\- Using Rust Bert (Quiz - 30 minutes)

Module 4: Key Generative AI Technologies (3 hours)

\- Extending Google Bard (Video - 4 minutes)

\- Exploring Google Colab with Bard (Video - 4 minutes)

\- Exploring Colab AI (Video - 4 minutes)

\- Exploring Gen App Builder (Video - 2 minutes)

\- Responsible AI with AWS Bedrock (Video - 4 minutes)

\- AWS Bedrock with Claude (Video - 7 minutes)

\- Summarizing text with Claude (Video - 5 minutes)

\- Using the AWS Bedrock API (Video - 1 minute)

\- Live Coding AWS CodeWhisperer Part One (Video - 6 minutes)

\- Live Coding AWS CodeWhisperer Part Two (Video - 14 minutes)

\- Live Coding AWS CodeWhisperer Part Three (Video - 7 minutes)

\- Using AWS CodeWhisperer CLI (Video - 3 minutes)

\- Bard FAQ (Reading - 10 minutes)

\- External Lab: Build a plot with Colab AI (Reading - 10 minutes)

\- External Lab: AWS Bedrock (Reading - 10 minutes)

\- AWS Cloud Adoption Framework for Artificial Intelligence, Machine Learning, and Generative AI (Reading - 10 minutes)

\- People perspective: Culture and change towards AI/ML-first (Reading - 10 minutes)

\- External Lab: Use CodeWhisperer for Rust Calculator (Reading - 10 minutes)

\- Key LLMOps Technologies (Quiz - 30 minutes)

\- Final-Quiz (Quiz - 30 minutes)


Taught by

Noah Gift

Related Courses

Advanced Golang Concepts
Edureka via Coursera
D
Exercism
Rust
Exercism
Go Programming Language
Edureka via Coursera
Introduction to Open Source Application Development
Illinois Institute of Technology via Coursera