First Principles of OCI Generative AI Service - Architecture and Implementation
Offered By: Oracle via YouTube
Course Description
Overview
Explore the mechanics behind Oracle Cloud Infrastructure's Generative AI Service in this 19-minute video. Dive into the fundamentals of generative AI models, transformer architecture, and their applications in enterprise settings. Learn about achieving high accuracy in large language model outputs, retrieval augmented generation, and the basic OCI Gen AI workflow. Discover how dedicated GPU RDMA clusters enhance performance while maintaining data privacy and security. Understand the fine-tuning process, including efficient techniques like T-Few, and examine the inner workings of transformer layers. Gain insights into OCI Gen AI's cost-effective inferencing methods and the ability to pack multiple models into a single GPU cluster. Conclude with key takeaways to help you leverage this powerful service for AI applications.
Syllabus
- Intro to OCI Generative AI Service
- What are Generative AI Models?
- Transformer Model Architecture
- Encoder-Decoder Transformer Model
- Gen AI in enterprise applications
- Gen AI key to success
- Achieving high accuracy of LLM outputs
- Retrieval Augmented Generations
- Basic OCI Gen AI Workflow
- Dedicated GPU RDMA clusters
- Customer Data Privacy and Security
- Fine-Tuning the Models
- Efficient Fine-Tuning w/ T-Few
- How T-Few Fine-Tuning Works
- Inside the Transformer Layer
- OCI Gen AI's cost effective inferencing
- Packing many models in single GPU cluster
- Key Takeaways
Taught by
Oracle
Tags
Related Courses
Introduction to Data Analytics for BusinessUniversity of Colorado Boulder via Coursera Digital and the Everyday: from codes to cloud
NPTEL via Swayam Systems and Application Security
(ISC)² via Coursera Protecting Health Data in the Modern Age: Getting to Grips with the GDPR
University of Groningen via FutureLearn Teaching Impacts of Technology: Data Collection, Use, and Privacy
University of California, San Diego via Coursera