7-Day Production Engineering
LLM Systems Engineering
GPU architecture, inference engines, distributed training, RAG pipelines, and production observability — everything you need to build and operate LLMs at scale.
42 min · 9 chapters · Beginner to Advanced
Day 1
GPU Foundations
GPU/VRAM Basics
Quantization & Batching
10 min · 2 chapters
Day 2
Inference Engines
vLLM & TRT-LLM
5 min · 1 chapter
Day 3
KV Cache & Speed
KV Cache & Spec Decoding
5 min · 1 chapter
Day 4
Distributed Training
Distributed Training
5 min · 1 chapter
Day 5
Serving at Scale
Model Serving
5 min · 1 chapter
Day 6
Vector DBs & RAG
Vector DB & RAG
4 min · 1 chapter
Day 7
Cost & Observability
Cost Optimization
LLM Observability
9 min · 2 chapters