7-Day Production Engineering

LLM Systems Engineering

GPU architecture, inference engines, distributed training, RAG pipelines, and production observability — everything you need to build and operate LLMs at scale.

42 min · 9 chapters · Beginner to Advanced

Day 1

GPU Foundations

GPU/VRAM Basics

Quantization & Batching

10 min · 2 chapters

Day 2

Inference Engines

vLLM & TRT-LLM

5 min · 1 chapter

Day 3

KV Cache & Speed

KV Cache & Speculative Decoding

5 min · 1 chapter

Day 4

Distributed Training

Distributed Training

5 min · 1 chapter

Day 5

Serving at Scale

Model Serving

5 min · 1 chapter

Day 6

Vector DBs & RAG

Vector DB & RAG

4 min · 1 chapter

Day 7

Cost & Observability

Cost Optimization

LLM Observability

9 min · 2 chapters