Skip to main content

Prerequisite chain

Prerequisites for Post-Training Overview

Topics you need before working through Post-Training Overview. Direct prerequisites are listed first; transitive prerequisites (the chain reachable through them) follow.

Direct prerequisites (6)

  1. RLHF and Alignmentlayer 4, tier 2
  2. Transformer Architecturelayer 4, tier 2
  3. Agentic RL and Tool Uselayer 5, tier 2
  4. BERT and the Pretrain-Finetune Paradigmlayer 4, tier 2
  5. Policy Optimization: PPO and TRPOlayer 3, tier 2
  6. Test-Time Compute and Searchlayer 5, tier 2

Reachable through the chain (383)

These topics are not directly cited as prerequisites but are reached transitively by following the chain upward. Working through the direct prerequisites pulls these in.