Skip to main content

Prerequisite chain

Prerequisites for RLHF and Alignment

Topics you need before working through RLHF and Alignment. Direct prerequisites are listed first; transitive prerequisites (the chain reachable through them) follow.

Direct prerequisites (5)

  1. Policy Gradient Theoremlayer 3, tier 1
  2. Markov Decision Processeslayer 2, tier 1
  3. Actor-Critic Methodslayer 3, tier 2
  4. Fine-Tuning and Adaptationlayer 3, tier 1
  5. Transformer Architecturelayer 4, tier 2

Reachable through the chain (255)

These topics are not directly cited as prerequisites but are reached transitively by following the chain upward. Working through the direct prerequisites pulls these in.