Unlock: Reward Design and Reward Misspecification

The hardest problem in RL: specifying what you want. Covers reward shaping, the potential-based shaping theorem, specification gaming, Goodhart's law in RL, and the bridge from classic RL to alignment.

258 Prerequisites, 0 Mastered, 0 Working, 198 Gaps

Prerequisite mastery: 23%

Recommended probe

Natural Language Processing Foundations is your weakest prerequisite with available questions. You haven't been assessed on this topic yet.

Reward Design and Reward Misspecification (TARGET)

Natural Language Processing Foundations (Core, WEAKEST)

Not assessed, 5 questions

Bellman Equations (Core)

Not assessed, 12 questions

Markov Decision Processes (Core)

Not assessed, 3 questions

Reinforcement Learning for Drug Discovery (Research)

No quiz