🌳 Intermediate (20-50h)
Inner Alignment
Mesa-optimization, deceptive alignment, and instrumental convergence
Articles: 3
Estimated time: 10-15h
Articles in this module
1. Mesa-Optimization (start here), 30 min
   When learned models develop their own optimization processes
2. Deceptive Alignment, 35 min
   When AI systems appear aligned but pursue different goals
3. Proxy Alignment, 25 min
   The risks of optimizing for proxies instead of true objectives
Next step
Once you've completed this module, move to the next level to deepen your understanding.