Reading Lists by Level
Resources organized by difficulty level
Beginner (0-20h)
Goal: Understand that there is a problem
1. AGI Ruin: A List of Lethalities (Eliezer Yudkowsky)
2. AI Alignment: Why It's Hard, and Where to Start (video - Eliezer Yudkowsky)
3. Rob Miles YouTube Channel (popular-science explainers)
- https://www.youtube.com/@RobertMilesAI
- 5-10 videos (5h)
4. The Alignment Problem (Brian Christian - book)
- Accessible, journalistic in tone
- 10-15h
Intermediate (20-100h)
Goal: Understand the main technical problems
1. Risks from Learned Optimization (Hubinger et al.)
- https://arxiv.org/abs/1906.01820
- 10h (paper + discussions)
- The base- vs. mesa-objective distinction is sketched in the note after this list
2. Concrete Problems in AI Safety (Amodei et al.)
- https://arxiv.org/abs/1606.06565
3. Embedded Agency (Garrabrant & Demski - sequence)
4. Superintelligence (Nick Bostrom - book)
- A classic; somewhat dated (2014) but foundational
- 20h
5. Eliciting Latent Knowledge (ELK) document (ARC)
6. Alignment Forum (curated posts)
- https://www.alignmentforum.org/
- 20-30h (a curated selection)
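A compact way to state the central distinction from Risks from Learned Optimization (item 1 above). The notation below is my own shorthand, not the paper's exact formalism:

```latex
% Shorthand only; not the paper's own notation.
% What the base optimizer (e.g., SGD) selects for over training data D:
\[
\theta^{\star} \;\approx\; \arg\max_{\theta}\;
  \mathbb{E}_{x \sim \mathcal{D}}\big[\, R_{\mathrm{base}}\big(f_{\theta}(x)\big) \big]
\]
% What the trained model may itself compute internally (a mesa-optimizer):
\[
f_{\theta^{\star}}(x) \;=\; \arg\max_{a}\; R_{\mathrm{mesa}}(a \mid x)
\]
% Inner alignment asks whether R_mesa agrees with R_base; pseudo-alignment
% is agreement on the training distribution but divergence off it.
```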
Advanced (100-500h)
Goal: Understand the research frontiers and start contributing
1. MIRI Research
- https://intelligence.org/research/
- Agent foundations papers
- 50-100h
2. Corrigibility (MIRI)
- https://intelligence.org/files/Corrigibility.pdf
- 20h (+ related work)
3. Logical Induction (MIRI)
- https://intelligence.org/files/LogicalInduction.pdf
- 40h (very math-heavy)
4. Iterated Amplification (Paul Christiano, all posts)
- A toy sketch of the amplify/distill loop follows this list
5. Constitutional AI + Mechanistic Interpretability (Anthropic research)
6. Debate, ELK, and related work (ARC, OpenAI)
- Papers + discussions
- 50h
7. Alignment Forum (comprehensive reading)
- Major sequences, debates
- 100-200h
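To make item 4 concrete, here is a toy sketch of the iterated amplification (IDA) loop described in Christiano's posts. It is an illustration only, not his implementation: all names (Model, amplify, distill, human_policy) are hypothetical stand-ins, and the "model" is just a lookup table standing in for a learned policy.

```python
"""Toy sketch of the iterated amplification (IDA) loop.

An illustration of the scheme from Christiano's posts, not a real
implementation. Model, amplify, distill, and human_policy are all
hypothetical stand-ins.
"""
from typing import Callable, Dict, List, Tuple

Question = str
Answer = str


class Model:
    """A trainable question-answerer; here just a lookup table."""

    def __init__(self) -> None:
        self.table: Dict[Question, Answer] = {}

    def answer(self, q: Question) -> Answer:
        return self.table.get(q, "unknown")

    def train(self, data: List[Tuple[Question, Answer]]) -> None:
        self.table.update(data)


def human_policy(q: Question, helper: Model) -> Answer:
    # The human decomposes the question, delegates a subquestion to the
    # current model, and combines the results. Finding good
    # decompositions is the hard open problem, elided here.
    sub_answer = helper.answer("background: " + q)
    return f"synthesis({q} | {sub_answer})"


def amplify(human: Callable[[Question, Model], Answer],
            model: Model) -> Callable[[Question], Answer]:
    """Amplification: a human assisted by the current model."""
    return lambda q: human(q, model)


def distill(overseer: Callable[[Question], Answer],
            questions: List[Question]) -> Model:
    """Distillation: train a fresh model to imitate the amplified overseer."""
    student = Model()
    student.train([(q, overseer(q)) for q in questions])
    return student


model = Model()
for _ in range(3):  # each round: amplify the overseer, distill it back
    model = distill(amplify(human_policy, model), ["q1", "q2"])
print(model.answer("q1"))  # -> synthesis(q1 | unknown)
```

In the real scheme the student is trained by imitation learning or RL rather than table updates; the loop structure (amplify, then distill, then repeat) is the point of the sketch.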