The Five Programmes Map

Why do LLMs work?

An opinionated, falsification-anchored atlas of the five research programmes that try to answer that question. Every claim carries an epistemic status. Every status change requires a cited paper.

5 programmes 42 papers ยท verified 41 tracked claims 5 runnable notebooks CC-BY-4.0 ยท MIT

status snapshot

Counts auto-updated from programmes/* via scripts/extract_ledger_json.py.

Why a programme map (not a paper dump)?

The existing awesome-* lists in interpretability and LLM theory are flat directories of papers. They link to good work, but they do not tell you which claims are alive, which are dead, and which are still being fought over. They treat the field as a settled science rather than as a young one with sharply contested foundations.

This repo treats why do LLMs work? as five competing research programmes in the Lakatosian sense. Each programme has a hard core (a falsifiable central claim), a protective belt (auxiliary hypotheses that absorb local refutations), and an evidence ledger. Every tracked claim has a status โ€” ๐ŸŸข supported, ๐ŸŸก contested, ๐Ÿ”ด refuted, โšช open โ€” and a falsifier.

The five programmes

Each card links to the full programme file in the repo.

The taxonomy at a glance

Programmes, key claims with status, and the supports / tension relations between programmes. Rendered from scripts/render_taxonomy.py.

Five Programmes Map

Three things you can do right now

L

Browse the falsification ledger

All 41 tracked claims, filterable by programme and status. The page is the audit trail of what the field actually believes right now.

D

Run the superposition demo

See n features packed into d<n dimensions as a function of sparsity โ€” the Elhage-et-al-2022 toy model, interactive in your browser, no install.

N

Open a notebook on Colab

Five teaching notebooks โ€” one per programme โ€” that each run in under 5 min on a free Colab T4 and end with an explicit "what this does not show."