Why do LLMs work?

An opinionated, falsification-anchored atlas of the five research programmes that try to answer that question. Every claim carries an epistemic status. Every status change requires a cited paper.

5 programmes | 42 papers · verified | 41 tracked claims | 5 runnable notebooks | CC-BY-4.0 · MIT

Why a programme map (not a paper dump)?

The existing awesome-* lists in interpretability and LLM theory are flat directories of papers. They link to good work, but they do not tell you which claims are alive, which are dead, and which are still being fought over. They treat the field as a settled science rather than as a young one with sharply contested foundations.

This repo treats "why do LLMs work?" as a question contested by five competing research programmes in the Lakatosian sense. Each programme has a hard core (a falsifiable central claim), a protective belt (auxiliary hypotheses that absorb local refutations), and an evidence ledger. Every tracked claim has a status (🟢 supported, 🟡 contested, 🔴 refuted, or ⚪ open) and a falsifier.
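The structure described above can be sketched as a small data model. This is only an illustration, not the repo's actual schema: the names `Status`, `Claim`, `Programme`, and `change_status` are hypothetical, and representing the evidence ledger as a list of tracked claims is an assumption. The one rule it enforces comes from the project description: a status change must carry a citation.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    """The four epistemic statuses used for tracked claims."""
    SUPPORTED = "🟢"
    CONTESTED = "🟡"
    REFUTED = "🔴"
    OPEN = "⚪"


@dataclass
class Claim:
    """A tracked claim: statement, falsifier, current status (hypothetical schema)."""
    text: str
    falsifier: str  # the observation that would refute the claim
    status: Status = Status.OPEN
    # Each entry records (old status, new status, citation).
    history: list[tuple[Status, Status, str]] = field(default_factory=list)

    def change_status(self, new_status: Status, citation: str) -> None:
        """Update the status; refuse the change if no paper is cited."""
        if not citation.strip():
            raise ValueError("a status change requires a cited paper")
        self.history.append((self.status, new_status, citation))
        self.status = new_status


@dataclass
class Programme:
    """A research programme: hard core, protective belt, evidence ledger."""
    name: str
    hard_core: Claim  # the central claim
    protective_belt: list[str] = field(default_factory=list)  # auxiliary hypotheses
    ledger: list[Claim] = field(default_factory=list)  # assumed: tracked claims


# Usage with placeholder text only (no real claims or papers are implied).
claim = Claim(text="<claim text>", falsifier="<what would refute it>")
claim.change_status(Status.CONTESTED, citation="<citation key>")
programme = Programme(name="<programme name>", hard_core=claim, ledger=[claim])
```

Keeping the citation check inside `change_status` means a claim cannot move between statuses without leaving an auditable record in `history`.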

The five programmes

The taxonomy at a glance

Three things you can do right now

Browse the falsification ledger

Run the superposition demo

Open a notebook on Colab