Reading List

Curated papers, blogs, and posts. The foundational papers are worth reading in full; skim the rest for mental models.

Best visual explanation of attention. Read this before the original paper.

transformers, architecture

The paper that started everything. Focus on the architecture section.

transformers, architecture, foundational

transformers, pre-training, foundational

Introduced in-context learning as a paradigm. Focus on the scaling results.

llm, few-shot, foundational

Foundation of modern chat models. Understand RLHF from first principles here.

rlhf, alignment, fine-tuning

prompting, reasoning

The paper behind most production agent loops.

agents, reasoning

Makes fine-tuning accessible without massive GPU budgets.

fine-tuning, efficiency

fine-tuning, quantization

Go-to eval framework for RAG pipelines.

rag, eval

Use as a reference, not a linear read.

rag, survey

Anthropic's approach to training helpful and harmless models.

alignment, safety

agents, tool-use

Practitioner wisdom from teams who have shipped. Read this early and re-read often.

production, applied

production, engineering