Reading List
Curated papers, blogs, and posts. The foundational papers are worth reading in full; the rest are worth skimming for mental models.
Best visual explanation of attention. Read this before the original paper.
transformers, architecture
The paper that started everything. Focus on the architecture section.
transformers, architecture, foundational
transformers, pre-training, foundational
Introduced in-context learning as a paradigm. Focus on the scaling results.
llm, few-shot, foundational
Foundation of modern chat models. Understand RLHF from first principles here.
rlhf, alignment, fine-tuning
prompting, reasoning
The paper behind most production agent loops.
agents, reasoning
Makes fine-tuning accessible without massive GPU budgets.
fine-tuning, efficiency
fine-tuning, quantization
Go-to eval framework for RAG pipelines.
rag, eval
Use as a reference, not a linear read.
rag, survey
Anthropic's approach to training helpful and harmless models.
alignment, safety
agents, tool-use
Practitioner wisdom from teams who have shipped. Read this early and re-read often.
production, applied
production, engineering