Research Journal

Most people stop after building GPT from scratch.
This is what happens when you don't.

Building a language model from scratch is a milestone — but finishing the tutorial is not the same as understanding the model. This journal is about what comes after: actually running experiments, reading the training curves, and learning to speak the language of LLMs. The goal is to go as deep as possible — from pretraining to SFT to RLVR — and share every observation along the way.

Experiments