Research Journal

Most people stop after building GPT from scratch.
This is what happens when you don't.

Building a language model from scratch is a milestone — but finishing the tutorial is not the same as understanding the model. This journal is about what comes after: actually running experiments, reading the training curves, and learning to speak the language of LLMs. The goal is to go as deep as possible — from pretraining to SFT to RLVR — and share every observation along the way.

Experiments

Inference · LFM2 MoE July 3, 2026

A Month Inside Open-Source Inference: One MoE, a Laptop, and a GPU

Running LiquidAI's LFM2.5-8B-A1B from a MacBook Air to an RTX 5060 Ti — throughput sweeps, quantization fidelity (perplexity vs KL-divergence), and a private eval against Claude. The reasoning I brought to each run, and where it was corrected.
Pretraining · Phase 1 April 15, 2026

GPT-2 Small from Scratch: Four Pretraining Experiments

Baseline, optimizer × warmup, init × depth, normalization × LR. What the training curves actually say.

Most people stop after building GPT from scratch.This is what happens when you don't.

A Month Inside Open-Source Inference: One MoE, a Laptop, and a GPU

GPT-2 Small from Scratch: Four Pretraining Experiments

Most people stop after building GPT from scratch.
This is what happens when you don't.