2025

LLM post-training with GRPO
DQN for Atari Breakout from Scratch
Ideological Bias Auditing in LLMs
Streaminator: Multi-Answer Speculative Decoding
Minimal C++ Autograd Engine

2024

Torchify: Compiling a json-like file to a torch.nn.Module
Schokoban: Monte Carlo Tree Search for Solving Sokoban