Blog
All posts
Hello World
My first blog post — a quick intro to what I'll be writing about.
Personal, Writing
What Longer-Timeline Intuitions About RL Progress Missed
An argument for why AI progress did not slow in the RL regime as much as some longer-timeline intuitions expected.
AI, RL, Forecasting
PPO Explained for Beginners
A beginner-friendly breakdown of Proximal Policy Optimization — the RL algorithm that turned raw base models into useful AI assistants.
AI, RL, Machine Learning, GPU