Safer multi-tenancy with Postgres's row-level security
May 16, 2026
Pushing tenant scoping from application code into the database.
Multi-Turn RL for Code Debugging
March 2026
Training a 7B model to debug code in a custom DSL environment using GRPO. Comparing prompting, supervised fine-tuning, and reinforcement learning across three categories of bugs.
Visualizing Adam's adaptive learning rates
November 2025
An interactive visualization showing how Adam adapts its step sizes in response to different gradient patterns.