Writing

Practitioner notes
from production.

Essays and open-source releases from the team. No hot takes, no vendor pitch — just data and patterns from systems we run ourselves.

Library · MIT 2026-04

context-steward

Lazy skill loading for agent systems. Claude Skills are useful when loaded on demand, expensive when loaded up front. context-steward is the npm package that solves the second problem without breaking the first.

Essay · Forthcoming 2026-05

The 85% Accuracy Trap

Per-job quality scores in multi-agent systems almost always land around 85%. That number is telling you something different than you think it is. 4,600+ scored executions from our own production, and the pattern we found.

Coming soon