BLOG · NOTES FROM THE CREW

FIELD NOTES.

Engineering notes on AI agents, automation and the systems behind them — what works in production, what breaks, and what we would build differently.

Demos are easy; production is where agents meet ambiguity, rate limits and angry edge cases. A field checklist from the last year of deployments.

You cannot improve what you do not measure, and "it feels smarter" is not a metric. How we build eval suites that catch regressions before users do.

The highest-ROI automation we ship is rarely glamorous: report generation, data syncs, handoffs between tools. Boring is a feature.

Most internal tools die of neglect, not bad code. The difference between a dashboard nobody opens and a tool the team fights to keep.

How a mid-size operations team replaced a three-day manual reporting cycle with an automated pipeline — and what it changed downstream.

Provider choice is an engineering decision, not a brand preference. The trade-offs that actually matter when picking a model stack.

Tools change faster than habits. What actually works when you introduce AI agents into a team's daily workflow — beyond the kickoff demo.

Headcount is a cost, not a metric. On staying deliberately small while shipping like a bigger team — with systems doing the heavy lifting.