Blog
Thoughts on building with AI, engineering, and startups.
The Agent Trust Budget You Do Not Measure
3 June 2026AI agents earn trust in small deposits and lose it in large withdrawals. This is the trust budget — and most teams are operating near zero without knowing it.
5 min read
Day 26: Make the Review Path Smaller
3 June 2026Day 26 of the 60 Day OSS Sprint: lockstep, airgapquery, and policydiff all push agent work toward the same discipline - shrink the review path before asking anyone to trust the output.
7 min read
Day 27: Context Has to Travel
3 June 2026Day 27 of the 60 Day OSS Sprint: ctxquilt, secretshape, and smokegrid all point at the same agent workflow problem - context, secrets, and smoke evidence need portable shapes before speed can compound.
7 min read
Day 28: Evidence Needs a Shape
3 June 2026Day 28 of the 60 Day OSS Sprint: repocapsule, agenthandoff, and rundossier all converge on one rule - agent work is only as trustworthy as the evidence shape it leaves behind.
6 min read
Agent Input Contracts Beat Output Polish
2 June 2026AI coding agents get safer and faster when the work starts with explicit inputs: scope, permissions, fixtures, proof requirements, and review contracts instead of prettier final summaries.
5 min read
Why DepScreen Exists
2 June 2026DepScreen is a local-first dependency review CLI for agent-built JavaScript projects. It exists because dependency changes need deterministic prompts before they become supply-chain risk.
4 min read
The Trust Budget Is the Real Agent Constraint
1 June 2026AI coding agents do not only consume tokens and time. They consume trust, and the winning workflows are the ones that spend that trust deliberately with small scopes, receipts, and reviewable proof.
5 min read
Why CutPilot Exists
1 June 2026CutPilot is a local-first agentic video editing helper for transcript-aware EDLs, ffmpeg render plans, and reviewable artifacts around small cuts from local footage.
6 min read
Day 25: The Review Surface Is the Product
31 May 2026Day 25 of the 60 Day OSS Sprint: agentattest, toolbill, and depscreen all point at the same bottleneck in agentic engineering - the work is only as useful as the evidence a reviewer can inspect.
6 min read
Agent Workflows Need Staging Layers
30 May 2026The safest AI agent workflows do not jump straight from prompt to production action. They rehearse in worktrees, fixtures, dry runs, proof bundles, and review queues first.
5 min read
Why TypeGap Exists
30 May 2026TypeGap is a local-first TypeScript type coverage auditor for finding any, unknown, and missing annotations before agent-generated code turns type safety into theatre.
5 min read
Make Uncertainty Cheap
29 May 2026AI agent workflows improve when doubt is cheap to surface, preserve, and review. The strongest harnesses make uncertainty explicit before it becomes rework.
6 min read
Why SlackCache Exists
29 May 2026SlackCache exists because AI agents need scoped, local, reviewable workspace context before Slack history becomes another opaque source of private operational memory.
6 min read
The Agent Should Not Be the Only Witness
28 May 2026AI agent verification gets stronger when proof comes from independent tools instead of the same model that created the work. The witness layer is what makes agent output reviewable.
6 min read
Why ToolMirror Exists
28 May 2026ToolMirror is a local-first registry and drift detector for agent tool catalogs, built because MCP and coding-agent tools need reviewable schemas, docs, diffs, and risk signals.
6 min read
The Fastest Agent Has Less to Decide
27 May 2026AI coding agents get faster and safer when the workflow removes unnecessary choices before the model starts: scope, tools, verification, and handoff shape.
6 min read
Why TaskBrief Exists
27 May 2026TaskBrief exists because AI coding agents need scoped, risk-aware task queues from messy human input before they need more autonomy or bigger context windows.
5 min read
Agent Triage Is the Work Now
26 May 2026AI coding agents are making implementation cheaper, but the operating advantage is moving to triage: deciding what is ready, risky, blocked, duplicated, or worth doing next.
6 min read
Why ReviewCue Exists
26 May 2026ReviewCue exists because AI code review needs a deterministic local packet before the model sees the diff: changed files, relevant context, risk cues, and reviewer questions.
5 min read
Why RunReceipt Exists
25 May 2026RunReceipt is a local-first CLI for capturing command output, hashes, selected environment context, and verification receipts so agent work can prove what actually ran.
6 min read
Day 24: Build for First Failure, Not First Success
23 May 2026Day 24 of the 60 Day OSS Sprint: failureseed, testgold, and testmatrix built the same case from different directions — agent workflows need deterministic failure capture, honest golden comparisons, and repeated verification before anyone trusts a green checkmark.
6 min read
When the Agent Fails First
23 May 2026The most useful tool in an agentic workflow is not the one that makes the agent faster. It is the one that captures exactly how the agent failed so the next person does not have to guess.
5 min read
Docs Are Agent Inputs, Not Afterthoughts
21 May 2026AI coding agents treat documentation as execution context. That makes stale README examples, missing scripts, and broken local links a real software quality problem.
6 min read
Why DocFresh Exists
21 May 2026DocFresh exists because stale README commands and broken local docs are no longer harmless polish issues. They are bad inputs for agents, releases, and reviewers.
6 min read
Review Latency Is the Real Agent Bottleneck
19 May 2026AI coding agents are getting faster, but the next operational advantage comes from reducing review latency with smaller changes, stronger proof, and cleaner handoffs.
6 min read
Why MCPSeal Exists
19 May 2026MCPSeal exists because MCP server configs are permission documents, and agent workflows need a local-first way to review risky commands, env exposure, filesystem access, and tool descriptions before connection.
6 min read
Agent Tools Need Boring Interfaces
18 May 2026The next useful AI agent tools will not win by being more magical. They will win by turning messy agent work into boring contracts, checks, reports, and review surfaces.
7 min read
Why ArtifactMap Exists
18 May 2026ArtifactMap is a local-first CLI for inventorying generated files, build outputs, caches, packages, and report artifacts so agent-built repos can explain what belongs in review.
7 min read
The First Safe Task Is the Missing Agent Primitive
17 May 2026AI coding agents need more than context. They need a first safe task: a bounded, low-risk entry point that turns repository onboarding into reviewable progress.
6 min read
Why GitCleanroom Exists
17 May 2026GitCleanroom exists because AI coding agents need disposable, inspectable git workrooms before risky edits become dirty checkouts, branch collisions, or cleanup chores.
6 min read
Day 23: Fixtures Need a Memory
16 May 2026Day 23 of the 60 Day OSS Sprint: ClipCase, FetchFreeze, and TestSeed made the same point from different directions — agent workflows need deterministic memory for context, HTTP, and fixture data.
6 min read
Why RunbookLint Exists
16 May 2026RunbookLint is a local-first Markdown linter for operational runbooks, built because release, incident, support, and agent handoff docs are executable enough to deserve checks.
6 min read
Day 22: Release Evidence Without Leaking the Run
15 May 2026Day 22 of the 60 Day OSS Sprint: ShipCue, SmokeTape, and ArgVault all pushed on the same constraint — release evidence has to be useful without exposing secrets or pretending advice is authority.
6 min read
Why NativePilot Exists
15 May 2026NativePilot exists because AI mobile app starters need more than screens and dependencies — they need explicit provider seams, secret boundaries, and checks that coding agents can understand.
5 min read
Preflight Is Where Agent Quality Actually Starts
14 May 2026AI coding agents get more reliable when the workflow checks ports, configs, permissions, dependencies, and evidence before the model starts improvising. Preflight is not ceremony; it is how speed becomes reviewable.
5 min read
Why PortPatrol Exists
14 May 2026PortPatrol exists because AI coding agents need a deterministic map of local dev ports before they start servers, run smokes, or accidentally build on top of a mystery process.
5 min read
Agent Speed Has a Packaging Problem
13 May 2026AI coding agents can create repos quickly, but the real leverage shows up only when package metadata, entrypoints, smoke tests, and release checks make the output installable and reviewable.
6 min read
What Budget 2026 Means for AI Startup Founders
13 May 2026A balanced founder-focused read on Australia's 2026 Budget, tax incentives, company structure, PSI risk, trusts, capital formation and AI startup wealth creation.
12 min read
Why ShellGarden Exists
13 May 2026ShellGarden is a local-first harness for keeping README commands, docs examples, and agent handoffs executable with fixture-backed transcripts instead of stale copy-paste snippets.
6 min read
Day 21: Trust the Boring Evidence First
12 May 2026Day 21 of the 60 Day OSS Sprint: SchemaSeal, LogLatch, and FlakeRadar all point at the same operating principle — agents need boring evidence before they need bigger claims.
5 min read
The Next AI Agent Interface Is the Review Queue
12 May 2026AI agent products are still overfocused on chat windows. The durable interface is the review queue: tasks, proof, risk, decisions, and clean human handoffs.
6 min read
Why FlakeRadar Exists
12 May 2026FlakeRadar is a local-first command harness for catching flaky checks before one lucky pass convinces an AI coding agent, CI job, or human reviewer that the work is safe.
6 min read
The Best AI Agent Tools Are Harnesses, Not Copilots
11 May 2026AI coding agents get more useful when the product is not another chat surface, but the harness around the work: scoped inputs, deterministic checks, proof, and reviewable handoffs.
6 min read
Day 20: The Risk Is Usually Hiding in Plain Text
11 May 2026Day 20 of the 60 Day OSS Sprint: ActionPin, PromptLintel, and Lockstep made the same point from different angles — agent workflows need boring checks around CI, instructions, and package drift.
6 min read
Why ActionPin Exists
11 May 2026ActionPin is a local-first GitHub Actions workflow risk checker for the CI mistakes that agents and humans both miss: mutable actions, broad permissions, risky events, and curl-bash patterns.
6 min read
Day 19: Prompt Contracts Need Snapshots Too
10 May 2026Day 19 of the 60 Day OSS Sprint: PromptSnap, ContainerGhost, and Taskbrief all exposed the same operator lesson — agent workflows need reviewable contracts before they need more autonomy.
6 min read
Good Agent Tools Fail Closed
10 May 2026AI agents do not need more theatrical confidence. They need tools that stop safely when input, evidence, permissions, or verification are missing.
6 min read
Why PromptSnap Exists
10 May 2026PromptSnap is a local-first snapshot tester for prompts, skills, and agent instruction packs. It exists because prompt drift should be reviewable before it changes agent behavior.
6 min read
Agent Autonomy Needs Smaller Blast Radiuses
9 May 2026More autonomous AI agents are only useful when their permissions, context, and output paths are small enough to review and recover from.
6 min read
Day 17: Release Checks Are a Product Feature
9 May 2026Day 17 of the 60 Day OSS Sprint: TermAgent, ProtoBridge PHP, and QQBridge pushed the same lesson from different angles: release checks are part of the product, not paperwork after the fact.
5 min read
Day 18: Handoffs Are Where Speed Either Compounds or Dies
9 May 2026Day 18 of the 60 Day OSS Sprint: HandoffPad, ReplayPack, and IssueCraft made the same point from different angles — agent speed only compounds when the next reviewer gets usable evidence.
6 min read
Why GuardrailMD Exists
9 May 2026GuardrailMD is a local-first Markdown safety linter for agent runbooks, built because instructions are now executable enough to deserve review.
6 min read
Why TermAgent Exists
9 May 2026TermAgent is a local-first terminal-agent harness for reproducible tasks, command-review checkpoints, transcript export, and proof bundles. The bigger idea is simple: terminal agents need receipts.
5 min read
Day 16: Proof Before Publish
8 May 2026Day 16 of the 60 Day OSS Sprint: ProofDock, FailureSeed, and EnvProbe made release work feel less like trust and more like evidence a reviewer can actually inspect.
5 min read
Small Contracts Beat Big Prompts for AI Agents
8 May 2026AI agent quality improves when workflows rely less on giant prompts and more on small contracts: inputs, permissions, outputs, checks, and proof that can be reviewed.
5 min read
Why ProofDock Exists
8 May 2026ProofDock exists because AI coding agents need portable proof bundles: artifacts, checks, risks, and review notes that make a change inspectable after the agent leaves.
4 min read
Every AI Agent Needs a Privilege Budget
7 May 2026AI agent workflows get safer when permissions are treated like a budget: local fixtures first, explicit network access, scoped writes, and proof before irreversible actions.
5 min read
Day 15: Fixtures Before Live Data
7 May 2026Day 15 of the 60 Day OSS Sprint: ColbertCache, CrawlForge, and DexWatch made the same argument from different angles — agents need deterministic fixtures before they touch live data.
5 min read
Why ColbertCache Exists
7 May 2026ColbertCache is a tiny fixture mirror for retrieval demos because RAG agents need manifests, checksums, and provenance before anyone should trust the answer.
5 min read
Day 14: Make the Blast Radius Smaller
6 May 2026Day 14 of the 60 Day OSS Sprint: WorktreeGuard, ActionLoom, and LockfileLens all pushed toward the same rule — agent speed only helps when the failure boundary is small.
5 min read
Local-First Is the Trust Layer for AI Agent Tools
6 May 2026AI coding agents become easier to trust when the tools around them inspect local state first, avoid hidden network calls, and hand humans reviewable evidence.
5 min read
Why LockfileLens Exists
6 May 2026LockfileLens is a local-first dependency lockfile explainer for agent handoffs and PR reviews, built because package drift should not hide inside unreadable diffs.
5 min read
Day 13: Release Receipts Before Release Automation
5 May 2026Day 13 of the 60 Day OSS Sprint: ReleaseBox, ProofDock, and LockfileLens pushed the sprint toward release receipts before any agent gets near a publish button.
5 min read
Receipts Beat Autonomy in AI Agent Workflows
5 May 2026The fastest AI agent workflow is not the one with the most autonomy. It is the one where every handoff has receipts, proof, and a clear review decision.
4 min read
Why ReleaseBox Exists
5 May 2026ReleaseBox is a local-first release readiness tool for OSS CLIs and apps. It exists because publishing should stay review-gated until install, smoke, dry-run, and release notes are proven.
5 min read
Day 12: Release Readiness Is Part of the Build
4 May 2026Day 12 of the 60 Day OSS Sprint: ReleaseBox, StackForge, and the dogfood rollout made release readiness a default part of agent-built OSS instead of an afterthought.
6 min read
Why TokenPress Exists
4 May 2026TokenPress is a local-first transcript compactor for AI agent logs, built around one blunt idea: keep the evidence, cut the scrollback, and make handoffs cheaper to review.
5 min read
Day 10: Turning Agent Speed into Something You Can Trust
3 May 2026Day 10 of the 60 Day OSS Sprint: Agent-QC, ProofDock, and ToolTrace all point at the same problem — fast agents need better proof, not louder confidence.
5 min read
Day 11: The Review Queue Became the Product
3 May 2026Day 11 of the 60 Day OSS Sprint: prpack, lockfilelens, and workspacewire all point at the same constraint — agents only scale when the review queue gets sharper.
5 min read
Deterministic Agents Beat Charismatic Agents
3 May 2026The next leap in AI coding agents will come less from smoother personalities and more from deterministic harnesses: briefs, gates, proofs, timelines, and reviewable handoffs.
5 min read
Why WorkspaceWire Exists
3 May 2026WorkspaceWire is a local-first Google Workspace operation planner for agents — built around dry runs, least-privilege scopes, and human review before external action.
5 min read
Day 9: The Difference Between Shipping Fast and Shipping Blind
2 May 2026Day 9 of the 60 Day OSS Sprint: worktree isolation, environment probes, and proof bundles turned fast agentic OSS work into reviewable shipping instead of blind velocity.
6 min read
The Agent Handoff Layer Is Where Trust Gets Built
2 May 2026AI coding agents do not fail only while writing code. They fail in the handoff: missing proof, vague summaries, dirty branches, and review work shifted back onto the human.
5 min read
Why Agent-QC Exists
2 May 2026Agent-QC is a small deterministic gate for agentic development workflows, starting with the boring PR-body failure that quietly breaks human review.
5 min read
Day 8: Shipping Fast Without Letting the Floor Drop Out
1 May 2026Day 8 put the sprint under real pressure: early scaffolded repos, provider friction, and the lesson that speed only matters if verification survives.
3 min read
Day 7: Building the Factory Before Building the Repos
30 Apr 2026Day 7 brought [StackForge](https://github.com/rogerchappel/stackforge) into the sprint: a deterministic CLI-first project factory for scaffolding agent-friendly repos without hidden network calls or autonomous publishing.
3 min read
Day 6: The Agent Playbook Became the Product
29 Apr 2026Day 6 turned one-off agent prompting into an operating model: branches as execution contexts, atomic commits as checkpoints, and verification as the handoff gate.
3 min read
Day 5: The Template Is the Quality Bar
28 Apr 2026Day 5 used the agentic OSS template to turn repo setup from vibes into repeatable defaults: policy, CI, validation, security docs, and review discipline.
3 min read
Day 4: Context Is Infrastructure
27 Apr 2026Day 4 centered on repoctx and the missing memory layer agents need: deterministic repository context, commands, risks, and policy before implementation starts.
3 min read
Day 3: Atomic Commits Are a Safety Feature
26 Apr 2026Day 3 focused on atomcommit and the idea that fast AI-generated code only stays useful when every change can be reviewed, explained, and rolled back.
3 min read
Day 2: Agents Need Briefs, Not Wishes
25 Apr 2026Day 2 turned rough sprint intent into scoped task briefs, proving that AI coding agents need boundaries, risk, and verification before they need more autonomy.
3 min read
Day 1: The OSS Sprint Started With an Idea Queue
24 Apr 2026The first day of the 60 Day OSS Sprint was not about writing code faster. It was about turning open-source ideas into a queue agents could safely build from.
3 min read
Day 0: The Bet Behind the 60 Day OSS Sprint
23 Apr 2026Why I’m attempting a 60-day open source sprint, what I’m trying to prove about AI-assisted building, and how I’ll judge the work beyond raw output volume.
8 min read
Why Most AI SaaS Founders Don't Have a Product Problem, They Have a Distribution Problem
23 Apr 2026A lot of AI founders think they need better features. Most of them need better distribution. Here's how to tell the difference, and what to fix first.
7 min read
How I Decide What SaaS Ideas Are Worth Building
21 Apr 2026My framework for choosing SaaS ideas in 2026, from solving my own problems and automating workflows to testing moats, distribution, and revenue potential before I build.
8 min read
The Founder's Guide to Open Source as Marketing
11 Apr 2026Open source isn't charity. It's the highest-ROI distribution channel for founders who can build. The strategy, the metrics, and the real playbooks.
7 min read
Stop Building Features. Start Building Pipelines.
11 Apr 2026The next wave of software isn't feature-driven. It's workflow-driven. Here's why pipelines create switching distance that features never will.
9 min read
The Content Engine Problem Nobody Talks About
11 Apr 2026Scaling content with AI is easy. Making it actually feel like you, and not generic slop, is the hardest thing about building a brand in public.
8 min read
The Hidden Cost of Running AI (And How to Keep It Profitable)
11 Apr 2026Token costs kill more AI businesses than bad ideas do. Here's the real math behind AI SaaS margins and how to stay profitable.
10 min read
Why AI Agents Are the Hardest Engineering Problem You'll Never Talk About
11 Apr 2026Building the agent is easy. Getting it to actually work reliably in production, without hallucinating, without burning tokens, without going completely off rails, that's the real problem nobody demos.
10 min read
Bring Your Own Agents to Work
9 Apr 2026Personal AI agents are coming to work. The hard part isn't adoption. It's governance, permissions, auditability, and safe interfaces.
11 min read
The Interface Layer for Personal AI
9 Apr 2026Personal AI will not live in more dashboards. It will need one interface layer across watch, phone, car, and headset.
9 min read
The SaaS I'm Building for a Problem Nobody Talks About
9 Apr 2026Why I built Soply around institutional knowledge rot, the invisible tax of undocumented work, and why this problem matters more than most teams realize.
5 min read
How to Manage a Team of AI Agents
8 Apr 2026A practical guide to AI agent orchestration: org charts, task boards, dispatch patterns, and lessons from running 10+ agents daily.
12 min read
How to Secure AI-Generated Code Before Production
8 Apr 2026AI can help you ship faster, but it can also help you ship vulnerabilities faster. Here's a practical security checklist for AI-built apps before they hit production.
9 min read
Building AI Agents So I Can Live More
8 Apr 2026Why I'm building an AI operating layer to create real output, reduce screen time, and buy back more life, not just more commits.
10 min read
Smart Token Consumption Is the New 10x Engineer
8 Apr 2026The biggest gains in agentic engineering come from better token economics: right-sized models, tighter loops, less context bloat, and fewer wasteful check-ins.
9 min read
The 100x Engineer Doesn't Write Code
8 Apr 2026Programming complexity has always been abstracted upward. LLMs are the next layer, and the best engineers are becoming agent orchestrators.
9 min read
The API Is the Product for AI Features
8 Apr 2026The model call is only one dependency. If you want AI features that survive real usage, the API contract, latency budget, fallbacks, and orchestration layer matter more than the demo.
7 min read
Why Most AI Apps Die in the Backend
8 Apr 2026Prompts and polished UI get attention, but most AI products actually fail in queues, retries, state, evals, and data contracts.
8 min read
I Built 5 Products at 6AM and Nobody Cared
7 Apr 2026Five years of building SaaS before sunrise. Five products ready to sell. Zero audience. Here's what went wrong and what I'm doing about it.
6 min read
Why 80% of AI Startups Will Fail
7 Apr 2026AI makes code free. That changes what wins. Here's what I'm betting on instead.
6 min read
I Gave My AI Agent Team an Org Chart
6 Apr 2026What happens when you treat AI agents like employees: task boards, specializations, dispatch rules, and the software I built to manage it all.
4 min read
The $0 Stack: Shipping SaaS on Free Tiers
5 Apr 2026The exact tools I use to build and ship multiple SaaS products without spending a cent on infrastructure. Neon, Vercel, Cloudflare, and more.
4 min read
Why I'm Building a Portfolio of AI-Native SaaS Products
3 Apr 2026The thesis behind Axislabs: multiple AI-native bets, one holding company, and why the timing matters more than people think.
3 min read
Building with AI Agents: What I've Learned So Far
1 Apr 2026Practical lessons from building AI agent-powered products at Axislabs — what works, what doesn't, and where this is all heading.
3 min read