#rules

DISCOVERIES • May 31, 2026

A green build is the weakest definition of done — and the one your agent reaches for first.

Write the exact verification step into the feature spec your agent loads, so 'done' means the test passed and the result is on the table — not that the code compiled.

DISCOVERIES • May 31, 2026

The fastest way to corrupt money is to let an agent build the whole payment system.

Not all code is equally expensive to get wrong. Stop reviewing your agent's output uniformly and start spending your scrutiny where a mistake double-charges a customer.

DISCOVERIES • May 31, 2026

When you know where the context lives, stop letting the agent guess.

Agent-driven file discovery burns turns and fills the window with wrong guesses. A parameterized priming command loads exactly the right files in one deterministic shot — reserve probabilistic search for when you genuinely don't know where to look.

DISCOVERIES • May 31, 2026

Your 600-line rules file is teaching the agent to ignore you.

A single giant root rules file dilutes the agent's attention. Split persistent context into a broad root plus directory-scoped files the agent loads by what it's touching — and push task-specific instructions into commands.

DISCOVERIES • May 31, 2026

Stop writing rules for failures your agent never commits.

Most AGENTS.md files defend against imagined mistakes. Mine your own transcripts, count what the agent actually gets wrong, and let the failure distribution decide what to write — and in what order.
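
To make "let the failure distribution decide" concrete, here is a minimal sketch of the counting step. It assumes you have already read your transcripts and assigned each failure a short label by hand; the function name `rank_failures` and the labels in the usage note are hypothetical, not taken from any tool.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def rank_failures(labels: Iterable[str]) -> List[Tuple[str, int, float]]:
    """Rank hand-assigned failure labels from most to least frequent.

    Returns (label, count, share_of_all_failures) tuples. The order of the
    result is the suggested order for writing rules.
    """
    counts = Counter(labels)
    total = sum(counts.values())
    # most_common() sorts by descending count; an empty input yields [].
    return [(label, n, n / total) for label, n in counts.most_common()]
```

Calling `rank_failures(["wrong import path", "skipped tests", "wrong import path"])` puts "wrong import path" first, which suggests that rule is the one to write first.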

DISCOVERIES • May 31, 2026

You can't prompt your way out of a hallucinated output. You can validate your way out in three days.

A deterministic check on the agent's output, with one retry, kills a class of wrong answers that no prompt rewrite ever reaches — and gives you a number instead of a vibe.
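
A minimal sketch of "a deterministic check with one retry", under stated assumptions: `generate` stands in for whatever call produces the agent's output, and `check` returns `None` on success or an error message that is fed back into the retry. Both names, and the JSON check itself, are hypothetical examples rather than a prescribed design.

```python
import json
from typing import Callable, Optional, Tuple


def run_with_check(
    generate: Callable[[Optional[str]], str],
    check: Callable[[str], Optional[str]],
    max_retries: int = 1,
) -> Tuple[str, bool, int]:
    """Call generate, validate the output, and retry with the error as feedback.

    Returns (last_output, passed, attempts_used).
    """
    feedback: Optional[str] = None
    output = ""
    for attempt in range(1, max_retries + 2):
        output = generate(feedback)
        feedback = check(output)  # None means the output passed the check
        if feedback is None:
            return output, True, attempt
    return output, False, max_retries + 1


def check_json_total(output: str) -> Optional[str]:
    """Example deterministic check: output must be a JSON object with a 'total' key."""
    try:
        data = json.loads(output)
    except json.JSONDecodeError as exc:
        return f"not valid JSON: {exc.msg}"
    if not isinstance(data, dict) or "total" not in data:
        return "missing required key 'total'"
    return None
```

Logging the `passed` flag and `attempts_used` for every run is what turns the vibe into a number: a pass rate on the first try and a pass rate after the retry.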

DISCOVERIES • May 31, 2026

Three control axes are sitting in the MCP spec. Most teams use one.

An MCP server exposes resources, tools, and prompts — context the app pulls, actions the model takes, workflows the user invokes. Wiring them to the right controller is how the agent reads your conventions instead of guessing them.

DISCOVERIES • May 31, 2026

The friction that makes humans hate pre-commit hooks is what makes them perfect for agents.

A git pre-commit hook turns 'the agent should run the tests' into 'the agent cannot commit broken code' — a deterministic gate that self-corrects from its own error output and ships with every clone.
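
As a rough sketch of such a gate, a pre-commit hook can be a small script that runs the test command and exits nonzero on failure, which makes git abort the commit. The assumptions here are loud ones: `pytest` as the test runner, and a hook file kept in a tracked directory that the repository points git at through `core.hooksPath`, since `.git/hooks` itself is not copied on clone. `run_gate` and `TEST_COMMAND` are hypothetical names.

```python
#!/usr/bin/env python3
import subprocess
import sys
from typing import List

# Assumption: the project's tests run with pytest. Swap in your own command.
TEST_COMMAND = [sys.executable, "-m", "pytest", "-q"]


def run_gate(command: List[str]) -> int:
    """Run the check and return its exit code.

    On failure, the command's output is echoed to stderr so the agent sees
    the exact error and can correct itself before trying to commit again.
    """
    result = subprocess.run(command, capture_output=True, text=True)
    if result.returncode != 0:
        sys.stderr.write(result.stdout)
        sys.stderr.write(result.stderr)
    return result.returncode


# When installed as the hook, the file would end with this call:
# sys.exit(run_gate(TEST_COMMAND))
```

Any nonzero exit status from a pre-commit hook stops the commit, so the gate does not depend on the agent remembering to run anything.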

DISCOVERIES • May 31, 2026

An agent will gate three routes and forget the fourth — unless the gate is a function it has to call.

Scatter feature-gating logic inline and consistency depends on luck. Collapse the policy into one named helper and make 'call this gate' a rule, and uniform enforcement becomes the default across every endpoint the agent writes.
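
One way to picture "one named helper" is the sketch below. Everything in it is illustrative: the plan table, the `User` type, `require_feature`, and the handler are hypothetical names, and a real policy would likely live in configuration rather than a dict.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical policy table: which plan unlocks which feature.
PLAN_FEATURES = {
    "free": {"export_csv"},
    "pro": {"export_csv", "bulk_import", "audit_log"},
}


class FeatureNotAllowed(Exception):
    """Raised when a user's plan does not include the requested feature."""


@dataclass
class User:
    id: str
    plan: str


def require_feature(user: User, feature: str) -> None:
    """The single gate. Every endpoint calls this instead of checking plans inline."""
    allowed = PLAN_FEATURES.get(user.plan, set())
    if feature not in allowed:
        raise FeatureNotAllowed(f"{feature} is not available on plan {user.plan!r}")


def bulk_import_handler(user: User, rows: List[dict]) -> int:
    """Example endpoint: the gate is the first line, then the real work."""
    require_feature(user, "bulk_import")
    return len(rows)
```

With the policy in one place, the rule for the agent shrinks to a single sentence ("every handler starts with `require_feature`"), and a missing gate becomes something a reviewer or a lint check can spot by name.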

DISCOVERIES • May 31, 2026

Stop describing your work to the agent. Hand it the diff.

Asking an agent to review what you say you did grades your spin, not your code. A small MCP server that reads the live diff and carries its own rubric grades reality instead.

DISCOVERIES • May 31, 2026

Half your agent's 'hallucinations' never reached the prompt.

Before you write one more anti-hallucination instruction, split each wrong answer into a retrieval failure and a generation failure. The two need opposite fixes, and no prompt rescues context that was never pulled into scope.
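
The split can be sketched as a simple triage, assuming you have logged, for each wrong answer, the fact the agent needed and the context that was actually retrieved. The substring match is a deliberately crude stand-in for whatever matching fits your data, and the field names `expected_fact` and `retrieved_context` are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable


def classify_failure(expected_fact: str, retrieved_context: str) -> str:
    """Label one wrong answer.

    'generation': the needed fact was in context and the model still got it wrong.
    'retrieval':  the needed fact never made it into context.
    """
    if expected_fact.strip().lower() in retrieved_context.lower():
        return "generation"
    return "retrieval"


def tally(wrong_answers: Iterable[Dict[str, str]]) -> Counter:
    """Count how many wrong answers fall on each side of the split."""
    return Counter(
        classify_failure(w["expected_fact"], w["retrieved_context"])
        for w in wrong_answers
    )
```

A tally dominated by "retrieval" points at what gets loaded into context; one dominated by "generation" points at the instructions.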

DISCOVERIES • May 31, 2026

If you can describe your workflow, you don't have a workflow — you have a habit.

The correction you keep re-typing into the chat is a missing command. Fold the recurring ritual into one parameterized verb backed by a rules file, and the workflow stops varying with your mood.

DISCOVERIES • May 31, 2026

Bootstrap a service you don't understand and you've shipped a black box you can't debug.

Wiring up Stripe or a database you've never touched by vibe-coding traps you at the first failure. Point the agent at the service's docs over MCP and force it to teach you back — close its codebase gap and your knowledge gap in the same pass.

DISCOVERIES • May 31, 2026

The smartest worker on your team is the one that forgets every job the moment it ships.

Pairing an isolated subagent with the right tools and a thin slice of your conventions rebuilds the proven agent loop — read, pick a tool, act, report — without dragging dozens of tool-call logs back into your main thread.

DISCOVERIES • May 31, 2026

The best prompt you'll ever write is the one you delete.

An agent that waits for you to remember to ask it is a toy. Wire a deterministic event to a headless run that reads your rules, and the routine context work fires on its own — with your conventions already baked in.

DISCOVERIES • May 31, 2026

You already have labeled data. It's the transcripts you throw away.

Every failed agent session is a labeled example of where your context falls short. Capture them, cluster them, and fix the top category once in your rules file instead of re-correcting it forever.

DISCOVERIES • May 31, 2026

An Agent That Can't Forget Is Worse Than One With No Memory

A rules file is write-once and rots; auto-memory saves corrections but on its own judgment, in its own store. Add the human-gated curation layer the vendors leave open: a command that counts corrections and proposes durable edits, a hook that resolves conflicts, so corrections compound instead of contradicting.

DISCOVERIES • May 29, 2026

Build the Glossary the Agent Wrote

Cache the Explore

Destination and Journey

Inject the Reminder When the Agent Forgets

The Loop That Re-Reads Its Diary

The Relay, Not the Window

Write it down once. Every AI coding tool reads the same kind of file.