#rules

24 posts tagged #rules.

When you know where the context lives, stop letting the agent guess.

When you know where the context lives, stop letting the agent guess.

Agent-driven file discovery burns turns and fills the window with wrong guesses. A parameterized priming command loads exactly the right files in one deterministic shot — reserve probabilistic search for when you genuinely don't know where to look.

Your 600-line rules file is teaching the agent to ignore you.

Your 600-line rules file is teaching the agent to ignore you.

A single giant root rules file dilutes the agent's attention. Split persistent context into a broad root plus directory-scoped files the agent loads by what it's touching — and push task-specific instructions into commands.

Stop writing rules for failures your agent never commits.

Stop writing rules for failures your agent never commits.

Most AGENTS.md files defend against imagined mistakes. Mine your own transcripts, count what the agent actually gets wrong, and let the failure distribution decide what to write — and in what order.

Three control axes are sitting in the MCP spec. Most teams use one.

Three control axes are sitting in the MCP spec. Most teams use one.

An MCP server exposes resources, tools, and prompts — context the app pulls, actions the model takes, workflows the user invokes. Wiring them to the right controller is how the agent reads your conventions instead of guessing them.

Stop describing your work to the agent. Hand it the diff.

Stop describing your work to the agent. Hand it the diff.

Asking an agent to review what you say you did grades your spin, not your code. A small MCP server that reads the live diff and carries its own rubric grades reality instead.

Half your agent's 'hallucinations' never reached the prompt

Half your agent's 'hallucinations' never reached the prompt

Before you write one more anti-hallucination instruction, split each wrong answer into a retrieval failure and a generation failure. The two need opposite fixes, and no prompt rescues context that was never pulled into scope.

The best prompt you'll ever write is the one you delete.

The best prompt you'll ever write is the one you delete.

An agent that waits for you to remember to ask it is a toy. Wire a deterministic event to a headless run that reads your rules, and the routine context work fires on its own — with your conventions already baked in.

An Agent That Can't Forget Is Worse Than One With No Memory

An Agent That Can't Forget Is Worse Than One With No Memory

A rules file is write-once and rots; auto-memory saves corrections but on its own judgment, in its own store. Add the human-gated curation layer the vendors leave open: a command that counts corrections and proposes durable edits, a hook that resolves conflicts, so corrections compound instead of contradicting.

Build the Glossary the Agent Wrote

Build the Glossary the Agent Wrote

Vague prompting isn't a skill gap you fix by writing longer prompts. It's a missing shared vocabulary — and the fix is a committed glossary, not a personal habit.

Cache the Explore

Cache the Explore

The codebase is the strongest steering signal you have — stronger than your prompt or your rules. So pay the exploration cost once, freeze it into a file, and feed it to every run.

Destination and Journey

Destination and Journey

Hand the agent two committed documents — a requirements doc that fixes the destination and a phased plan that fixes the journey — and a feature too big for one window ships correctly anyway.

Inject the Reminder When the Agent Forgets

Inject the Reminder When the Agent Forgets

Your rules file is necessary but not sufficient. Persistent context degrades as the window fills — so move the rule that keeps getting ignored into a hook that fires at the exact moment it matters.

The Loop That Re-Reads Its Diary

The Loop That Re-Reads Its Diary

An autonomous agent isn't a bigger context window. It's a tiny window run many times, where the git log — not the chat history — carries the decisions between passes.

The Relay, Not the Window

The Relay, Not the Window

A million-token window won't fix a long build. Externalize state to a plan file and run a relay across fresh sessions, because quality dies long before the window fills.