Guide

How to apply Context Architecture to an existing codebase

Context Architecture is applied to a codebase that already exists, by reworking it in small steps, never by scaffolding a new one. You read the repo as someone with no context, fix the documentation that lies, and back every claim it makes about itself with a check that fails when the claim stops being true.

Read as

A repo starts clean and grows for three years. Conventions drift. The docs stop matching the code. The folder names tell you which framework built the thing, not what the thing does.

Now hand it to a reader with no memory: someone on their first day, or an AI agent started cold. Ask for one change. They cannot tell what anything means, where the change goes, or which of two competing patterns is the current one. The repo grew. Nobody designed it to be read.

That is the problem this fixes. The specification says what Context Architecture is and why. This page is the part you do with your hands: how to apply it to a repo you already have.

The "already have" matters. A new project starts with the structure its framework hands it, and this problem does not exist yet. It shows up later, once the code grows past what one person holds in their head. So this is not a way to start a project. It is a way to fix one. You are not building a new city, you are putting street names on one that already sprawled.

Before you start: is it worth it?

It has a real cost. The restructuring up front, the checks that are themselves code you have to keep green, and a small tax on every pull request to keep each claim tied to a check.

It pays off in proportion to how much agent or multi-person work the repo takes on. Worth it on a codebase that absorbs refactors, migrations, spec-driven features, agent contributions. Not worth it on a throwaway prototype or a problem you have not figured out yet. There the tax costs more than it returns, and skipping it is the right call. Saying that out loud is part of the discipline.

The one reader to design for. Assume a reader who keeps nothing between sessions and knows only what the repo states out loud. An AI agent is exactly that reader. A new teammate is close. The metric underneath all of it is one number: how long until that reader makes a correct change.

The order of operations

Do it in small steps. You do not stop everything and restructure at once. You land one bounded, reversible change at a time, and each one ships with the check that keeps its claim honest. No big-bang rewrite.

The order runs cheapest and safest first:

Read the repo as a cold reader, and name the failure modes you hit.
Fix the docs that lie.
Put AGENTS.md at the top boundaries.
Turn your most-repeated review comment into a lint rule.
Break up one junk-drawer folder.
Make the capabilities findable.
Move toward a domain-first layout, last, and only if it earns the churn.
Wire the loop that keeps it from rotting again.

The rest of the guide is one step per section, then one full example.

Step 1: audit the repo as a cold reader

Open the repo as if you had never seen it and remember nothing. Read the top-level tree, then the boundaries, then a handful of leaf files. At each level, one question: could I make a correct change here without asking anyone? Every "no" is a defect you just found.

The defects come in five shapes. None of them is the model's fault, which is why a better model or a cleverer prompt does not fix any of them:

Reimplementation. The source of truth was not findable, so the reader rebuilds what already exists.
Invented structure. Nothing was imposed, so the reader imposes its own.
Obedience to false docs. It cites deleted files or contradicts the current code, with full confidence.
Deprecated-pattern spread. It copies the loudest pattern even when that pattern is dead.
Coin-flip on ambiguity. Two conventions coexist, so it picks whichever it read first.

Write down, per principle, a verdict and the evidence. Do it by hand, or load the Context Architecture skill into your agent and let it write the report.

What this looks like. On a payments service, the first pass finds refund logic spread across three folders (reimplementation waiting to happen), a README pointing at a deploy script that was deleted months ago (false docs), and two date helpers with different signatures (a coin-flip). Three failure modes named before you touch a line.

Step 2: fix context-rot first

Start by stopping the docs from lying. A doc that cites a deleted file or contradicts the code is worse than no doc at all, because a confident reader does what it says.

Find it by hand or by script: pull every file path, command, symbol, and link out of your README, your AGENTS.md and CLAUDE.md files, and your design docs, and check that each one still exists or still runs. Fix each lie against what the code actually does today.

Then make the rot impossible to bring back. Add a test that asserts every path the docs cite still exists on disk. Now "this doc is accurate" is a claim with a check behind it, instead of a hope.

What this looks like. The README documents a deploy.sh that was deleted a year ago. You drop the dead reference, write down the real command, and add that path-check test. The next time someone moves a file out from under a doc, the suite goes red in the same pull request, not in production six weeks later.

Step 3: place AGENTS.md at the top boundaries

Context belongs next to the code it describes, at every boundary that owns something. Put there, it ages at the same rate as the code and gets read by the same agent about to edit it. Start at the root and the two or three busiest directories. That is where each AGENTS.md buys the most legibility.

Write down only what you cannot get from reading the code: the source of truth, the invariants, the tech debt you accepted on purpose, and the reasoning a spec left behind. Keep each one short.

# AGENTS.md (billing)

Owns invoicing, refunds, and the dunning schedule.

## Source of truth
Prices come from the `pricing-engine` package, never hard-coded here.

## Invariants
- A refund never exceeds the captured amount. Enforced by `refunds/guard.test.ts`.
- All money is integer cents, no floats. Enforced by the `no-float-money` lint rule.

## Accepted tech debt
The legacy `chargeV1` path stays until the 2026-Q3 migration. Do not extend it.

Look at the invariants: each one names the check that enforces it. That is the whole point. An invariant with nothing behind it is just a new line that can rot. If the check does not exist yet, write it in the same change, or phrase the line as a known gap, not a guarantee.

Step 4: codify the loudest convention

Take the comment you leave most often in review, the one that lives only in your team's heads, and put it in the toolchain. A convention an agent cannot read is a convention it will break, every time.

This is the move the rest of the steps lean on. When a claim needs to hold, this is how it holds: a lint rule that states the convention and fails in the same place, or a type that makes the wrong thing refuse to compile.

What this looks like. The comment you leave most is "import from the package root, not deep paths." Today it lives in reviewers' heads, so an agent breaks it on its first commit.

# before: a convention that lives in reviewers' heads
"always import from the package root, never deep paths"

# after: the convention, written down and enforced
.oxlintrc.json   # a no-restricted-imports rule that fails the deep path in CI

Once the rule is in the linter, the deep path fails on the spot, with a message that cites the rule, not a reviewer who happened to be paying attention that day.

Step 5: name a junk-drawer boundary

utils/, common/, helpers/, core/, lib/. This is where responsibility goes to die. Nothing in the name pushes back on unrelated code, so the folder grows forever. Pick the worst one and split it into folders whose names each say what they own.

# before
src/utils/        # 40 unrelated files

# after
src/pricing/      # the price math that was hiding in utils
src/auth-session/ # the session helpers that were hiding in utils
src/shared/       # what is genuinely generic, kept small and dependency-free

The name is doing the work. A folder called pricing resists code that is not about pricing, because it stops fitting. If you cannot name a boundary precisely, the boundary is wrong, and shared is not the answer. A tiny shared/ for a date formatter or a result type is fine. Reaching for the generic name to dodge the question of where something belongs is the debt.

Step 6: make capabilities discoverable

A capability an agent cannot find is, to that agent, a capability that does not exist. It just gets rebuilt, or skipped. Move your scripts, generators, and commands to predictable, named places, and name them for what they do: package.json scripts, a scripts/ or skills/ directory, commands you actually wrote down.

Better, generate the list of capabilities from those conventional paths instead of keeping it by hand, and test that the list is complete. A hand-kept list is one more claim waiting to rot.

What this looks like. Three deploy and seed scripts live in one engineer's home directory and a Slack thread nobody can find. You move them into scripts/ with names that say what they do and list them in package.json. The next agent finds them where it looks first, instead of writing a fourth.

Step 7: move toward a domain-first structure (last)

The top level should say what the system does, not which framework built it: billing/, onboarding/, payments/, not controllers/, services/, utils/. The framework lives one level down, inside the domain it serves.

This is the expensive move and the one most likely to break imports, so it goes last and goes in slices. Often a partial move plus a root AGENTS.md that spells out the structure you are migrating toward buys more legibility, per file moved, than reshuffling everything at once.

# before: organized by technical layer
src/
  controllers/   services/   models/   utils/

# after: organized by domain, framework one level down
src/
  billing/
    controllers/   services/   models/   # the framework, inside the domain it serves
  onboarding/
  payments/

Keep it from sliding back with a lint rule that stops domain code from leaking into a layer folder, and keep the target structure in the root AGENTS.md so a reader who lands mid-migration knows which way is forward.

Step 8: install the metabolism

The mature version does not just check context, it feeds it. A repo that checks itself cannot rot in silence. A repo that feeds itself takes in new knowledge the moment it is created.

One line in the root AGENTS.md and the review checklist does it: when a pull request adds a source of truth or an invariant, it documents that in the same pull request, with a check behind it. Now the context grows with the system instead of trailing six months behind it.

A worked example, end to end

A service that started as one framework app and grew for three years. The tree screams the framework, not the product, and the only doc is a README that is half wrong.

# before
src/
  controllers/      # 22 files, mixed domains
  services/         # 18 files, mixed domains
  models/
  utils/            # the junk drawer
  helpers/          # a second junk drawer
README.md           # points at a deploy script deleted last year

Ask an agent to "add a partial-refund flow" here and watch it hit three of the five failure modes in one task: it cannot find where refunds live (split across controllers/, services/, models/), it rebuilds money math that already sits in utils/, and it follows the README to a deploy script that is gone.

The fix, in the order above, with no big-bang commit:

Context-rot. Drop the dead deploy reference from the README, write the real command, add a test that every path the README and AGENTS.md cite still exists.
Embedded context. A root AGENTS.md (what the service owns, the structure it is migrating toward) and one in the busiest area.
Codify. The most-repeated review comment was "money is integer cents." That becomes a no-float-money lint rule.
Name. Split utils/ and helpers/: money math becomes money/, session code becomes auth-session/, the genuinely generic remainder stays in a small shared/.
Discoverable. The ad-hoc scripts move into scripts/ with real names, listed in package.json.
Domain-first, in slices. Move refunds first: a billing/refunds/ that keeps its controller, service, and model together. The next agent asked about refunds finds them in one place.

# after
src/
  billing/
    AGENTS.md       # invariants and the source of truth for billing
    refunds/        # controller + service + model, together
    invoices/
  auth-session/
  money/            # the math that was buried in utils, now named and enforced
  shared/           # small, generic, dependency-free
scripts/            # named, listed in package.json
AGENTS.md           # the rules of the house, and the structure being migrated toward
README.md           # accurate, and a test keeps it that way

Now the same task lands in billing/refunds/, against an AGENTS.md that states the refund invariant, reusing the money/ package the lint rule already points at. The failure modes have nowhere left to happen.

Run it with the skill

Step 1 and most of the moves above are things an agent can run. The Context Architecture skill is a single file you load into your agent: it reads the repo as a cold reader, finds the docs that lie, and hands back the backlog in the order above. Point it at your repository and start with what it flags first.

Where to go next

The specification: the rule, the four pillars, the mechanism, and the eight principles in full.
The comparison with context engineering and harness engineering: which layer each one designs, and why this one sits below them at design time.
The skill: to run the rework with your own agent.
The glossary: the terms used across the specification, defined.