Six Agent-Memory Disciplines

Six no-training memory disciplines distilled from Stanford's AutoMem ablations: consult before write, upsert over append, and the rest of the improvement loop.

agent-memory-disciplines.md

Six Agent-Memory Disciplines (no training required)
An Ena Pragma method, distilled from Stanford's AutoMem ablations
(arxiv.org/abs/2607.01224). Each discipline is what the paper's automated
optimization converged on; each is adoptable by hand in any agent stack.

1. CONSULT BEFORE WRITE
   Search memory before adding to it. The single behavior the paper's
   training most reinforced (writes-per-search fell 54-72%), and it was
   already expressible as a prompt rule. Mechanize it: scan for near-
   duplicates before any new record lands; track your write/search ratio.

2. UPSERT OVER APPEND (for state-like facts)
   Facts with a current value (status, location, config) get updated in
   place under a stable key; append-only is for genuinely episodic records
   (events, sessions, logs). The paper's largest single structure win:
   keyed updates cut one environment's memory growth 95%. Run version
   control underneath so updating in place never destroys history.

3. AUTO-SYNC MECHANICAL STATE
   Anything derivable from the environment (inventory, status, indexes)
   is maintained by code, not by the model. Spend model judgment only on
   judgment. If a script can keep it current, a script should.

4. PRE-LOAD STANDING KNOWLEDGE
   Goals, rules, and domain facts the agent will always need go into
   memory BEFORE the run, so no capacity is wasted rediscovering them.
   (This is what a well-maintained operating doc for an agent is.)

5. REVIEW TRAJECTORIES, NOT JUST OUTCOMES
   A memory mistake at step 50 may not hurt until step 800; end-of-run
   metrics cannot tell you where memory went wrong. Periodically read
   full session records against the memory they produced, like a code
   reviewer with the execution log in hand, and revise the STRUCTURE
   (schemas, prompts, rules) based on what you find.

6. GATE STRUCTURE CHANGES ON A FIXED TEST SET
   Keep a small, fixed benchmark (same queries or tasks every time).
   Any change to schemas, retrieval, or memory rules must hold or improve
   the score, or it gets adjudicated before it merges. (The paper's rule
   is stricter, improve-only; we relax it to hold-or-improve for
   maintenance changes.) Improvement you did not measure is improvement
   you cannot trust.

Order matters: 1-4 are day-one habits, 5-6 are the improvement loop.
All six are plain text and code. None require training a model.

This method is published in full in the post Does Better AI Agent Memory Come From Training or From Structure?, which covers the evidence behind it and when to reach for it.