Harness Architecture¶
Memex is a harnessed memory operating system. The markdown vault is the source of truth, but reliability depends on the harness around the agent: where it runs, which tools it can use, what context it sees, how state survives, what gets traced, how results are verified, and which policies constrain action.
This is harness engineering applied to a knowledge vault โ the practice of investing in the scaffolding around the model (context, tools, the agentic loop, guardrails, verification, observability, state/lifecycle), because that layer, not the prompt, sets production reliability. The version itself is 0.2.0-harness-preview. The ETCLOVG map below is our enumeration of those layers.
Aligned reading: OpenAI, Harness Engineering โ https://openai.com/index/harness-engineering/ ยท Martin Fowler, Harness engineering for coding agent users โ https://martinfowler.com/articles/harness-engineering.html
ETCLOVG map¶
| Layer | Harness question | Memex implementation | Next hardening step |
|---|---|---|---|
| Execution | Where does the agent run and what bounds it? | Local workspace, Git, validation scripts, public/private vault boundary | Add sandbox profile documentation for local, CI, and cloud runs |
| Tool interface | Which capabilities are exposed and how are they described? | Agent Skills, validation scripts, GitHub remote | Add tool inventory with risk levels and owner notes |
| Context | What does the model see, retrieve, compact, or forget? | README, AGENTS-style rules, docs, vault markdown, retrieval index | Enforce context policy and state templates for long-running work |
| Lifecycle | How does work move from request to artifact? | Git commits, roadmap, templates, validation flow | Add task state files and handoff rules for multi-session work |
| Observability | What can we inspect after a run? | Git history, command outputs, validation logs, memory notes | Add JSONL trace records and run summaries |
| Verification | How do we know a run worked? | validate_vault.py, eval rubric | Add readiness check and trajectory regression suite |
| Governance | What constrains action and publication? | Public/private boundary, open-source checklist, governance.yml | Extend governance.yml with policy-sensitive approval gates |
Design principles¶
- Harness changes are system changes. A prompt, tool schema, context policy, validator, permission rule, or memory path can change agent behavior.
- Context is a scarce resource. Retrieval quality is not enough; placement, compaction, and state preservation matter.
- Observability and governance are first-class layers, not lifecycle footnotes.
- Evaluation should score the trajectory, not only the final answer.
- Durable state beats chat history for long-running work.
Practical rule¶
Every meaningful Memex change should answer seven questions:
- Execution: what environment did it assume?
- Tools: which capabilities did it use?
- Context: what evidence did it load and why?
- Lifecycle: what state or artifact continues the work?
- Observability: what trace or commit explains what happened?
- Verification: what check passed?
- Governance: what boundary or approval applied?