Harness Architecture

Memex is a harnessed memory operating system. The markdown vault is the source of truth, but reliability depends on the harness around the agent: where it runs, which tools it can use, what context it sees, how state survives, what gets traced, how results are verified, and which policies constrain action.

This is harness engineering applied to a knowledge vault: the practice of investing in the scaffolding around the model (context, tools, the agentic loop, guardrails, verification, observability, and state/lifecycle), because that layer, not the prompt, determines reliability in production. The ETCLOVG map below is our enumeration of those layers: Execution, Tool interface, Context, Lifecycle, Observability, Verification, and Governance. The current version is 0.2.0-harness-preview.

Aligned reading: OpenAI, Harness Engineering — https://openai.com/index/harness-engineering/ · Martin Fowler, Harness engineering for coding agent users — https://martinfowler.com/articles/harness-engineering.html

ETCLOVG map

Layer	Harness question	Memex implementation	Next hardening step
Execution	Where does the agent run and what bounds it?	Local workspace, Git, validation scripts, public/private vault boundary	Add sandbox profile documentation for local, CI, and cloud runs
Tool interface	Which capabilities are exposed and how are they described?	Agent Skills, validation scripts, GitHub remote	Add tool inventory with risk levels and owner notes
Context	What does the model see, retrieve, compact, or forget?	README, AGENTS-style rules, docs, vault markdown, retrieval index	Enforce context policy and state templates for long-running work
Lifecycle	How does work move from request to artifact?	Git commits, roadmap, templates, validation flow	Add task state files and handoff rules for multi-session work
Observability	What can we inspect after a run?	Git history, command outputs, validation logs, memory notes	Add JSONL trace records and run summaries
Verification	How do we know a run worked?	validate_vault.py, eval rubric	Add readiness check and trajectory regression suite
Governance	What constrains action and publication?	Public/private boundary, open-source checklist, governance.yml	Extend governance.yml with policy-sensitive approval gates
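
The Observability row names JSONL trace records as a next hardening step. As a rough illustration of what one record could look like, here is a minimal sketch. The field names (run_id, layer, event, detail, timestamp) and the helper append_trace are assumptions made for this example, not an existing Memex schema or API.

```python
import json
from dataclasses import dataclass, asdict, field
from datetime import datetime, timezone

# The seven ETCLOVG layers, written as lowercase identifiers (an assumed convention).
LAYERS = (
    "execution", "tool_interface", "context", "lifecycle",
    "observability", "verification", "governance",
)


def _utc_now() -> str:
    """Current time as an ISO 8601 string in UTC."""
    return datetime.now(timezone.utc).isoformat()


@dataclass
class TraceRecord:
    """One line of a hypothetical JSONL trace file; all field names are illustrative."""
    run_id: str
    layer: str
    event: str
    detail: dict = field(default_factory=dict)
    timestamp: str = field(default_factory=_utc_now)

    def __post_init__(self) -> None:
        # Reject records that do not map onto one of the seven layers.
        if self.layer not in LAYERS:
            raise ValueError(f"unknown layer: {self.layer!r}")

    def to_jsonl(self) -> str:
        """Serialize to a single JSON line (no trailing newline)."""
        return json.dumps(asdict(self), sort_keys=True)


def append_trace(path: str, record: TraceRecord) -> None:
    """Append one record to a JSONL file, one JSON object per line."""
    with open(path, "a", encoding="utf-8") as fh:
        fh.write(record.to_jsonl() + "\n")
```

Tagging each record with its layer would let a run summary be grouped by the same seven rows as the map, so a reviewer could see at a glance which layers a run touched.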

Design principles

Harness changes are system changes. A prompt, tool schema, context policy, validator, permission rule, or memory path can change agent behavior.
Context is a scarce resource. Retrieval quality is not enough; placement, compaction, and state preservation matter.
Observability and governance are first-class layers, not lifecycle footnotes.
Evaluation should score the trajectory, not only the final answer.
Durable state beats chat history for long-running work.
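
To make the trajectory-scoring principle concrete, the sketch below blends final-answer correctness with per-step signals. Everything here is an assumption for illustration: the Step fields, the score_trajectory function, and the placeholder weights are not part of the Memex eval rubric.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Step:
    """One step of an agent run; the fields are illustrative assumptions."""
    tool: str
    ok: bool       # did the step succeed?
    allowed: bool  # was the tool use permitted by policy?


def score_trajectory(
    steps: Sequence[Step],
    final_correct: bool,
    weights: Tuple[float, float, float] = (0.5, 0.3, 0.2),
) -> float:
    """Combine final correctness, step success rate, and policy compliance.

    The weights are arbitrary placeholders (final, steps, policy), not tuned values.
    """
    w_final, w_steps, w_policy = weights
    final_part = w_final * float(final_correct)
    if not steps:
        # With no recorded steps, only the final answer can be scored.
        return final_part
    step_rate = sum(s.ok for s in steps) / len(steps)
    policy_rate = sum(s.allowed for s in steps) / len(steps)
    return final_part + w_steps * step_rate + w_policy * policy_rate


# Two runs with the same correct final answer but different trajectories.
clean = [Step("read_note", True, True), Step("validate", True, True)]
risky = [Step("read_note", True, True), Step("push_remote", True, False)]
clean_score = score_trajectory(clean, final_correct=True)
risky_score = score_trajectory(risky, final_correct=True)
```

Both runs end with a correct answer, yet the run that used a disallowed tool scores lower. A final-answer-only check cannot make that distinction.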

Practical rule

Every meaningful Memex change should answer seven questions:

Execution: what environment did it assume?
Tools: which capabilities did it use?
Context: what evidence did it load and why?
Lifecycle: what state or artifact continues the work?
Observability: what trace or commit explains what happened?
Verification: what check passed?
Governance: what boundary or approval applied?
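
The seven questions above could be checked mechanically before a change is merged. The following is a minimal sketch, assuming a change is described by a plain dictionary keyed by layer name; the key names and the missing_answers helper are hypothetical and are not part of validate_vault.py.

```python
from typing import Dict, List

# The seven questions from the practical rule, keyed by an assumed layer identifier.
QUESTIONS: Dict[str, str] = {
    "execution": "What environment did it assume?",
    "tools": "Which capabilities did it use?",
    "context": "What evidence did it load and why?",
    "lifecycle": "What state or artifact continues the work?",
    "observability": "What trace or commit explains what happened?",
    "verification": "What check passed?",
    "governance": "What boundary or approval applied?",
}


def missing_answers(change_record: dict) -> List[str]:
    """Return the layers whose answer is absent or blank, in ETCLOVG order."""
    missing = []
    for layer in QUESTIONS:
        value = change_record.get(layer)
        # Only a non-empty string counts as an answer.
        if not isinstance(value, str) or not value.strip():
            missing.append(layer)
    return missing


# Example record with two gaps: a blank verification answer and no governance entry.
record = {
    "execution": "Local workspace",
    "tools": "Validation scripts",
    "context": "README and the affected vault notes",
    "lifecycle": "Git commit plus roadmap entry",
    "observability": "Commit message and validation log",
    "verification": "   ",
}
gaps = missing_answers(record)
for layer in gaps:
    print(f"{layer}: {QUESTIONS[layer]}")
```

Returning the unanswered layers, rather than a single pass/fail flag, tells the author which questions still need an answer.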