Mneme Ingest Pipeline (v1)

Issue #2 starts simple on purpose.

Mneme does not need ten data sources before it becomes useful. It needs one reliable ingest path that turns OpenClaw memory files into raw evidence.

Script

scripts/mneme_ingest_memory.py

Scope for v1

Input:

MEMORY.md
memory/*.md
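
The input set above can be sketched as a small discovery helper. This is a minimal illustration, not the script's actual code: the function name `discover_memory_files` and the choice to sort `memory/*.md` by path are assumptions made here for the sake of a stable ordering.

```python
from pathlib import Path


def discover_memory_files(root: Path) -> list[Path]:
    """Return MEMORY.md (if present) followed by memory/*.md in sorted order.

    Hypothetical helper: the real script may order or filter files differently.
    """
    files: list[Path] = []

    top_level = root / "MEMORY.md"
    if top_level.is_file():
        files.append(top_level)

    memory_dir = root / "memory"
    if memory_dir.is_dir():
        # Non-recursive on purpose: the v1 scope lists memory/*.md only.
        files.extend(sorted(memory_dir.glob("*.md")))

    return files
```

Sorting the glob result keeps the file order independent of how the filesystem happens to enumerate directory entries.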

Output:

raw/sources.jsonl
raw/items.jsonl
raw/report.json

What it does

creates one EvidenceSource per source file
creates EvidenceItem records from:
  - headings
  - bullets
  - section blocks
preserves provenance
redacts obvious secrets in item text
writes deterministic JSONL output
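
A rough sketch of the item extraction and redaction steps listed above, under stated assumptions. The `Item` fields, the `SECRET_RE` pattern, and the `[REDACTED]` placeholder are hypothetical; the real field names live in docs/evidence-model.md, and the script's notion of an "obvious secret" may be broader or narrower. Section blocks are left out to keep the example short.

```python
import re
from dataclasses import dataclass

# Hypothetical pattern: "key: value" or "key=value" pairs with a secret-looking key.
SECRET_RE = re.compile(r"(?i)\b(api[_-]?key|token|password|secret)\b(\s*[:=]\s*)(\S+)")
HEADING_RE = re.compile(r"^#{1,6}\s+(.*)$")
BULLET_RE = re.compile(r"^[-*+]\s+(.*)$")


@dataclass(frozen=True)
class Item:
    """Illustrative stand-in for an EvidenceItem; field names are assumptions."""
    source_path: str  # provenance: which file the item came from
    line: int         # provenance: 1-based line number in that file
    kind: str         # "heading" or "bullet"
    text: str         # item text after redaction


def redact(text: str) -> str:
    """Replace the value part of a secret-looking pair, keeping the key visible."""
    return SECRET_RE.sub(lambda m: f"{m.group(1)}{m.group(2)}[REDACTED]", text)


def extract_items(source_path: str, markdown: str) -> list[Item]:
    """Emit one item per heading or bullet line, in document order."""
    items: list[Item] = []
    for lineno, raw in enumerate(markdown.splitlines(), start=1):
        stripped = raw.strip()
        heading = HEADING_RE.match(stripped)
        bullet = BULLET_RE.match(stripped)
        if heading:
            items.append(Item(source_path, lineno, "heading", redact(heading.group(1))))
        elif bullet:
            items.append(Item(source_path, lineno, "bullet", redact(bullet.group(1))))
    return items
```

Carrying the source path and line number on every item is one simple way to preserve provenance: any later stage can trace a record back to the exact line it came from.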

What it does not do yet

session transcript ingest
repo file ingest
PDFs / web pages / images
entity linking
contradiction resolution
promotion into compiled memory

Usage

./scripts/mneme_ingest_memory.py \
  --root ~/.openclaw/workspace \
  --out ./.mneme-raw

# Opt in only if you truly need host-absolute paths in source URIs
./scripts/mneme_ingest_memory.py \
  --root ~/.openclaw/workspace \
  --out ./.mneme-raw \
  --absolute-file-uris
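
For orientation, the three flags above could be parsed roughly as follows. This is a sketch, not the script's real argument handling: whether `--root` and `--out` are required, and what a non-absolute source URI looks like, are assumptions. Here the default form is taken to be a path relative to `--root`, and `source_uri` is a hypothetical helper.

```python
import argparse
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    """Parser for the flags shown in the usage examples (requiredness is assumed)."""
    parser = argparse.ArgumentParser(
        description="Ingest OpenClaw memory files into raw evidence."
    )
    parser.add_argument("--root", type=Path, required=True,
                        help="workspace containing MEMORY.md and memory/")
    parser.add_argument("--out", type=Path, required=True,
                        help="directory that receives the raw output files")
    parser.add_argument("--absolute-file-uris", action="store_true",
                        help="emit host-absolute file:// URIs (off by default)")
    return parser


def source_uri(path: Path, root: Path, absolute: bool) -> str:
    """Hypothetical helper: root-relative path by default, file:// URI on opt-in."""
    if absolute:
        return path.resolve().as_uri()
    return path.relative_to(root).as_posix()
```

Keeping absolute URIs opt-in means the default output does not reveal the host's directory layout and stays identical across machines. Note that `~` in `--root` is expanded by the shell, not by `argparse`.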

Why JSONL

JSONL is enough for v1 because it is:

append-friendly
diffable
inspectable
easy to rebuild and pipe into later stages
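
One way to get deterministic, diffable JSONL is to fix both the record order and the key order before writing. The sketch below assumes each record carries a sortable `id` field; that field name is hypothetical, and the script may order records differently.

```python
import json


def to_jsonl(records: list[dict], sort_field: str = "id") -> str:
    """Serialize records as JSONL with stable record order and stable key order."""
    # Sort records by the (assumed) id field so input order does not matter.
    ordered = sorted(records, key=lambda r: str(r.get(sort_field, "")))
    lines = [
        # sort_keys fixes key order; compact separators avoid whitespace drift.
        json.dumps(r, sort_keys=True, ensure_ascii=False, separators=(",", ":"))
        for r in ordered
    ]
    return "".join(line + "\n" for line in lines)
```

With this shape, two runs over the same input produce byte-identical files, so a plain `diff` between runs shows only real changes.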

Relationship to the evidence model

This ingest pipeline is valid because it emits the fields defined in:

docs/evidence-model.md

That is the contract.
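
A contract like this can be spot-checked mechanically. The sketch below takes the required field names as a parameter rather than hard-coding them, because the authoritative list is in docs/evidence-model.md and is not reproduced here; `missing_fields` is a hypothetical helper, not part of the script.

```python
import json


def missing_fields(jsonl_text: str, required: set[str]) -> list[tuple[int, list[str]]]:
    """Return (line number, missing field names) for each record that breaks the contract."""
    problems: list[tuple[int, list[str]]] = []
    for lineno, line in enumerate(jsonl_text.splitlines(), start=1):
        if not line.strip():
            continue  # tolerate blank lines
        record = json.loads(line)
        missing = sorted(required - set(record))
        if missing:
            problems.append((lineno, missing))
    return problems
```

An empty result means every record has at least the required fields; it says nothing about whether the values themselves are well-formed.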
