A multi-agent document processing system built with DSPy. Partitions documents into per-document agents that independently evaluate and answer queries, coordinated by a master agent with query planning and reranking.
```
User query
  -> QueryPlanner   (selects relevant document agents)
  -> DocumentAgents (evaluate relevance, extract answers in parallel)
  -> RerankModule   (reorders results by score)
  -> MasterAgent    (returns top answer with citations)
```
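The flow above can be sketched as a minimal pure-Python pipeline. All function names and the keyword-overlap planner below are illustrative stand-ins, not the repo's actual DSPy modules:

```python
# Hypothetical sketch of the query flow: plan -> evaluate -> rerank -> answer.
def plan(query, agents):
    # QueryPlanner stand-in: keep agents whose summary shares a word with the query.
    q = set(query.lower().split())
    return [a for a in agents if q & set(a["summary"].lower().split())]

def evaluate(query, agent):
    # DocumentAgent stand-in: score the agent and produce a candidate answer.
    return {"agent": agent["id"], "score": len(agent["summary"]), "answer": f"from {agent['id']}"}

def rerank(results):
    # RerankModule stand-in: reorder candidates by score, highest first.
    return sorted(results, key=lambda r: r["score"], reverse=True)

def run(query, agents):
    # MasterAgent stand-in: orchestrate the stages and return the top answer.
    selected = plan(query, agents)
    results = [evaluate(query, a) for a in selected]
    return rerank(results)[0]["answer"] if results else None

agents = [
    {"id": "doc-a", "summary": "qdrant vector database setup"},
    {"id": "doc-b", "summary": "dspy prompt optimization"},
]
print(run("how does dspy optimization work", agents))  # from doc-b
```

In the real system each stage is an LLM-backed DSPy module; this skeleton only shows how the stages hand results to one another.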
| Component | Description |
|---|---|
| MasterAgent | Orchestrates query flow across document agents |
| QueryPlanner | DSPy ChainOfThought module that selects which agents to involve |
| DocumentAgent | Per-document agent that evaluates queries, extracts answers, and generates responses via Claude |
| RerankModule | Reranks initial retrieval scores using Qdrant context |
| RerankingOptimizer | Uses DSPy's BootstrapFewShotWithRandomSearch to optimize reranking |
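To make the per-document agent's role concrete, here is a hypothetical sketch of its interface. The class shape, field names, and the substring-based scoring are assumptions for illustration; the actual repo wires this through DSPy and Claude:

```python
from dataclasses import dataclass

# Illustrative DocumentAgent interface; not the repo's actual API.
@dataclass
class DocumentAgent:
    doc_id: str
    text: str

    def evaluate(self, query: str) -> float:
        """Crude relevance score: fraction of query words that appear in the document text."""
        words = query.lower().split()
        if not words:
            return 0.0
        hits = sum(1 for w in words if w in self.text.lower())
        return hits / len(words)

    def answer(self, query: str) -> str:
        """In the real system this calls Claude; here we return a stub with a citation tag."""
        return f"[{self.doc_id}] answer to: {query}"
```

Each agent owns exactly one document partition, so evaluation can run across agents in parallel without shared state.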
| Dependency | Purpose |
|---|---|
| DSPy | LLM orchestration and prompt optimization |
| Qdrant | Vector database for document embeddings |
| LlamaIndex | Document loading and vector store indexing |
| sentence-transformers (all-MiniLM-L6-v2) | Query encoding |
| Anthropic Claude (claude-3-haiku) | LLM for query planning, evaluation, answer generation |
| unstructured | Document parsing |
- Python >= 3.8
- Running Qdrant instance (default: localhost:6333)
- `ANTHROPIC_API_KEY` environment variable
```bash
git clone https://github.com/jmanhype/DSPy-Multi-Document-Agents.git
cd DSPy-Multi-Document-Agents
pip install -r requirements.txt

# Start Qdrant (e.g., via Docker)
docker run -p 6333:6333 qdrant/qdrant

export ANTHROPIC_API_KEY="your-key"
python main.py
```

By default, `main.py` loads documents from `docs/latest.md` (configurable via the `DOCUMENT_PATH` environment variable).
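The path override follows the standard environment-variable pattern; this snippet mirrors the described behavior (an assumed reconstruction, not the repo's exact code):

```python
import os

# DOCUMENT_PATH overrides the default document location (assumed pattern).
DOCUMENT_PATH = os.environ.get("DOCUMENT_PATH", "docs/latest.md")
print(DOCUMENT_PATH)
```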
- Documents are loaded, partitioned, embedded, and stored in Qdrant
- A `DocumentAgent` is created for each document partition
- `QueryPlanner` uses ChainOfThought to select relevant agents
- Selected agents evaluate the query against their content
- `RerankModule` adjusts scores using retrieved context
- `MasterAgent` picks the top-scoring agent and asks it to generate a full answer with citations
- The reranking logic is simplistic (adds context length to initial score)
- Relevance scoring uses keyword heuristics ("not relevant" / "partially relevant" string matching), not a trained classifier
- Query decomposition splits on "and"/"or" keywords only
- Similarity search uses Jaccard similarity, not semantic embeddings
- No web UI; runs as a CLI script
- The Nextra documentation site (`pages/`) describes the architecture, but running it requires a separate `npm install && npm run dev`
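The simplistic heuristics listed above can be sketched in a few lines. This is an illustrative reconstruction of the described behavior, not the repo's actual code:

```python
import re

def decompose(query):
    # Query decomposition splits only on "and"/"or" keywords.
    return [p.strip() for p in re.split(r"\b(?:and|or)\b", query) if p.strip()]

def jaccard(a, b):
    # Token-overlap similarity, not semantic embeddings.
    sa, sb = set(a.lower().split()), set(b.lower().split())
    return len(sa & sb) / len(sa | sb) if sa | sb else 0.0

def rerank_score(initial, context):
    # Reranking simply adds the context length to the initial score.
    return initial + len(context)

def relevance(label):
    # Keyword string matching, not a trained classifier.
    if "not relevant" in label:
        return 0.0
    if "partially relevant" in label:
        return 0.5
    return 1.0

print(decompose("setup qdrant and run dspy"))  # ['setup qdrant', 'run dspy']
```

These stubs make the failure modes easy to see: "or" inside a longer query fragments it, Jaccard ignores synonyms, and a long but irrelevant context still inflates the rerank score.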
The pages/ directory contains a Nextra (Next.js) documentation site:
```bash
npm install
npm run dev
# Open http://localhost:3000
```

License: MIT