Overview

optim-rag-demo.mp4

Overview

optim-rag is a tool designed to simplify managing data in Retrieval-Augmented Generation (RAG) systems. It helps you easily add, edit, and manage document chunks used for knowledge retrieval, making it especially useful when working with frequently changing data.

Key Features

Core Capability	Technical Implementation
Efficient Updates	Edits, additions, and deletions are saved and processed once the user confirms the changes, ensuring only updated chunks are re-indexed and stored through an easy-to-use frontend.
Advanced Document Ingestion	Handles `PDF`, `DOCX`, `MD`, and `TXT` formats. Converts all to a standard format and uses the Mistral OCR engine for text extraction from documents.
Multi-Vector Indexing	Every chunk is indexed using three distinct vector types: Dense (`cli-MiniLM-L6-v2`), Sparse (`BM25`), and Late-Interaction (`ColBERTv2.0`). This ensures a powerful hybrid search that combines semantic context with keyword accuracy.

What Makes `optim-rag` Different

optim-rag focuses on efficient management of embedded vector data. Instead of reprocessing all files, it only updates what has been changed. When users edit or add new contents, the system detects those changes and updates them selectively.

This design is ideal for production setups where data changes frequently and precision in updating is key.

Pipeline

1. Resource Upload and Session Setup

This stage sets up the workspace and prepares the files for processing.

Users must upload a .zip file containing all the required documents such as PDF, DOCX, TXT, or MD.
The zip should contain all files directly inside a single folder, without any subfolders or nested structure.
Once uploaded, the system extracts the files and prepares them for chunking.
Users can either start a new session or load an existing one to continue working with previously uploaded data.

2. Chunk Editing and File Management

This stage allows users to organize, modify, and manage their content through the Chunk Editor interface.

Inside the editor, users can upload additional files (TXT, DOCX, MD, or PDF).
They can add new chunks to an existing file or create chunks under a new file category.
Files or individual chunks can be deleted if no longer needed.
Each chunk can be opened for editing, where users can preview its content or view and modify its markdown code.
Every change updates the chunk’s status, newly added chunks are marked as new, edited chunks as modified, and deleted chunks as deleted with a default of unchanged status to the chunks which are untouched.
To finalize and execute the modifications, users must commit the changes, which applies all pending updates to the datastore.

3. Query and Retrieval

This stage lets users interact with the knowledge base through a chat interface.

After committing changes, users can click on the Chat button to test the vectorstore.
The system performs retrieval over the stored chunks and selects the most relevant ones.
These retrieved chunks are compiled into a structured context and passed to the language model.
The LLM then generates an accurate and context-aware response based on the uploaded and edited data.

Getting Started

optim-rag is a modular Retrieval-Augmented Generation (RAG) framework designed for flexibility and extensibility.
You can set it up in multiple ways depending on your purpose — Docker for a quick launch or Vanilla setup for faster development and debugging.

Prerequisites

Before starting, ensure you have the following installed:

docker and docker-compose – for containerized setup
node.js (≥22) – for frontend
python (≥3.13) – for backend
uv – for Python dependency management

Step 1: Clone the Repository & Setup Environment

Clone the project and prepare your environment variables:

git clone https://github.com/Oqura-ai/optim-rag
cd optim-rag
cp .env.example .env # Fill in the required API/auth keys for your services
cp .env ./backend/.env

Step 2: Docker Setup (Recommended for Quick Start)

Docker setup is the easiest and fastest way to run optim-rag — it handles dependencies and services automatically.

# if you only want to run the main application
docker-compose -f docker-compose.yaml up frontend backend qdrant --build

# if you only want to run the mcp server (we will get into this later)
docker-compose -f docker-compose.yaml up mcp qdrant --build

# if you also want to run the mcp server alongside the main application
docker-compose -f docker-compose.yaml up --build

The first build may take time as it downloads all dependencies.
Once complete, optim-rag will be available at http://localhost:3000

Step 3: Vanilla Setup (For Local Development)

Use the vanilla setup if you want fine-grained control over backend/frontend development or debugging.

1. Start Qdrant (Vector Database)

docker-compose up -d qdrant

Launches Qdrant, the vector store backend used by optim-rag.
Data persists in the local qdrant_data directory.

2. Setup Backend

cd backend
uv venv

./.venv/Scripts/activate  # (Windows)
# or
source .venv/bin/activate  # (Linux/Mac)

uv pip install -e .

3. Setup Frontend

cd frontend
npm install

4. Run the Application

# Run the frontend (in frontend dir)
npm run dev

# Run the backend (in backend dir)
uvicorn main:app --host 0.0.0.0 --port 8000 --reload

Open your browser at: http://localhost:3000

MCP Server Setup (Prototype Stage)

The MCP (Model Context Protocol) server enables AI-assisted coding clients (like Cursor, GitHub Copilot, or VSCode) to directly access optim-rag’s vectorstore endpoints as tools.

Note: MCP integration is currently in prototype stage — features may change in future versions.

For Cursor Editor

Create an MCP configuration file inside your project root:

mkdir .cursor
touch mcp.json
# or (on Windows)
New-Item mcp.json

Add this configuration:

{
  "mcpServers": {
    "optim-rag": {
      "command": "python",
      "args": ["backend/mcp_server.py"],
      "cwd": "${workspaceFolder}/backend",
      "envFile": "${workspaceFolder}/.env",
      "description": "optim-rag vectorstore MCP server"
    }
  }
}

For GitHub Copilot / VSCode MCP Integration

Create the config file:

mkdir .vscode
touch mcp.json
# or (on Windows)
New-Item mcp.json

Add this configuration:

{
  "servers": {
    "optim-rag": {
      "command": "${workspaceFolder}/backend/.venv/Scripts/python.exe",
      "args": ["mcp_server.py"],
      "cwd": "${workspaceFolder}/backend",
      "envFile": "${workspaceFolder}/.env"
    }
  }
}

Run the MCP Server

Once your MCP client (Cursor or VSCode) is configured:

For vanilla run

cd backend
python mcp_server.py

Using docker

docker-compose -f docker-compose.yaml up mcp qdrant --build

The MCP server will start and expose optim-rag’s tools to your connected client.

Note:

Make sure if you have the environment activated if running the server in vanilla setup

Since MCP integration is still experimental, expect rapid iteration and breaking changes.

Authors

Contributing

If something here could be improved, please open an issue or submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Key Features

What Makes `optim-rag` Different

Pipeline

1. Resource Upload and Session Setup

2. Chunk Editing and File Management

3. Query and Retrieval

Getting Started

Prerequisites

Step 1: Clone the Repository & Setup Environment

Step 2: Docker Setup (Recommended for Quick Start)

Step 3: Vanilla Setup (For Local Development)

1. Start Qdrant (Vector Database)

2. Setup Backend

3. Setup Frontend

4. Run the Application

MCP Server Setup (Prototype Stage)

For Cursor Editor

For GitHub Copilot / VSCode MCP Integration

Run the MCP Server

Authors

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.cursor		.cursor
.vscode		.vscode
assets		assets
backend		backend
frontend		frontend
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yaml		docker-compose.yaml

Folders and files

Latest commit

History

Repository files navigation

Overview

Key Features

What Makes optim-rag Different

Pipeline

1. Resource Upload and Session Setup

2. Chunk Editing and File Management

3. Query and Retrieval

Getting Started

Prerequisites

Step 1: Clone the Repository & Setup Environment

Step 2: Docker Setup (Recommended for Quick Start)

Step 3: Vanilla Setup (For Local Development)

1. Start Qdrant (Vector Database)

2. Setup Backend

3. Setup Frontend

4. Run the Application

MCP Server Setup (Prototype Stage)

For Cursor Editor

For GitHub Copilot / VSCode MCP Integration

Run the MCP Server

Authors

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

What Makes `optim-rag` Different

Packages