Flash Notes RAG

Natural-language Q&A over a collection of company flash notes (PDF, XLSX). This project implements a RAG (Retrieval-Augmented Generation) pipeline using ChromaDB, Mistral AI, and FastAPI.

Architecture

The project is structured as follows:

flash-notes-rag/
├── data/               # Raw and processed document data
├── db/                 # ChromaDB vector database storage
├── src/                # Project source code
│   ├── load.py         # Handles fetching records from Zenodo
│   ├── extract.py      # Performs OCR and text extraction
│   ├── parse.py        # Parses documents into a unified format
│   ├── populate.py     # Embeds and indexes documents in ChromaDB
│   ├── query.py        # Main retrieval and generation logic
│   ├── chromadb.py     # ChromaDB client and collection utilities
│   ├── mistral.py      # Mistral AI model integration
│   └── utils.py        # Shared utility functions
├── static/             # Frontend assets (HTML, CSS, JS)
├── main.py             # FastAPI entry point
├── makefile            # Build and release orchestration
├── dockerfile          # Container configuration
└── pyproject.toml      # Project dependencies and metadata

Getting Started

This repository uses uv for dependency management and project orchestration.

Installation

# Sync dependencies and create a virtual environment
uv sync

Running Locally

To start the FastAPI server:

uv run fastapi dev main.py

The application will be accessible at http://localhost:8000.

Scripts and Pipelines

The system is designed as a pipeline that can be controlled via the API or individual scripts in src/.

API Endpoints

POST /query: Performs a RAG search.
- Payload: {"question": "What is the ESG score of Company X?", "top_k": 5}
POST /update: Orchestrates the entire ingestion pipeline.
- Options: Skip fetch, skip download, force extract, force parse, reset/override collection.

Direct Script Usage

You can also run individual parts of the pipeline:

uv run src/load.py      # Fetch new data
uv run src/extract.py   # Run OCR/extraction
uv run src/parse.py     # Clean and format data
uv run src/populate.py  # Index into ChromaDB

Docker Workflow

Building and pushing Docker images is managed via the makefile.

Build Image

make build

This builds the image tagged with the current version from pyproject.toml and latest.

Push to Registry

make push

Pushes the images to ghcr.io/dataesr/flash-rag.

Combined Build & Push

make build-push

Release Process

The project follows a semantic versioning pattern. Releases are automated via the makefile.

Tag a new version:
```
make release VERSION=X.Y.Z
```
This updates pyproject.toml, commits the change, and creates a git tag.
Push to main:
```
git push origin main --tags
```

Wait for the CI/CD pipeline to pick up the new tag and deploy the image.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
src		src
static		static
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
dockerfile		dockerfile
explore.ipynb		explore.ipynb
main.py		main.py
makefile		makefile
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Flash Notes RAG

Architecture

Getting Started

Installation

Running Locally

Scripts and Pipelines

API Endpoints

Direct Script Usage

Docker Workflow

Build Image

Push to Registry

Combined Build & Push

Release Process

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Flash Notes RAG

Architecture

Getting Started

Installation

Running Locally

Scripts and Pipelines

API Endpoints

Direct Script Usage

Docker Workflow

Build Image

Push to Registry

Combined Build & Push

Release Process

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages