Tech Stack

Status: Adopted for MVP1. Revisited per release as new layers come online. Source of truth for product context: docs/00_overview/relyloop-spec.md §28 ("Tech stack & implementation decisions"). This document is the engineering-facing distillation of those decisions, scoped to what's relevant for MVP1 with explicit notes on what activates in later releases.

Canonical release matrix

This is the source-of-truth release matrix that every other arch doc derives from. If a row in this table conflicts with another doc, this table wins. Sourced from umbrella spec lines 17–25 and §27 (per-release detail).

Release	Theme	Adds on top of previous
MVP1 / v0.1 (shipped)	"The Loop"	ES + OpenSearch adapter (single `ElasticAdapter`); LLM via single env var (`OPENAI_BASE_URL`) — any OpenAI-compatible endpoint works (OpenAI cloud default; Ollama, LM Studio, vLLM, HuggingFace TGI for air-gapped local; Azure OpenAI's compatible mode; OpenRouter for multi-model routing; LiteLLM proxy in front of Bedrock / Vertex / Anthropic native — see `docs/08_guides/llm-endpoint-setup.md`); GitHub Git provider; single-tenant (no `tenants` table, no `tenant_id`); no auth; basic structured logging; Docker Compose; Apache 2.0 LICENSE; 80% backend coverage gate; Optuna/TPE optimization loop over the full query-time search space; Git-PR apply path; conversational agent that runs the loop. No native non-OpenAI-compatible provider SDKs (backlog — ergonomics upgrade only since LiteLLM proxy + OpenRouter cover the unblocking path today), no observability stack, no audit_log, no lineage, no SSO, no API keys. (Solr was not in MVP1 — it shipped in MVP2; see the row below.)
MVP2 / v0.2 (in progress; Solr + UBI shipped)	"Three-Engine + Real Signals"	Apache Solr adapter (`auth_kind = solr_basic` and `solr_apikey`) covering Solr 9.x + 10.x via `edismax` + `{!ltr}` rescoring — shipped 2026-05-31 (`infra_adapter_solr`); UBI judgments via engine-agnostic `UbiReader` (reads `ubi_queries` + `ubi_events` via any `SearchAdapter`'s `search_batch`) — shipped 2026-05-29 (`feat_ubi_judgments`); pluggable `SignalsConverter` Protocol (position-bias-corrected CTR, dwell-time threshold, hybrid UBI+LLM); `POST /api/v1/judgment-lists/generate-from-ubi` + `generate_judgments_from_ubi` agent tool; mixed-source judgment lists (`llm` + `human` + `click` rows in the same list — the existing `judgments.source` enum already permits this). RelyLoop now runs on all three OSS engines with UBI judgment generation on every one of them. (The live capture component `solr.UBIComponent` is not in the stock Solr images, so on Solr the demo synthesizes UBI events and the probe reports `ubi_component_present=false`; the read path RelyLoop consumes is identical regardless.) No schema migration for UBI (additive — uses existing `source = 'click'` enum value); one small migration (`0022`) extended `engine_type` + `auth_kind` CHECK constraints to accept Solr values.
MVP3 / v0.3	"Observable"	Langfuse + ClickHouse + SigNoz + OpenTelemetry exporters wired; canonical event catalog; `audit_log` table + Postgres immutability trigger (no users/tenants yet — `actor_id`/`tenant_id` nullable, no FKs); lineage columns (`langfuse_trace_id`, `prompt_version`, `input_hash`) on `judgments`/`digests`/`proposals`; PII redaction; trace context propagation through API → Redis → worker → adapter → engine for all three engines.
GA v1 / v1.0	"Production-ready"	LangGraph orchestrator (replaces plain `openai` SDK + function calling); `PostgresSaver` for resumable conversations; full RFC 7807 Problem Details on errors; `Idempotency-Key` header on POST/PATCH/DELETE; full four-layer test pyramid at 90% coverage; complete CI/CD with security gates (Trivy, bandit, pip-audit, npm audit); image signing (cosign keyless OIDC); Helm 3 chart; complete OSS launch infrastructure (docs, ADRs, contributor onboarding, design-partner references); public Optuna-vs-SRW-grid benchmark. No new product surface — all six differentiators are GA by MVP3; GA v1 is polish + governance + hardening.
Backlog	—	Multi-Git provider abstraction (GitLab + Bitbucket); multi-tenancy primitives (`tenants` + `tenant_memberships` + `users` + `api_keys` tables; `tenant_id` columns; roles `viewer`/`runner`/`tenant_admin`/`platform_admin`); SSO via reverse proxy; Argon2id-hashed bearer API keys; native non-OpenAI provider SDKs (Anthropic, Bedrock, Vertex, Azure OpenAI); LTR training; Path B (production-quality monitoring, bandits, shadow validation, manual one-click rollback); Helm chart maturity.

Audit-without-users design: MVP3 ships audit_log with actor_id / tenant_id as nullable UUIDs with no FK constraints, plus an actor_type ENUM constrained to system / agent / anonymous. The FK constraints and the user actor type ship when multi-tenancy is promoted from backlog. Pre-multi-tenancy audit rows keep actor_id = NULL. See data-model.md §"audit_log" for the schema.
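
The shape described above can be sketched in a few lines of standard-library Python. This is an illustration only, not the schema in data-model.md: the `AuditRow` class, its `action` field, and the `proposal.created` value are hypothetical names chosen for the example.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional
from uuid import UUID


class ActorType(str, Enum):
    """Actor types allowed before multi-tenancy ships (no `user` yet)."""

    SYSTEM = "system"
    AGENT = "agent"
    ANONYMOUS = "anonymous"


@dataclass(frozen=True)
class AuditRow:
    """Hypothetical in-memory mirror of one audit_log row."""

    action: str                       # hypothetical column, for illustration
    actor_type: ActorType
    actor_id: Optional[UUID] = None   # nullable, no FK until `users` exists
    tenant_id: Optional[UUID] = None  # nullable, no FK until `tenants` exists

    def __post_init__(self) -> None:
        # Coerce raw strings so an unknown actor type fails loudly,
        # much as the database ENUM would reject it.
        object.__setattr__(self, "actor_type", ActorType(self.actor_type))


# A pre-multi-tenancy row: the agent acted, and no user or tenant is recorded.
row = AuditRow(action="proposal.created", actor_type="agent")
```

Constructing a row with `actor_type="user"` raises `ValueError` in this sketch, which mirrors the rule that the user actor type is unavailable until multi-tenancy is promoted from backlog.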

Backend

Layer	Choice	Notes
Language	Python 3.13+ (async)	Type hints required; `mypy --strict` is enforced. Bumped from 3.12 → 3.13 on 2026-05-12 (current stable). `requires-python = ">=3.13"` is the new floor, so 3.12 no longer works and 3.12 callers must upgrade.
Web framework	FastAPI	Auto-generates OpenAPI from Pydantic models.
ORM	SQLAlchemy 2.0 (async)	All queries through the ORM; no raw SQL except in migrations.
DB driver	asyncpg	Async Postgres driver used by the SQLAlchemy async engine.
Migrations	Alembic	`alembic revision --autogenerate` is the standard workflow.
Validation	Pydantic v2	Used for: API request/response, tool args, eval datasets, settings.
Settings	Pydantic Settings	Loads from env + mounted secrets files.
HTTP client	httpx (async)	One client instance per upstream service (cluster, OpenAI, GitHub).
Logging	structlog	Structured JSON to stdout.
Queue / workers	Arq + Redis 7	Async-native; workers are separate processes.
Optimization	Optuna with TPE sampler + RDBStorage	RDBStorage points at the same Postgres as the app.
IR evaluation	ir_measures	Provider-abstracted; wraps multiple IR-evaluation backends behind a typed metric-object DSL; consistent metrics across engines.
LLM SDK (MVP1)	`openai` Python SDK with function calling	LangGraph deferred to GA v1. No provider-abstraction layer in MVP1 — direct OpenAI calls.
Auth — humans (backlog)	SSO via reverse proxy (oauth2-proxy or Authelia); proxy injects `X-Auth-Email` header; API trusts the header only when verified by mTLS or a shared secret	Not present through GA v1. No password storage in RelyLoop itself — identity provider owns credentials.
Auth — service accounts (backlog)	Bearer API keys (`Authorization: Bearer <key>`); keys hashed with Argon2id (passlib) at rest	Not present through GA v1. Per-key role + scopes + expiration; revocation via `revoked_at`.
Testing	pytest + pytest-asyncio + pytest-mock + pytest-recording	`pytest-recording` cassettes are checked in for every external HTTP integration.
Coverage	coverage.py	CI gate: 80% backend Python (MVP1) → 90% (GA v1).
Linter / formatter	ruff (`check` + `format`)	Replaces flake8 + isort + black.
Type checker	mypy `--strict`	No `Any` without explicit annotation.
Dependency mgmt	uv	Lockfile-based; replaces pip + pip-tools + virtualenv.
Pre-commit	pre-commit framework	Hooks: ruff, mypy, eslint, prettier.
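
The MVP1 LLM configuration (a single `OPENAI_BASE_URL` env var pointing at any OpenAI-compatible endpoint) could be resolved roughly as follows. This is a minimal sketch: `resolve_llm_base_url` is a hypothetical helper, not a function from the codebase, and the `openai` SDK already reads `OPENAI_BASE_URL` on its own, so the helper only makes the fallback and validation explicit.

```python
import os
from typing import Mapping

# Default endpoint of the OpenAI cloud API.
DEFAULT_OPENAI_BASE_URL = "https://api.openai.com/v1"


def resolve_llm_base_url(env: Mapping[str, str] = os.environ) -> str:
    """Return the OpenAI-compatible endpoint to call (hypothetical helper)."""
    raw = env.get("OPENAI_BASE_URL", "").strip()
    if not raw:
        # Unset or blank: fall back to the OpenAI cloud default.
        return DEFAULT_OPENAI_BASE_URL
    if not raw.startswith(("http://", "https://")):
        raise ValueError(f"OPENAI_BASE_URL must be an http(s) URL, got {raw!r}")
    # Normalize a trailing slash so later path joins stay predictable.
    return raw.rstrip("/")
```

For an air-gapped setup, a local Ollama server would typically be addressed with `OPENAI_BASE_URL=http://localhost:11434/v1`; the resolved value can then be passed as `base_url` when constructing the SDK client.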

Frontend

Layer	Choice	Notes
Language	TypeScript (`--strict` + `noUncheckedIndexedAccess`)
Framework	Next.js 16 (App Router, Turbopack)	Bumped from 14 on 2026-05-12 (`infra_frontend_stack_refresh`); React 19 as a peer.
UI components	shadcn/ui	Components copied into the repo, not an npm dependency — fully customizable.
Styling	Tailwind CSS 4 (CSS-first config via `@import "tailwindcss"`)	Bumped from 3 on 2026-05-12; legacy `tailwind.config.ts` deleted, source paths auto-detected.
Server state	TanStack Query	Caching, retries, optimistic updates, mutations.
Forms	React Hook Form + Zod	Zod schemas can be reused for API request validation.
Charts	Recharts	Sufficient for parameter-importance bars, scatter plots, trial-progress lines.
Streaming	`fetch()` with `ReadableStream` (SSE-framed body over POST)	Native `EventSource` is GET-only; the chat surface POSTs the user message in the body so we use `fetch()` streaming. See `ui-architecture.md` §"Streaming chat".
Testing	Vitest 4 + msw	msw mocks HTTP at the network layer. Vitest bumped from 2 on 2026-05-12.
Linter	ESLint 9 (flat config, `eslint.config.mjs`) + Next + security plugins	ESLint 10 was attempted but hit an `eslint-plugin-react` 7.37 vs ESLint-10 API incompat; backed off to 9 (matches `eslint-config-next` 16's tested baseline).
Formatter	prettier
Type checker	`tsc --noEmit --strict`	Runs in CI.
Dependency mgmt	pnpm	Lockfile-based.
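
The SSE framing used by the streaming chat surface is independent of language, so it can be illustrated in Python even though the UI itself is TypeScript. The sketch below is an assumption-laden illustration, not the project's parser: `iter_sse_data` is a hypothetical name, only `data:` fields are handled, and bare-CR line endings are ignored.

```python
from typing import Iterable, Iterator


def iter_sse_data(chunks: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each SSE event from arbitrarily split chunks."""
    buffer = ""
    for chunk in chunks:
        # Re-normalize the whole buffer so a CRLF split across chunks is handled.
        buffer = (buffer + chunk).replace("\r\n", "\n")
        # An event ends at a blank line; anything after it stays buffered.
        while "\n\n" in buffer:
            frame, buffer = buffer.split("\n\n", 1)
            data_lines = []
            for line in frame.split("\n"):
                if line.startswith("data:"):
                    value = line[len("data:"):]
                    # The SSE format strips one leading space after the colon.
                    data_lines.append(value[1:] if value.startswith(" ") else value)
            if data_lines:
                # Multiple data lines in one event are joined with newlines.
                yield "\n".join(data_lines)


events = list(iter_sse_data(["data: hel", "lo\n\ndata: a\nda", "ta: b\n\n"]))
```

The point of the example is that network chunk boundaries do not align with event boundaries, which is why a `fetch()` reader has to buffer until it sees the blank-line terminator.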

Infrastructure

Layer	Choice	MVP1 status
Database (app)	Postgres 16	Single instance. Holds app state + Optuna RDBStorage.
Cache / queue	Redis 7	Used by Arq for the worker queue.
Search engines (targets)	Elasticsearch 8.11+ / 9.x; OpenSearch 2.x / 3.x; Apache Solr 9.x / 10.x	These three OSS engines are the only supported targets — all three shipped (ES + OpenSearch in MVP1, Solr in MVP2).
Reverse proxy	Caddy 2	NOT in MVP1. Production-style install (TLS via Caddy + Let's Encrypt) lands as GA v1 hardening for trusted-network deployments. SSO (oauth2-proxy or Authelia in front of Caddy) is in the backlog with multi-tenancy.
Trace storage (LLM)	ClickHouse 24	NOT in MVP1 (arrives with Langfuse in MVP3).
Container runtime	Docker 24+ with Compose v2	MVP1 deployment target.
Helm chart	Helm 3	NOT in MVP1 (GA v1; chart maturity is backlog).
Secrets at runtime	Mounted secret files	Never in env vars; see §"Secrets management" below.
Backup target	Encrypted S3-compatible	NOT in MVP1 (operator's responsibility for laptop installs).

CI/CD

Layer	Choice	MVP1 status
CI/CD platform	GitHub Actions	One workflow in MVP1 (`.github/workflows/pr.yml`); five workflows by GA v1.
Container scanning	Trivy	NOT in MVP1 (GA v1).
Python SAST	bandit	NOT in MVP1 (GA v1).
Python deps audit	pip-audit	NOT in MVP1 (GA v1).
TS deps audit	npm audit	NOT in MVP1 (GA v1).
Image signing	cosign (keyless OIDC via GitHub)	NOT in MVP1 (GA v1 per the release matrix; may land earlier in chore_tutorial_polish if cheap).
Branching	Trunk-based	Short-lived feature branches off `main`.
Commit format	Conventional Commits	Auto-generated changelogs in GA v1; MVP1 enforces format via pre-commit.
Versioning	SemVer 2.0	MVP1 = `0.1.0`; the leading zero signals pre-1.0 instability.
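
A commit-format check of the kind a pre-commit hook might run can be sketched with a single regular expression. This is an illustrative approximation of the Conventional Commits header grammar, not the hook the project uses; `ALLOWED_TYPES` and `is_conventional` are hypothetical names, and the accepted type list is an assumption.

```python
import re

# Hypothetical allow-list; which types a real hook accepts is a project choice.
ALLOWED_TYPES = ("feat", "fix", "docs", "chore", "refactor", "test", "ci", "build", "perf")

HEADER_RE = re.compile(
    r"^(?P<type>[a-z]+)"             # type, e.g. feat
    r"(?:\((?P<scope>[^()\s]+)\))?"  # optional (scope)
    r"(?P<breaking>!)?"              # optional breaking-change marker
    r": (?P<subject>\S.*)$"          # colon, one space, non-empty subject
)


def is_conventional(header: str) -> bool:
    """Return True when the first line of a commit message fits the format."""
    match = HEADER_RE.match(header)
    return match is not None and match.group("type") in ALLOWED_TYPES
```

For example, `feat(adapters): add Solr adapter` and `fix!: drop 3.12 support` pass, while `Added stuff` and `feat:missing space` are rejected.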

Conventions

Code organization

Single monorepo: relyloop/relyloop on GitHub.
Top-level structure: backend/, ui/, worker/, migrations/, prompts/, templates/, samples/, scripts/, docs/, tests/.
One test file per source file; mirror the source tree under tests/.
Adapters live under backend/app/adapters/ (engine), backend/llm/ (LLM provider), backend/git/ (Git provider).

Python coding standards

100-character line limit (set explicitly in the ruff config; ruff's default is 88).
Ruff rules: defaults + B (bugbear), S (security/bandit), UP (pyupgrade), D (docstrings on public APIs).
mypy --strict; no Any without explicit annotation.
Public functions, classes, modules have Google-style docstrings.
All Pydantic models have field descriptions (used in OpenAPI auto-generation).
snake_case for variables, functions, modules; PascalCase for classes; SCREAMING_SNAKE for constants.

TypeScript coding standards

100-character line limit (set explicitly via prettier's printWidth; prettier's default is 80).
ESLint Next.js + security + react-hooks plugins.
tsc --strict and noUncheckedIndexedAccess.
camelCase for variables and functions; PascalCase for components and types.

Database conventions

UUIDv7 primary keys on every table (lexicographically sortable, time-ordered, generated client-side).
All timestamps TIMESTAMPTZ, stored UTC.
Soft delete via deleted_at on user-facing tables; hard delete on internal append-only tables (e.g., trials).
snake_case table and column names.
JSONB for flexible structured fields (settings, params, metrics, payloads).
All foreign keys explicit; no implicit relationships.
Indexes on (tenant_id, created_at) for tenant-scoped tables — backlog only; RelyLoop is single-tenant through GA v1 with no tenant_id column.

Logging conventions

Structured JSON via structlog.
Required fields: ts, lvl, msg, service, trace_id, span_id.
msg field draws from a canonical event catalog in backend/app/events.py (MVP3+, per the release matrix).
PII redaction processor runs before emission (MVP3+, per the release matrix).
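
The logging conventions above map naturally onto structlog's processor model, where each processor is a plain callable taking `(logger, method_name, event_dict)` and returning the event dict. The sketch below uses only the standard library so it runs without structlog installed; the processor names, the `api` service value, the `judgment.generated` event, and the e-mail-only redaction rule are all assumptions made for illustration.

```python
import json
import re
from datetime import datetime, timezone
from typing import Any, MutableMapping

EventDict = MutableMapping[str, Any]
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
REQUIRED_FIELDS = ("ts", "lvl", "msg", "service", "trace_id", "span_id")


def add_required_fields(logger: Any, method_name: str, event_dict: EventDict) -> EventDict:
    """Fill the required keys; structlog stores the message under `event`."""
    event_dict.setdefault("ts", datetime.now(timezone.utc).isoformat())
    event_dict["lvl"] = method_name
    event_dict["msg"] = event_dict.pop("event", "")
    event_dict.setdefault("service", "api")  # hypothetical service name
    event_dict.setdefault("trace_id", None)
    event_dict.setdefault("span_id", None)
    return event_dict


def redact_pii(logger: Any, method_name: str, event_dict: EventDict) -> EventDict:
    """Mask e-mail addresses in string values (a deliberately narrow rule)."""
    for key, value in list(event_dict.items()):
        if isinstance(value, str):
            event_dict[key] = EMAIL_RE.sub("[REDACTED]", value)
    return event_dict


def render(event_dict: EventDict) -> str:
    """Run the processors in order, then emit one JSON line."""
    for processor in (add_required_fields, redact_pii):
        event_dict = processor(None, "info", event_dict)
    return json.dumps(event_dict, sort_keys=True)


line = render({"event": "judgment.generated", "note": "reported by a.user@example.com"})
```

Running redaction as the last processor before rendering keeps the rule in one place, so no call site has to remember to scrub its own fields.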

Secrets management

Mounted secret files only — never set in environment variables.
Source of truth: 1Password / Vault / SSM / equivalent (operator's choice).
API keys hashed with Argon2id at rest — backlog (no auth through GA v1).
For MVP1: .env.example enumerates every secret; .env is gitignored; Docker secrets mount each value as a file inside the container.
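
Reading a secret from a mounted file rather than from the environment can be sketched as below. `read_secret` and the `openai_api_key` file name are hypothetical; `/run/secrets` is the default mount point for Docker secrets. Pydantic Settings offers a `secrets_dir` option that covers the same ground declaratively, so this is an illustration of the mechanism, not a claim about how the codebase does it.

```python
from pathlib import Path

SECRETS_DIR = Path("/run/secrets")  # default Docker secrets mount point


def read_secret(name: str, secrets_dir: Path = SECRETS_DIR) -> str:
    """Return the secret stored in `<secrets_dir>/<name>` (hypothetical helper)."""
    path = secrets_dir / name
    try:
        # Secret files often end with a newline; strip surrounding whitespace.
        value = path.read_text(encoding="utf-8").strip()
    except FileNotFoundError as exc:
        raise RuntimeError(f"secret file {path} is not mounted") from exc
    if not value:
        raise RuntimeError(f"secret file {path} is empty")
    return value
```

A caller would use `read_secret("openai_api_key")` at startup and fail fast if the file is missing, so a misconfigured container stops before it serves traffic.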

Reserved for later releases

These appear in the umbrella spec because the spec covers all releases. None of them are MVP1 work. Per-release timing follows the §"Canonical release matrix" above:

MVP2 (Three-Engine + Real Signals), shipped: Apache Solr adapter (Solr 9.x + 10.x; edismax + {!ltr} rescore) and UBI judgments via engine-agnostic UbiReader; pluggable SignalsConverter Protocol (position-bias-corrected CTR, dwell-time threshold, hybrid UBI+LLM); POST /api/v1/judgment-lists/generate-from-ubi + generate_judgments_from_ubi agent tool. (Remaining MVP2 work is Idea-stage ergonomics, tracked in docs/00_overview/planned_features/02_mvp2/.)
MVP3 (Observable): Langfuse + ClickHouse + SigNoz + OpenTelemetry exporters; canonical event catalog; audit_log table + immutability trigger (no users/tenants yet); lineage columns; PII redaction; trace context propagation through API → Redis → worker → adapter → engine.
GA v1 (Production-ready): LangGraph orchestrator + PostgresSaver; full RFC 7807 Problem Details on errors; Idempotency-Key header; full four-layer test pyramid at 90% coverage; complete CI/CD with security gates (Trivy, bandit, pip-audit, npm audit); image signing (cosign); production-style install (TLS via Caddy + Let's Encrypt, managed Postgres/Redis); design-partner references; public Optuna-vs-SRW-grid benchmark. No new product surface — all six differentiators are GA by MVP3.
Backlog: Multi-Git provider abstraction (GitLab, Bitbucket); multi-tenancy (tenants, tenant_memberships, users, api_keys tables; tenant_id columns; roles viewer/runner/tenant_admin/platform_admin); SSO via reverse proxy for humans; Argon2id-hashed bearer API keys for service accounts; native non-OpenAI provider SDKs (Anthropic, AWS Bedrock, Google Vertex AI, Azure OpenAI); LangChain RedisCache for LLM responses; Helm chart maturity; LTR training; Path B (production monitoring, bandits, shadow validation).
Out of scope (no scheduled release): Mobile UI, i18n, WCAG AA gating, Kubernetes-native operator, multi-region.

Cross-references

Per-service topology and message flow: system-overview.md
Postgres schema and conventions: data-model.md
HTTP API conventions (endpoint prefixes, error envelope, pagination): api-conventions.md
Engine adapter Protocol: adapters.md
Docker Compose layout for local dev: deployment.md
