Embedding the SQLRite engine

SQLRite ships as a library that other programs can embed — the REPL and desktop app are just thin UIs over the same core. Phase 5 built out the embedding surface across every reasonable runtime; Phase 7g/7h layered the natural-language ask() family + an MCP server on top.

✅ Phase 5a — stable public Rust API (Connection / Statement / Rows / Row / Value) plus structured row return. Parameter binding + a streaming cursor abstraction are deferred to 5a.2.
✅ Phase 5b — C FFI shim (libsqlrite_c.{so,dylib,dll} + cbindgen-generated sqlrite.h) that every non-Rust SDK binds against.
✅ Phase 5c – 5e — Python (PyO3 → PyPI), Node.js (napi-rs → npm), Go (cgo against the FFI shim → git tag) SDKs published to their respective registries.
⏳ Phase 5f — Rust crate polish (deferred — Phase 6c shipped the actual crates.io publish; 5f's polish work folded into ongoing maintenance).
✅ Phase 5g — WASM build (@joaoh82/sqlrite-wasm on npm) so the engine runs entirely in a browser tab.
✅ Phase 7g — ask() natural-language → SQL family across every embedding surface — see ask.md.
✅ Phase 7h — sqlrite-mcp, a Model Context Protocol stdio server that wraps a database for LLM agents (Claude Code, Cursor, mcp-inspector, …) without any custom integration code on the LLM side. Sibling product to the SDKs, not a replacement — use the SDKs when your code drives the database, use the MCP server when an LLM agent drives it.

See roadmap.md for the detailed Phase 5 breakdown.

The Rust public API (Phase 5a)

use sqlrite::Connection;

// Open a file-backed connection (exclusive lock, auto-save on every
// write). File is materialized if it doesn't exist.
let mut conn = Connection::open("foo.sqlrite")?;

// …or open read-only (shared lock, multi-reader safe):
let ro = Connection::open_read_only("foo.sqlrite")?;

// …or spin up a transient in-memory DB (no file, no locks):
let mem = Connection::open_in_memory()?;

// `execute` — parses and runs one SQL statement. Returns the status
// message the engine produced ("INSERT Statement executed." etc.).
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT);")?;
conn.execute("INSERT INTO users (name) VALUES ('alice');")?;

// `prepare` + `query` — typed row iteration.
let mut stmt = conn.prepare("SELECT id, name FROM users;")?;
let mut rows = stmt.query()?;
while let Some(row) = rows.next()? {
    let id: i64 = row.get(0)?;
    let name: String = row.get_by_name("name")?;
    println!("{id}: {name}");
}

Typed row access

Row::get::<T>(idx) and Row::get_by_name::<T>(name) both go through the FromValue trait. Built-in impls cover:

i64, f64, String, bool
Option<T> — NULL columns resolve to None
raw Value — fall-through for untyped access

FromValue is pub trait, so downstream crates can impl it for their own types (custom enums, chrono timestamps, etc.) without a PR to SQLRite.

Transactions through the public API

Transactions go through the same execute call as everything else — BEGIN, COMMIT, and ROLLBACK are just SQL statements. Connection::in_transaction() exposes the flag for callers that want to branch on it.

conn.execute("BEGIN;")?;
conn.execute("INSERT INTO users (name) VALUES ('bob');")?;
conn.execute("INSERT INTO users (name) VALUES ('carol');")?;
if looks_good {
    conn.execute("COMMIT;")?;   // one WAL commit frame for both inserts
} else {
    conn.execute("ROLLBACK;")?; // restores pre-BEGIN snapshot
}

See Phase 4f notes in roadmap.md for the snapshot semantics and the auto-rollback-on-failed-COMMIT guarantee.

Sharing one database across threads

Phase 11.1. Connection is a thin handle over the engine state. Call Connection::connect() to mint a sibling handle that shares the same in-memory tables and persistent pager — typically one handle per worker thread. Connection: Send + Sync, so the handles can be moved across threads without an outer Mutex.

use sqlrite::Connection;

let mut primary = Connection::open("foo.sqlrite")?;
primary.execute("CREATE TABLE log (id INTEGER PRIMARY KEY, who TEXT);")?;

let mut writers = Vec::new();
for tid in 0..4 {
    let mut conn = primary.connect();   // sibling handle, same backing DB
    writers.push(std::thread::spawn(move || {
        conn.execute(&format!("INSERT INTO log (who) VALUES ('t{tid}');"))
    }));
}
for h in writers { h.join().unwrap()?; }
# Ok::<(), sqlrite::SQLRiteError>(())

Today every commit still serializes through the per-database mutex (and the pager's existing process-level flock); the goal of 11.1 is capability, not throughput. True multi-writer throughput on disjoint rows arrives with BEGIN CONCURRENT in 11.4 — see below, plus the canonical docs/concurrent-writes.md reference for the full Phase 11 surface.

Per-handle state — the prepared-statement cache (LRU populated by prepare_cached), the cache capacity setter — stays on each handle, by design (no extra mutex traffic for a per-thread accelerator). The shared state is the Database (tables, pager, transaction snapshot, auto-VACUUM threshold).

Concurrent writes via `BEGIN CONCURRENT` (Phase 11.4)

Canonical reference: docs/concurrent-writes.md — the full Phase 11 user-facing reference (conceptual model, SQL, SDK error mapping, durability, limitations). The summary below is the embedding-API view of the same surface.

Runnable example: examples/rust/concurrent_writers.rs — interleaved BEGINs across two sibling handles, demonstrating both the disjoint-row happy path and the same-row retry. Run with cargo run --example concurrent_writers.

Multi-writer concurrency is opt-in: PRAGMA journal_mode = mvcc; once per database, then each writer wraps its work in BEGIN CONCURRENT; … COMMIT;. Sibling Connection::connect handles can each hold their own open BEGIN CONCURRENT; commits are validated against the MvStore version index and abort with SQLRiteError::Busy if another writer superseded one of our rows.

use sqlrite::{Connection, SQLRiteError};

fn transfer(primary: &mut Connection, src: i64, dst: i64, amount: i64)
    -> Result<(), SQLRiteError>
{
    let mut conn = primary.connect();
    loop {
        conn.execute("BEGIN CONCURRENT")?;
        conn.execute(&format!(
            "UPDATE accounts SET balance = balance - {amount} WHERE id = {src}"
        ))?;
        conn.execute(&format!(
            "UPDATE accounts SET balance = balance + {amount} WHERE id = {dst}"
        ))?;
        match conn.execute("COMMIT") {
            Ok(_) => return Ok(()),
            Err(e) if e.is_retryable() => continue, // Busy / BusySnapshot
            Err(e) => return Err(e),
        }
    }
}

The retryable-error branch is the headline new flow: pick a backoff policy that suits your workload (constant, exponential, jittered) and call the same closure again. SQLRiteError::is_retryable() covers both Busy and BusySnapshot so callers don't have to match each variant individually.

Memory bounding. Every successful commit triggers a per-row GC sweep over the write-set's chains, reclaiming versions no in-flight reader can possibly see anymore. For workloads where you want a deterministic full drain (memory-pressure testing, debug snapshots), call conn.vacuum_mvcc() — returns the count of versions reclaimed across the whole store. Both paths are correct against in-flight readers: a reader holding BEGIN CONCURRENT; SELECT … keeps every version its begin_ts snapshot needs.

What shipped after 11.4:

11.5 — reads inside the transaction see the BEGIN-time snapshot through Statement::query / Statement::query_with_params as well as Connection::execute("SELECT…").
11.6 — per-commit GC + Connection::vacuum_mvcc() bound version-chain growth.
11.7 + 11.8 — every SDK (C FFI / Python / Node / Go) propagates Busy / BusySnapshot as a typed retryable error; the FFI's sqlrite_connect_sibling, Python's Connection.connect(), and Node's db.connect() mint sibling handles that share backing state.
11.9 — every successful BEGIN CONCURRENT commit writes a typed MvccCommitBatch frame to the WAL (covered by the same fsync as the legacy page commit), and reopen replays those frames into MvStore so the conflict-detection window survives a process restart.
11.11a — the REPL ships .spawn / .use / .conns for interactive multi-handle demos; the prompt shows the active handle.

What's deferred (see docs/concurrent-writes.md for the full list): DDL inside BEGIN CONCURRENT, CREATE INDEX while journal_mode = mvcc, cross-process MVCC, the checkpoint-drain path that would re-enable set_journal_mode(Mvcc → Wal), and the "N concurrent writers" benchmark workload (carved out as Phase 11.11b).

What's deferred

Parameter binding. stmt.query(&[&"alice"]) is the intended shape but the current implementation takes no arguments — use string interpolation for now. Parameter binding lands with the cursor refactor.
Cursor abstraction. The Pager still eagerly loads every row at open time; Rows today wraps an in-memory Vec. Phase 5a's follow-up refactor streams rows through the B-Tree on demand — same public API, much lower memory for big SELECTs.

Natural-language → SQL — `sqlrite::ask` + `sqlrite-ask`

Phases 7g.1 + 7g.2. The companion crate sqlrite-ask provides the LLM-talking machinery (provider adapters, prompt construction, response parsing) over a deliberately small surface — &str schema in, generated SQL out. The engine wraps it under a new ask feature (default-on) so library users get the ergonomic Connection::ask form without composing the schema dump themselves.

[dependencies]
# `ask` is a default feature on sqlrite-engine; opt out with
# default-features = false if you don't want the LLM stack pulled in.
sqlrite-engine = "0.1"
sqlrite-ask    = "0.1"

use sqlrite::{Connection, ConnectionAskExt};
use sqlrite_ask::{AskConfig, AskResponse};

let conn = Connection::open("foo.sqlrite")?;
let cfg  = AskConfig::from_env()?;          // reads SQLRITE_LLM_API_KEY etc.
let resp: AskResponse = conn.ask("How many users are over 30?", &cfg)?;

println!("Generated SQL: {}", resp.sql);
println!("Rationale: {}",     resp.explanation);
println!("Tokens: in={}, out={}, cache_hit={}",
    resp.usage.input_tokens,
    resp.usage.output_tokens,
    resp.usage.cache_read_input_tokens);

// Caller decides whether to execute the generated SQL — the library
// does NOT auto-execute. SDK convenience wrappers (Python's
// `conn.ask_run()`, Node's `db.askRun()`, etc.) add a one-shot
// generate-and-execute helper, but the default Rust API is
// "generate, return, let me decide".
let mut conn = conn;  // need &mut for execute
let _ = conn.execute(&resp.sql)?;

Where what lives:

Crate	Provides
`sqlrite-engine` (with `ask` feature)	`sqlrite::ConnectionAskExt`, `sqlrite::ask::ask` / `ask_with_database` / `ask_with_provider` / `ask_with_database_and_provider`, `sqlrite::ask::schema::dump_schema_for_connection` / `_for_database`. Pure engine-side glue: dump schema → call into `sqlrite-ask`.
`sqlrite-ask`	`ask_with_schema` / `ask_with_schema_and_provider`, `AskConfig`, `AskResponse`, `AskError`, `Provider` trait + `AnthropicProvider`. Pure `&str` inputs, no engine dep — keeps the LLM stack independently testable + plugable.

Provider: Anthropic only in 7g.1; the Provider trait lets OpenAI / Ollama slot in without touching consumers. AnthropicProvider does sync ureq POSTs to /v1/messages. Defaults: claude-sonnet-4-6, max_tokens: 1024, 5-minute prompt-cache TTL on the schema dump (configurable via AskConfig::cache_ttl / SQLRITE_LLM_CACHE_TTL).

Why the split (Phase 7g.2 retro): the REPL binary needed to import the LLM crate to wire up .ask, but sqlrite-ask 0.1.18 imported sqlrite-engine for the Connection integration. That's a cargo cycle (engine[bin] → sqlrite-ask → engine[lib]) — even with optional = true, the static cycle detector rejects the graph. Flipping the dep direction broke it: sqlrite-ask is pure now, the engine carries the integration weight behind a feature flag. See docs/roadmap.md for the full retrospective.

REPL surface (7g.2): type .ask <question> at the prompt. Prints generated SQL + rationale, asks Run? [Y/n], executes through the same process_command pipeline as a typed statement on confirm. Requires SQLRITE_LLM_API_KEY.

Per-product wrappers shipped across 7g.3 – 7g.8 — desktop "Ask…" composer, Python/Node/Go SDKs' conn.ask() / db.ask(), WASM SDK with the split JS-callback shape (so the API key stays out of the browser tab; see ask-backend-examples.md for backend templates), and the MCP ask tool exposed by sqlrite-mcp. ask.md is the canonical reference covering all of them.

The C FFI (Phase 5b)

The sqlrite-ffi/ crate wraps the Rust API in a C ABI that every non-Rust SDK binds against. Build the shared library:

cargo build --release -p sqlrite-ffi
# produces target/release/libsqlrite_c.{so,dylib,dll} + libsqlrite_c.a

The matching sqlrite.h is generated by cbindgen at build time and committed at sqlrite-ffi/include/sqlrite.h so consumers can grab it without running cargo themselves.

#include "sqlrite.h"

struct SqlriteConnection *conn;
sqlrite_open_in_memory(&conn);
sqlrite_execute(conn, "CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)");
sqlrite_execute(conn, "INSERT INTO users (name) VALUES ('alice')");

struct SqlriteStatement *stmt;
sqlrite_query(conn, "SELECT id, name FROM users", &stmt);
while (sqlrite_step(stmt) == Row) {
    int64_t id;
    char *name;
    sqlrite_column_int64(stmt, 0, &id);
    sqlrite_column_text(stmt, 1, &name);
    printf("%lld %s\n", (long long)id, name);
    sqlrite_free_string(name);  // heap-allocated text columns must be freed
}
sqlrite_finalize(stmt);
sqlrite_close(conn);

The runnable examples/c/hello.c walks through all of this end-to-end (cd examples/c && make run).

Memory rules

API	Ownership of return
`sqlrite_open*`	caller frees via `sqlrite_close`
`sqlrite_query`	caller frees via `sqlrite_finalize`
`sqlrite_column_text`	caller frees via `sqlrite_free_string`
`sqlrite_column_name`	caller frees via `sqlrite_free_string`
`sqlrite_last_error`	library-owned thread-local, do not free

Error handling

Every mutating call returns a SqlriteStatus int. Ok = 0; nonzero means check sqlrite_last_error() for the descriptive message. sqlrite_step additionally returns Row (102) to signal a row is available or Done (101) when the query is exhausted.

Language SDKs (Phases 5c – 5g)

Shape stays consistent across bindings — connect(path) → cursor/prepare → execute / query / iteration, plus explicit transaction statements.

Python (Phase 5c) ✅

sdk/python/ — PyO3 (abi3-py38) + maturin, PEP 249 / stdlib-sqlite3 shape:

import sqlrite

with sqlrite.connect("foo.sqlrite") as conn:
    cur = conn.cursor()
    cur.execute("INSERT INTO users (name) VALUES ('alice')")
    for row in cur.execute("SELECT id, name FROM users"):
        print(row)  # tuples

The Python binding wraps the Rust Connection directly (not via the C FFI) — PyO3 marshals types cheaper than a C round-trip. Build via cd sdk/python && maturin develop; tests via python -m pytest sdk/python/tests/. Phase 6f publishes abi3-py38 wheels to PyPI on every release via OIDC trusted publishing.

Full API tour: sdk/python/README.md; runnable walkthrough: examples/python/hello.py.

Node.js (Phase 5d) ✅

sdk/nodejs/ — napi-rs 2.x (N-API v9, Node 18+), better-sqlite3-style sync API:

import { Database } from 'sqlrite';

const db = new Database('foo.sqlrite');
db.prepare("INSERT INTO users (name) VALUES ('alice')").run();
for (const row of db.prepare('SELECT id, name FROM users').all()) {
  console.log(row); // { id: 1, name: 'alice' } — object per row
}
db.close();

Unlike the C SDK, the Node.js binding wraps the Rust Connection directly (via napi-rs, no C FFI hop). Phase 6g publishes prebuilt .node binaries per platform under the @joaoh82/sqlrite scope on npm, with sigstore-signed provenance attestations via OIDC trusted publishing — no node-gyp install step on the user side. TypeScript definitions (index.d.ts) auto-generated from the Rust source.

Full API tour: sdk/nodejs/README.md; runnable walkthrough: examples/nodejs/hello.mjs.

Go (Phase 5e) ✅

sdk/go/ — cgo-linked against libsqlrite_c from Phase 5b, implementing database/sql/driver:

import (
    "database/sql"
    _ "github.com/joaoh82/rust_sqlite/sdk/go"
)

db, _ := sql.Open("sqlrite", "foo.sqlrite")
_, _ = db.Exec("INSERT INTO users (name) VALUES ('alice')")
rows, _ := db.Query("SELECT id, name FROM users")
for rows.Next() {
    var id int64; var name string
    rows.Scan(&id, &name)
}

Unlike the Python and Node.js bindings, Go goes through the C ABI — cgo is Go's FFI shape, so leveraging the existing sqlrite-ffi shim is natural and free. The driver implements every major database/sql/driver interface (Driver, Conn, Stmt, Rows, Tx, plus the context-aware variants), so every standard library construct works: QueryRow, Prepare, transactions via db.Begin(), *sql.Stmt.ExecContext, etc.

Full API tour: sdk/go/README.md; runnable walkthrough: examples/go/hello.go.

WASM (Phase 5g) ✅

sdk/wasm/ — wasm-bindgen compiles the Rust engine directly to wasm32-unknown-unknown. The whole database runs in a browser tab:

import init, { Database } from '@joaoh82/sqlrite-wasm';
await init();

const db = new Database();
db.exec("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)");
db.exec("INSERT INTO users (name) VALUES ('alice')");
const rows = db.query("SELECT id, name FROM users");
// → [{ id: 1, name: 'alice' }]

In-memory only in the MVP — file-backed mode needs OS file locks and a -wal sidecar that don't exist in a tab's sandbox. OPFS-backed persistence is a natural follow-up.

Build locally via wasm-pack build --target web --release (or bundler / nodejs). Phase 6h publishes @joaoh82/sqlrite-wasm (bundler target) to npm via wasm-pack build + npm publish (OIDC trusted publisher) on every release.

The root engine crate is feature-gated (cli for rustyline/clap/env_logger; file-locks for fs2) so default-features = false strips out everything that wouldn't compile on wasm32-unknown-unknown.

Full API tour: sdk/wasm/README.md; runnable browser demo: examples/wasm/ (cd examples/wasm && make spins up a local SQL console).

A fix in the Rust engine propagates through one wrapper update per language rather than four separate binding rewrites.

Distribution (Phase 6)

Phase 6 lands GitHub Actions CI + release automation:

crates.io — sqlrite-engine crate (published under a different name from the sqlrite lib target because the short name was already taken; users cargo add sqlrite-engine but still write use sqlrite::…)
PyPI — sqlrite wheels (manylinux x86_64/aarch64, macOS universal, Windows x86_64)
npm — @joaoh82/sqlrite (Node) + @joaoh82/sqlrite-wasm (browser) packages
Go modules — sdk/go/v* git tags
GitHub Releases — Tauri desktop builds + C FFI prebuilt libraries

A single git tag v0.2.0 && git push --tags turns into the full cross-platform, cross-language release.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Embedding the SQLRite engine

The Rust public API (Phase 5a)

Typed row access

Transactions through the public API

Sharing one database across threads

Concurrent writes via `BEGIN CONCURRENT` (Phase 11.4)

What's deferred

Natural-language → SQL — `sqlrite::ask` + `sqlrite-ask`

The C FFI (Phase 5b)

Memory rules

Error handling

Language SDKs (Phases 5c – 5g)

Python (Phase 5c) ✅

Node.js (Phase 5d) ✅

Go (Phase 5e) ✅

WASM (Phase 5g) ✅

Distribution (Phase 6)

FilesExpand file tree

embedding.md

Latest commit

History

embedding.md

File metadata and controls

Embedding the SQLRite engine

The Rust public API (Phase 5a)

Typed row access

Transactions through the public API

Sharing one database across threads

Concurrent writes via BEGIN CONCURRENT (Phase 11.4)

What's deferred

Natural-language → SQL — sqlrite::ask + sqlrite-ask

The C FFI (Phase 5b)

Memory rules

Error handling

Language SDKs (Phases 5c – 5g)

Python (Phase 5c) ✅

Node.js (Phase 5d) ✅

Go (Phase 5e) ✅

WASM (Phase 5g) ✅

Distribution (Phase 6)

Concurrent writes via `BEGIN CONCURRENT` (Phase 11.4)

Natural-language → SQL — `sqlrite::ask` + `sqlrite-ask`