Backend Storage Options

Pangolin supports multiple backend storage options for persisting catalog metadata. Choose the backend that best fits your deployment requirements.

Overview

Backend storage is where Pangolin stores catalog metadata including:

Tenant information
Warehouse configurations
Catalog definitions
Namespace hierarchies
Asset (table/view) metadata
Branch and tag information
Audit logs

Note: Backend storage is separate from warehouse storage (S3, Azure, GCS) which stores the actual data files.

Performance Features

All backends (including Memory/SQLite) now feature:

Metadata Cache: In-memory LRU cache (moka) for high-latency Iceberg metadata files (manifests, snapshots), defaulting to 5-minute TTL.
Object Store Connection Pooling: Reuses S3/GCS/Azure connections to reduce handshake overhead.
Unified Search: Optimized full-text search across Catalogs, Namespaces, and Tables regardless of the backing store.

Quick Comparison

Choosing a Backend

Use In-Memory When:

✅ You're developing locally
✅ You're running tests (unit or integration)
✅ You need instant setup with zero configuration
✅ You're prototyping or learning
✅ Data persistence is not required
✅ You're running in CI/CD pipelines

Use SQLite When:

✅ You're developing locally and need persistence
✅ You need embedded database
✅ You're deploying to edge/IoT devices
✅ You want zero configuration with persistence
✅ You have low concurrent write needs
✅ You want minimal resource usage

Use PostgreSQL When:

✅ You need a proven, battle-tested SQL database
✅ You want strong consistency and ACID guarantees
✅ You're deploying to traditional infrastructure
✅ You need complex queries and joins
✅ You want managed cloud options (RDS, Azure Database, Cloud SQL)

Use MongoDB When:

✅ You prefer document-based storage
✅ You need horizontal scalability
✅ You're building cloud-native applications
✅ You want flexible schema evolution
✅ You're already using MongoDB in your stack

Configuration

Set the backend using the DATABASE_URL environment variable:

# In-Memory (default - no DATABASE_URL needed)
# Just don't set DATABASE_URL

# SQLite
DATABASE_URL=sqlite:///path/to/pangolin.db

# PostgreSQL
DATABASE_URL=postgresql://user:password@localhost:5432/pangolin

# MongoDB
DATABASE_URL=mongodb://user:password@localhost:27017/pangolin

Next Steps

Migration Between Backends

Currently, Pangolin does not provide automated migration tools between backends. If you need to migrate:

Export metadata from source backend (custom script)
Transform to target backend format
Import into target backend

Tip: Start with SQLite for development, then migrate to PostgreSQL or MongoDB for production.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backend Storage Options

Overview

Performance Features

Quick Comparison

Choosing a Backend

Use In-Memory When:

Use SQLite When:

Use PostgreSQL When:

Use MongoDB When:

Configuration

Next Steps

Migration Between Backends

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Backend Storage Options

Overview

Performance Features

Quick Comparison

Choosing a Backend

Use In-Memory When:

Use SQLite When:

Use PostgreSQL When:

Use MongoDB When:

Configuration

Next Steps

Migration Between Backends