
feat(vector-quantization): add RaBitQ support for VectorChord#386

Open
qdrddr wants to merge 2 commits into vectorize-io:main from qdrddr:vchord-rabitq-quantized

Conversation

@qdrddr (Contributor) commented Feb 16, 2026


#385

Add environment variables and validation for vector quantization.
Update alembic migrations to handle dynamic embedding dimensions and a new
migration that converts embeddings to rabitq8/rabitq4 types.
Provide helper to quantize embeddings on insert and query.
Pass quantization flags through memory engine and extension context.
Add docker-compose file reference and ignore it in .gitignore.
Include comprehensive tests for config, helpers and migration.
Document quantization options, trade-offs and usage examples.
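The quantization helper itself is not shown in this excerpt; a minimal sketch of what such a `quantize_embedding` helper could look like, where the function shape and the `rabitq8`/`rabitq4` cast syntax are assumptions inferred from the description, not the PR's actual code:

```python
def quantize_embedding(values, quantization_type=None):
    # Render the embedding as a pgvector-style literal; when a quantization
    # type (e.g. "rabitq8" or "rabitq4") is configured, cast to it so
    # VectorChord stores the quantized representation (hypothetical syntax).
    literal = "[" + ",".join(f"{v:g}" for v in values) + "]"
    cast = quantization_type if quantization_type else "vector"
    return f"'{literal}'::{cast}"
```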
```python
return f'"{schema}".' if schema else ""


def upgrade() -> None:
```
Collaborator:

this new file is not needed - check the migration we do at api startup

```python
text_search_ext = _detect_text_search_extension()

# Read embedding dimension from environment, defaulting to 384
embedding_dimension = int(os.getenv("DEFAULT_EMBEDDING_DIMENSION", "384"))
```
Collaborator:

why?

  1. this env var does not follow the Hindsight config convention
  2. why do we need it? It just adds confusion; let Hindsight derive the correct dimension from the configured embedding model
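One way to follow the reviewer's suggestion is to make the model configuration the single source of truth for the dimension. A sketch, with an illustrative model-to-dimension mapping (Hindsight's actual config API may differ):

```python
# Illustrative mapping from embedding model to its dimension; in practice
# the configured embedding model would be the source of truth.
MODEL_DIMENSIONS = {
    "all-MiniLM-L6-v2": 384,
    "text-embedding-3-small": 1536,
}


def embedding_dimension(model_name: str) -> int:
    # Fail loudly on an unknown model rather than silently falling back,
    # which is what a DEFAULT_EMBEDDING_DIMENSION env var would do.
    try:
        return MODEL_DIMENSIONS[model_name]
    except KeyError:
        raise ValueError(f"unknown embedding model: {model_name}") from None
```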

```python
if embedding_str:
    embedding_expr = quantize_embedding(eval(embedding_str), config.vector_quantization_type)
else:
    embedding_expr = "NULL"
```
Collaborator:
we should raise exception here, something went really wrong

```python
from ..embeddings import quantize_embedding

config = get_config()
if embedding_str:
    embedding_expr = quantize_embedding(eval(embedding_str), config.vector_quantization_type)
```
Collaborator:
eval???
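A safe replacement for the flagged `eval` call is `ast.literal_eval` (or `json.loads`), which parses literals without executing arbitrary code. A sketch:

```python
import ast


def parse_embedding(embedding_str: str) -> list[float]:
    # literal_eval only accepts Python literals, so a malicious or
    # corrupted string raises instead of executing code as eval() would.
    values = ast.literal_eval(embedding_str)
    if not isinstance(values, (list, tuple)):
        raise ValueError(f"expected a vector literal, got {type(values).__name__}")
    return [float(v) for v in values]
```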


```python
from .tags import build_tags_where_clause_simple

# Build vector reference with optional quantization
config = get_config()
```
Collaborator:
use quantize_embedding

```python
# Get config for quantization settings
config = get_config()
# Build vector reference with optional quantization
if config.vector_quantization_enabled:
```
Collaborator:
quantize_embedding


```python
# Convert embedding to string for asyncpg vector type and apply quantization if enabled
from .embeddings import quantize_embedding

config = get_config()
embedding_expr = quantize_embedding(embedding[0], config.vector_quantization_type) if embedding else None
```
Collaborator:
raise if None pls

```python
otel_deployment_environment: str

# Vector quantization (RaBitQ for VectorChord)
vector_quantization_enabled: bool = static(DEFAULT_VECTOR_QUANTIZATION_ENABLED)
```
Collaborator:
since this is specifically for vchord, it should be vector_extension_vchord_quantization_enabled and vector_extension_vchord_quantization_type
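Under this naming suggestion the settings block might look like the following sketch (field defaults are assumptions, not taken from the PR):

```python
from dataclasses import dataclass


@dataclass
class VectorExtensionSettings:
    # Scoped to the vchord extension, per review: these quantization knobs
    # only apply when VectorChord is the vector backend.
    vector_extension_vchord_quantization_enabled: bool = False
    vector_extension_vchord_quantization_type: str = "rabitq8"
```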

