MB-59670: Add GPU support by capemox · Pull Request #385 · blevesearch/zapx

capemox · 2026-03-25T09:14:28Z

New faissGPUFloat32Index implementing faissIndex. It opaquely performs operations on cpu or gpu when appropriate.
Supports training and search on gpu, falls back to cpu when appropriate
Reorganize makeFaissIndex to avoid losing direct map information when transferring to gpu

faiss_vector_cache.go

faiss_vector_index_gpu_float32.go

section_faiss_vector_index.go

CascadingRadium · 2026-03-27T11:00:38Z

hey @capemox, please check the new batchSearch API in the request batcher module.

[v17] Add Request Batcher Module #384

Please add the batcher as an optional object in your struct, with the struct implementing the batchSearch API. That way when we call SearchWithoutIDs we essentially forward the request to the batcher module via its search API.

When it has finished batching and needs to execute a batch, it will call the batchSearch of your GPU struct, which will execute the search on the GPU index. Take special care to NOT involve the batcher if the clone to GPU fails and if the GPU index is unavailable, in which case we do the CPU fallback.

Resolve the comments and fix the merge conflicts ASAP, I am working on resolving merge conflicts on the base pre_gpu branch.

Thanks
cc @abhinavdangeti

CascadingRadium · 2026-03-27T15:26:39Z

Hey @capemox, i think youre right, I was rebasing over fastmerge on local and should have pushed it later on. Have reverted.

… cpu index before cloning from gpu to cpu.

…y checking for gpu existence beforehand

…ts the gpu index from being set to nil before the last flush is called

faiss_vector_index_bivf.go

faiss_vector_request_batcher.go

section_faiss_vector_index.go

CascadingRadium

first pass

faiss_vector_index_gpu_float32.go

Copilot

Pull request overview

Adds GPU acceleration support for Faiss vector indexes by introducing a GPU-backed float32 index wrapper and threading a per-field useGPU option through index creation, merge, and cache-loading paths. It also adjusts the IVF/SQ build flow to preserve direct map / nprobe behavior across GPU↔CPU sync and improves shutdown semantics in the request batcher.

Changes:

Introduce faissGPUFloat32Index and select it via faissIndexFactory when useGPU is enabled for IVF-based float32 indexes.
Plumb useGPU through vector indexing/merging and vector index cache loading/creation.
Update training/build sequence to trainAndAdd() (then set direct map / nprobe) and improve batcher shutdown by waiting for monitor goroutine exit.

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
section_faiss_vector_index.go	Threads `useGPU` into index build/merge; reorganizes IVF/SQ train+add vs add flow.
faiss_vector_request_batcher.go	Adds `doneCh` to allow `stop()` to wait for monitor goroutine exit.
faiss_vector_posting.go	Passes per-field `useGPU` into vector index cache load/create.
faiss_vector_index.go	Updates IVF/SQ interfaces to use `trainAndAdd`.
faiss_vector_index_gpu_float32.go	New GPU-backed float32 index wrapper with async GPU init + batched GPU search + CPU fallback.
faiss_vector_index_float32.go	Adds `trainAndAdd` for CPU float32 index wrapper.
faiss_vector_index_bivf.go	Replaces `train` with `trainAndAdd` for binary IVF wrapper.
faiss_vector_cache.go	Extends cache load/create API to optionally wrap indexes with GPU support.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

faiss_vector_index_float32.go

faiss_vector_index_gpu_float32.go

CascadingRadium

Please pull in minor fixes from the batcher branch as well.

CascadingRadium · 2026-04-03T17:50:21Z

faiss_vector_index_gpu_float32.go

+func (f *faissGPUFloat32Index) setNProbe(nprobe int32) {
+	f.cpuIdx.SetNProbe(nprobe)
+	f.waitGPU()
+	if f.gpuIdx != nil {


I dont think GPU index supports this API. We should be setting nprobe to only the cpu index, since this is in the indexing path and we serialize the cpu index only anyway.

capemox requested review from CascadingRadium, Likith101, Thejas-bhat, abhinavdangeti and maneuvertomars March 25, 2026 09:14

CascadingRadium reviewed Mar 25, 2026

View reviewed changes

faiss_vector_cache.go Outdated Show resolved Hide resolved

faiss_vector_index_gpu_float32.go Outdated Show resolved Hide resolved

faiss_vector_index_gpu_float32.go Outdated Show resolved Hide resolved

CascadingRadium added this to GPU-Accelerated Vector Search Mar 25, 2026

github-project-automation bot moved this to Todo in GPU-Accelerated Vector Search Mar 25, 2026

CascadingRadium requested changes Mar 26, 2026

View reviewed changes

capemox force-pushed the gpu-support branch from 384f13a to e9a5b38 Compare March 27, 2026 15:25

CascadingRadium force-pushed the pre_gpu branch from 4ab241c to 525cec1 Compare March 27, 2026 15:26

capemox force-pushed the gpu-support branch from e9a5b38 to 384f13a Compare March 27, 2026 18:04

capemox added 6 commits March 27, 2026 23:35

add gpu support

595bc9a

encapsulate GPU move failuers

76747c2

move to cpu only after adding the vectors to the gpu index. close old…

bfe938a

… cpu index before cloning from gpu to cpu.

addressing reviews

cd77d91

Merge remote-tracking branch 'origin/pre_gpu' into gpu-support

c939041

add batching support

a13156c

capemox force-pushed the gpu-support branch from 384f13a to a13156c Compare March 30, 2026 18:02

add async move to gpu

1e09c43

capemox requested a review from CascadingRadium March 31, 2026 08:32

capemox self-assigned this Apr 1, 2026

capemox added 4 commits April 1, 2026 10:42

async move to gpu made optional. avoid calling batcher on cpu index b…

bca196d

…y checking for gpu existence beforehand

return from requestBatcher.stop after monitor exits only. this preven…

dc91cc1

…ts the gpu index from being set to nil before the last flush is called

streamline close

2f60999

simplified locking

8d951eb

CascadingRadium reviewed Apr 1, 2026

View reviewed changes

faiss_vector_index_bivf.go Outdated Show resolved Hide resolved

faiss_vector_request_batcher.go Show resolved Hide resolved

CascadingRadium reviewed Apr 1, 2026

View reviewed changes

section_faiss_vector_index.go Show resolved Hide resolved

section_faiss_vector_index.go Outdated Show resolved Hide resolved

address comments

2509ebc

CascadingRadium reviewed Apr 1, 2026

View reviewed changes

code review

ca2f37b

CascadingRadium requested a review from Copilot April 1, 2026 20:59

Copilot started reviewing on behalf of CascadingRadium April 1, 2026 20:59 View session

Copilot AI reviewed Apr 1, 2026

View reviewed changes

faiss_vector_index_float32.go Outdated Show resolved Hide resolved

faiss_vector_index_gpu_float32.go Show resolved Hide resolved

faiss_vector_index_gpu_float32.go Show resolved Hide resolved

address reviews, improve commentary

4470241

CascadingRadium reviewed Apr 3, 2026

View reviewed changes

Conversation

capemox commented Mar 25, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CascadingRadium commented Mar 27, 2026

Uh oh!

CascadingRadium commented Mar 27, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CascadingRadium left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CascadingRadium left a comment

Choose a reason for hiding this comment

Uh oh!

CascadingRadium Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants