Low-Rank Adaptation (LoRA) is a widely adopted method for customizing large-scale language models. In distributed, untrusted training environments, an open source base model user may want to use LoRA weights created by an external contributor, leading to two requirements:
- Base Model User Verification: The user must confirm that the LoRA weights are effective when paired with the intended base model.
- LoRA Contributor Protection: The contributor must keep their proprietary LoRA weights private until compensation is assured.
To solve this, we created zkLoRA a zero-knowledge verification protocol that relies on commitments, succinct proofs, and multi-party inference to verify exact LoRA delta computation for a pre-agreed adapter without exposing LoRA weights.
This implementation uses a native Halo2 backend for transcript-bound proof artifacts. The v2 proof contract verifies exact quantized LoRA delta correctness for the statement the base user actually sent and received, and binds the proof to a pre-inference adapter manifest. It does not claim an end-to-end proof that the base model computed those activations.
Verifier trust boundary: expected_adapters must be obtained and pinned by the verifier out-of-band before inference starts, for example by recording the exact manifest file or digest. A contributor-generated adapter manifest is only a convenience handoff artifact; if it is first generated after inference or supplied only alongside proofs, it is not trusted verifier input.
For detailed information about this research, please refer to our paper.
First, install zkLoRA using pip:
pip install zkloraUse src/scripts/lora_contributor_sample_script.py to:
- Host LoRA submodules
- Write a pre-inference adapter manifest for the verifier to pin out-of-band
- Handle inference requests
- Generate proof artifacts
import argparse
import threading
import time
from zklora import LoRAServer, LoRAServerSocket
def main():
parser = argparse.ArgumentParser(
description=(
"Run a sample LoRA contributor server and write the adapter manifest "
"that the verifier should pin out-of-band before inference."
)
)
parser.add_argument("--host", default="127.0.0.1")
parser.add_argument("--port_a", type=int, default=30000)
parser.add_argument("--base_model", default="distilgpt2")
parser.add_argument("--lora_model_id", default="ng0-k1/distilgpt2-finetuned-es")
parser.add_argument("--out_dir", default="a-out")
parser.add_argument(
"--adapter_manifest",
default="adapter-manifest.json",
help=(
"Convenience manifest handoff path. The verifier must obtain and pin "
"this manifest out-of-band before inference; a post-inference manifest "
"is not trusted expected_adapters input."
),
)
args = parser.parse_args()
stop_event = threading.Event()
server_obj = LoRAServer(args.base_model, args.lora_model_id, args.out_dir)
server_obj.write_adapter_manifest(args.adapter_manifest)
print(f"[A-Server] wrote adapter manifest => {args.adapter_manifest}")
print(
"[A-Server] verifier must pin this manifest out-of-band before inference; "
"post-inference manifests are not trusted expected_adapters."
)
t = LoRAServerSocket(args.host, args.port_a, server_obj, stop_event)
t.start()
try:
while True:
time.sleep(1)
except KeyboardInterrupt:
print("[A-Server] stopping.")
stop_event.set()
t.join()
if __name__ == "__main__":
main()Use src/scripts/base_model_user_sample_script.py to:
- Load and patch the base model
- Connect to A's submodules
- Perform inference
- Trigger proof generation
import argparse
from zklora import BaseModelClient
def main():
parser = argparse.ArgumentParser()
parser.add_argument("--host_a", default="127.0.0.1")
parser.add_argument("--port_a", type=int, default=30000)
parser.add_argument(
"--contributors",
nargs="*",
help="Additional LoRA contributors as host:port",
)
parser.add_argument("--base_model", default="distilgpt2")
parser.add_argument("--combine_mode", choices=["replace","add_delta"], default="add_delta")
args = parser.parse_args()
contributors = [(args.host_a, args.port_a)]
if args.contributors:
for item in args.contributors:
host, port = item.split(":")
contributors.append((host, int(port)))
client = BaseModelClient(
base_model=args.base_model,
combine_mode=args.combine_mode,
contributors=contributors,
)
client.init_and_patch()
# Run inference => triggers remote LoRA calls on A
text = "Hello World, this is a LoRA test."
loss_val = client.forward_loss(text)
print(f"[B] final loss => {loss_val:.4f}")
# End inference => A finalizes native zkLoRA proof artifacts
client.end_inference()
client.transcript.write("b-transcript.json")
print("[B] done. B can now fetch proof files from A and verify them against b-transcript.json and the pre-agreed adapter manifest.")
if __name__=="__main__":
main()Use src/scripts/verify_proofs.py to validate the proof artifacts:
--expected_adapters must point to the verifier's pinned pre-inference adapter manifest. Do not accept a contributor manifest that was generated after inference, or first delivered with the proof bundle, as trusted verifier input; it is useful only as a handoff artifact to compare against the pinned expectation.
#!/usr/bin/env python3
"""
Verify LoRA proof artifacts in a given directory.
Example usage:
python verify_proofs.py --proof_dir a-out --transcript b-transcript.json --expected_adapters adapter-manifest.json --verbose
"""
import argparse
from zklora import batch_verify_proofs
def main():
parser = argparse.ArgumentParser(
description="Verify LoRA proof artifacts in a given directory."
)
parser.add_argument(
"--proof_dir",
type=str,
default="proof_artifacts",
help="Directory containing native .zklora proof artifacts."
)
parser.add_argument(
"--transcript",
type=str,
required=True,
help="Base user transcript JSON captured during inference."
)
parser.add_argument(
"--expected_adapters",
type=str,
required=True,
help=(
"Verifier-pinned pre-inference adapter manifest JSON. This must be "
"obtained out-of-band before inference, not first supplied with proofs."
)
)
parser.add_argument(
"--verbose",
action="store_true",
help="Print more details during verification."
)
args = parser.parse_args()
total_verify_time, num_proofs = batch_verify_proofs(
proof_dir=args.proof_dir,
transcript=args.transcript,
expected_adapters=args.expected_adapters,
verbose=args.verbose
)
print(f"Done verifying {num_proofs} proofs. Total time: {total_verify_time:.2f}s")
if __name__ == "__main__":
main()zkLoRA includes a robust polynomial commitment scheme for securely committing to neural network activations without revealing the underlying data. This cryptographic primitive enables privacy-preserving verification of computations.
from zklora import commit_activations, verify_commitment
# Commit to activation data stored in JSON format
commitment = commit_activations("activations.json")
# Verify the commitment against original data
is_valid = verify_commitment("activations.json", commitment)
assert is_validThe polynomial commitment scheme provides several key properties:
- Zero-Knowledge: Commitments reveal no information about the underlying activation data
- Binding: Once created, commitments cannot be changed to refer to different data
- Deterministic Verification: Given the same data and nonce, verification is consistent
- Cryptographic Security: Uses BLAKE3 hashing and polynomial arithmetic over finite fields
Committing to Different Data Types:
import json
from zklora import commit_activations, verify_commitment
# Example with floating point activations
activation_data = {
"input_data": [1.5, 2.7, -3.14, 0.0, 42.8]
}
with open("float_activations.json", "w") as f:
json.dump(activation_data, f)
commitment = commit_activations("float_activations.json")
assert verify_commitment("float_activations.json", commitment)
# Example with nested activation structures (automatically flattened)
nested_data = {
"input_data": [[1, 2], [3, [4, 5]], 6]
}
with open("nested_activations.json", "w") as f:
json.dump(nested_data, f)
nested_commitment = commit_activations("nested_activations.json")
assert verify_commitment("nested_activations.json", nested_commitment)Batch Processing for Multiple Modules:
import os
from zklora import commit_activations, verify_commitment
# Commit to activations from multiple LoRA modules
module_commitments = {}
activation_files = ["module1_acts.json", "module2_acts.json", "module3_acts.json"]
for file_path in activation_files:
if os.path.exists(file_path):
commitment = commit_activations(file_path)
module_commitments[file_path] = commitment
print(f"Committed to {file_path}: {commitment[:50]}...")
# Verify all commitments
for file_path, commitment in module_commitments.items():
is_valid = verify_commitment(file_path, commitment)
print(f"Verification for {file_path}: {'✓ VALID' if is_valid else '✗ INVALID'}")Understanding Commitment Structure:
import json
from zklora import commit_activations
# Create a commitment and examine its structure
commitment_str = commit_activations("activations.json")
commitment_data = json.loads(commitment_str)
print("Commitment structure:")
print(f"Root hash: {commitment_data['root']}") # Merkle tree root
print(f"Nonce: {commitment_data['nonce']}") # Cryptographic nonce
print(f"Root length: {len(commitment_data['root'])}") # 66 chars (0x + 64 hex)
print(f"Nonce length: {len(commitment_data['nonce'])}") # 66 chars (0x + 64 hex)- Collision Resistance: Different activation datasets produce different commitments
- Hiding Property: Commitments reveal no information about the committed data
- Non-Malleability: Cannot modify commitments without detection
- Efficient Verification: Verification scales logarithmically with data size
- Activation Integrity: Ensure base model activations haven't been tampered with
- Privacy-Preserving Audits: Allow verification without revealing sensitive data
- Multi-Contributor Scenarios: Enable secure collaboration between multiple LoRA providers
- Proof Generation: Create verifiable evidence of correct computation
Run unit tests with:
pytestFor detailed information about the codebase organization and implementation details, see Code Structure.
| ✓ | Trust-Minimized Delta Verification: Zero-knowledge proofs validate exact quantized LoRA deltas for a pre-agreed adapter |
| ✓ | Native Halo2 Backend: Proofs no longer depend on EZKL/ONNX artifacts |
| ✓ | Multi-Party Inference: Protected activation exchange between parties |
| ✓ | Adapter Weight Privacy: LoRA weights remain confidential while the committed adapter identity is checked |
| ✓ | Benchmark Required: Real-shape proving and verification performance should be measured for each deployment target (see benchmarks/run_benchmarks.py and cargo run --release --example bench_prove) |
| ✓ | Fast Native Backend (v3): Lookup-based range checks, shape-keyed SRS/key caches, and parallel batch proving/verification deliver order-of-magnitude speedups over v2 while keeping the same statement format and adapter commitment scheme |
Polynomial commitments for base model activations and multi-contributor LoRA scenarios are supported starting in version 0.1.2.
zkLoRA is built upon these outstanding open source projects:
| Project | Description |
|---|---|
| PEFT | Parameter-Efficient Fine-Tuning library by Hugging Face |
| Transformers | State-of-the-art Natural Language Processing |
| dusk-merkle | Merkle tree implementation in Rust |
| BLAKE3 | Cryptographic hash function |
| Halo2 | Native zero-knowledge proving system used by the zkLoRA backend |
Licensed under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International
