fix for bug causing tests to use FP vectors in search #645

Open
MarkWolters wants to merge 2 commits into main from fix_fused_recall

Conversation

@MarkWolters (Contributor)

Investigation Complete: Root Cause Found and Fixed

Key Findings

1. The "Recall Degradation" Was Actually a Test Harness Bug

Test results showed:

  • BenchYAML + fusedGraph: No → recall: 0.65
  • BenchYAML + fusedGraph: Yes → recall: 0.65
  • AutoBenchYAML + fusedGraph: No → recall: 0.77 ← ANOMALY
  • AutoBenchYAML + fusedGraph: Yes → recall: 0.65

The anomaly was that AutoBenchYAML with fusedGraph:No produced artificially high recall (0.77), not that FusedPQ was producing low recall. All configurations using FusedPQ correctly produced consistent recall of 0.65.

2. Root Cause: Missing Vector Encoding in runAllAndCollectResults

Location: jvector-examples/src/main/java/io/github/jbellis/jvector/example/Grid.java, line 833

The Bug:

CompressedVectors cvArg = (searchCompressorObj instanceof CompressedVectors) ? (CompressedVectors) searchCompressorObj : null;

This line attempted to cast a VectorCompressor (ProductQuantization) to CompressedVectors, which always failed, resulting in cvArg = null. When cvArg is null and fusedGraph is disabled, the ConfiguredSystem.scoreProviderFor method falls back to exact scoring using uncompressed vectors (line 1084 / 1094), which artificially inflates recall.
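The failure mode above can be reproduced in isolation. The sketch below uses hypothetical stand-in interfaces (`CompressedVectors`, `VectorCompressor`, and an `encodeAll` signature assumed from the PR description, not jvector's real classes) to show why the cast-based assignment always yields null while the encode-based assignment succeeds:

```java
// Minimal sketch with hypothetical stand-in types (not jvector's actual API)
// demonstrating the buggy vs. fixed assignment patterns from this PR.
public class CastDemo {
    interface CompressedVectors {}
    interface VectorCompressor {
        CompressedVectors encodeAll(float[][] baseVectors); // assumed signature
    }
    static class ProductQuantization implements VectorCompressor {
        public CompressedVectors encodeAll(float[][] baseVectors) {
            return new CompressedVectors() {}; // placeholder for real encoding
        }
    }

    static CompressedVectors buggyAssign(Object searchCompressorObj) {
        // Buggy pattern: a compressor never implements CompressedVectors,
        // so the instanceof check fails and this silently returns null.
        return (searchCompressorObj instanceof CompressedVectors)
                ? (CompressedVectors) searchCompressorObj : null;
    }

    static CompressedVectors fixedAssign(Object searchCompressorObj, float[][] base) {
        // Fixed pattern: encode the base vectors through the compressor
        // instead of casting the compressor itself.
        return searchCompressorObj == null
                ? null
                : ((VectorCompressor) searchCompressorObj).encodeAll(base);
    }

    public static void main(String[] args) {
        Object pq = new ProductQuantization();
        System.out.println("buggy cvArg == null: " + (buggyAssign(pq) == null));
        System.out.println("fixed cvArg == null: "
                + (fixedAssign(pq, new float[][]{{1f, 2f}}) == null));
    }
}
```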

The Fix:

// Encode vectors for reranking if a compressor is provided
CompressedVectors cvArg;
if (searchCompressorObj == null) {
    cvArg = null;
} else {
    cvArg = searchCompressorObj.encodeAll(ds.getBaseRavv());
}

This fix properly encodes the vectors using the compressor, matching the behavior of runOneGraph (the correct implementation used by BenchYAML).
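To make the downstream effect concrete, here is a heavily simplified, hypothetical stand-in for the selection logic described above for ConfiguredSystem.scoreProviderFor (not the real jvector code): when no compressed vectors are supplied and the fused graph is disabled, scoring silently degrades to exact full-precision similarity, which is what inflated the measured recall.

```java
// Hypothetical sketch of the fallback behavior described in this PR;
// the real scoreProviderFor is more involved.
public class FallbackDemo {
    enum Scoring { APPROXIMATE_PQ, EXACT_FULL_PRECISION }

    static Scoring scoreProviderFor(Object compressedVectors, boolean fusedGraph) {
        if (fusedGraph) {
            return Scoring.APPROXIMATE_PQ; // fused path carries its own PQ codes
        }
        // With no compressed vectors, fall back to exact scoring on
        // uncompressed vectors -- the source of the inflated 0.77 recall.
        return compressedVectors == null
                ? Scoring.EXACT_FULL_PRECISION
                : Scoring.APPROXIMATE_PQ;
    }

    public static void main(String[] args) {
        // Before the fix: cvArg was always null, so fusedGraph:No ran exact scoring.
        System.out.println(scoreProviderFor(null, false));
        // After the fix: encoded vectors are passed, restoring approximate scoring.
        System.out.println(scoreProviderFor(new Object(), false));
    }
}
```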

Impact

  • Before fix: AutoBenchYAML with fusedGraph:No was using exact (uncompressed) scoring, giving misleadingly high recall
  • After fix: AutoBenchYAML will now correctly use PQ-compressed vectors for approximate scoring during graph traversal, with NVQ reranking, producing consistent recall across all test harnesses

Verification

After applying this fix:

  • AutoBenchYAML + fusedGraph:No → recall: ~0.65 (matching other configurations)
  • All test harnesses will produce consistent recall measurements
  • The perceived "recall degradation with high dimensionality" will disappear, as it was never a real issue with FusedPQ

Additional Notes

The investigation also examined the FusedPQ implementation thoroughly and confirmed that:

  • The storage and retrieval of fused PQ data is correct
  • The scoring mathematics are correct
  • The SIMD implementations are correct
  • FusedPQ works correctly for all dimensionalities when properly configured

github-actions bot (Contributor) commented Mar 11, 2026

Before you submit for review:

  • Does your PR follow guidelines from CONTRIBUTIONS.md?
  • Did you summarize what this PR does clearly and concisely?
  • Did you include performance data for changes which may be performance impacting?
  • Did you include useful docs for any user-facing changes or features?
  • Did you include useful javadocs for developer oriented changes, explaining new concepts or key changes?
  • Did you trigger and review regression testing results against the base branch via Run Bench Main?
  • Did you adhere to the code formatting guidelines (TBD)?
  • Did you group your changes for easy review, providing meaningful descriptions for each commit?
  • Did you ensure that all files contain the correct copyright header?

If you did not complete any of these, then please explain below.

@tlwillke (Collaborator) left a comment

Can you please take a look elsewhere in Grid to see if we are vulnerable to the same failure outside of AutoBenchYAML usage?

@tlwillke tlwillke self-requested a review March 19, 2026 01:47
@tlwillke (Collaborator) left a comment

LGTM. Thanks for being thorough.
