docs(pr-x12): audit findings for hallucinations in PR-X12 docs#200

Merged
AdaWorldAPI merged 1 commit into master from claude/audit-pr-x12-hallucinations-wN5kw
May 25, 2026

@AdaWorldAPI AdaWorldAPI commented May 22, 2026

Summary

Adds .claude/PR-X12-docs-audit.md — a recommendations-only audit of the PR-X12 doc set for hallucinated / unverified claims.

Contents of the audit doc:

  • Full findings table (26 rows) — each claim tagged verified / unverified / hallucinated, with source anchor where one exists.
  • Verified-real anchors — the subset of PR-X12 claims that map to concrete code / commits / external sources.
  • Architectural truths — load-bearing invariants the docs correctly capture.
  • Doc-by-doc concentration ranking — which PR-X12 files carry the most unverified material.
  • Three-tier cleanup recommendations — drop / qualify / verify.
  • Explicit "what was NOT verified" scope boundary — so a reader can tell what the audit is silent on.
  • Single load-bearing-paragraph summary at the top.

Scope

  • No edits to any audited PR-X12 doc — per the audit task's hard rules. This PR only adds the audit artifact under .claude/.
  • Cleanup actions (drop / qualify / verify) are deliberately left as follow-up work so they can be reviewed claim-by-claim rather than swept in with the audit itself.

Test plan

  • Read .claude/PR-X12-docs-audit.md end-to-end.
  • Spot-check 2–3 "verified-real" anchors against the cited files/commits.
  • Spot-check 2–3 "hallucinated" rows and confirm the claim has no source.
  • Decide which tier-1 (drop) rows to action in a follow-up PR.

https://claude.ai/code/session_01UHfLwSzGfVNzkNrTEBijuA


Generated by Claude Code

Summary by CodeRabbit

  • Documentation
    • Updated internal documentation materials.

Adds .claude/PR-X12-docs-audit.md with full audit table (26 rows),
verified-real anchors, architectural truths, doc-by-doc concentration
ranking, three-tier cleanup recommendations, and scope boundaries.
Recommendations only — no edits to the audited PR-X12 docs themselves.

coderabbitai Bot commented May 22, 2026

Walkthrough

This PR introduces a new audit document that systematically evaluates the PR-X12 docs cluster against source artifacts, identifies hallucinations and misattributions, aggregates findings by severity and document, and provides tiered remediation recommendations.

Changes

PR-X12 Docs Audit

| Layer / File(s) | Summary |
| --- | --- |
| Audit scope and verified baselines (`.claude/PR-X12-docs-audit.md`) | Establishes audit timestamp, methodology (whole-file reads against source), scope (PR-X12 docs under .claude/knowledge/), and lists verified-real artifacts as anchors and canon. |
| Findings and severity analysis (`.claude/PR-X12-docs-audit.md`) | Defines architectural truths, presents a findings table enumerating doc claims that are wrong/inverted/fabricated/misattributed, and aggregates hallucination concentration by specific docs. |
| Tiered remediation recommendations (`.claude/PR-X12-docs-audit.md`) | Structures fixes into four tiers: full rewrites/quarantine, targeted corrections with line ranges, keep-with-caveats annotations, and cluster-level citation pruning with usage policy rules. |
| Audit boundaries and conclusion (`.claude/PR-X12-docs-audit.md`) | Documents explicit items not verified in this pass (source files, numeric probes, external states) and concludes with a load-bearing summary of primary contradictions around kernel symbols, src/simd_soa.rs, and calibration claims. |

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Possibly related PRs

  • AdaWorldAPI/ndarray#198: Corrects and updates the same PR-X12 canonical docs that this audit explicitly targets, including kernel-symbol claims and R-14/R-15 content.

Poem

🐰 A hop through the docs, what a mess!
Hallucinations high, the audit says "yes—
Lance-graph, blasgraph, missing symbols too,
Here's what's wrong and what to do."
Tier by tier, the repairs align!

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title clearly and specifically describes the main change: adding an audit document for hallucinated claims in PR-X12 documentation, which directly matches the changeset's single file addition. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |
| Linked Issues check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |
| Out of Scope Changes check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b7eb589be3

| `src/simd_amx.rs` | ndarray | not whole-read in this pass (per task: "verified real, doc comments inside are sound") | Real AMX inline-asm + Linux prctl(ARCH_REQ_XCOMP_PERM, XFEATURE_XTILEDATA). |
| `.cargo/config.toml` | ndarray | 27 lines | Pins `target-cpu=x86-64-v3` (AVX2 baseline). AVX-512 (v4) is opt-in via separate `config-avx512.toml`. Comment lines 21-24 explicitly name "Runtime LazyLock dispatch" as a fifth supported mode. |
| `CLAUDE.md` "Hard Rules" + W1a consumer contract at `.claude/knowledge/vertical-simd-consumer-contract.md` | ndarray | per task statement: "real" | Authoritative. |
| `crates/lance-graph/src/graph/blasgraph/` | lance-graph | mod.rs (104) + semiring.rs (492) whole-read; 18 other files not whole-read | GraphBLAS-style sparse matrix algebra over 16384-bit binary `BitVec` carriers. Exports `Semiring`, `HdrSemiring` (7 variants: XorBundle, BindFirst, HammingMin, SimilarityMax, Resonance, Boolean, XorField), `GrBMatrix`, `GrBVector`, sparse-storage types. Bit-exact binary-Hamming substrate. Matches user's architectural truth #3. |

P1: Remove unverifiable lance-graph file assertions

This audit marks crates/lance-graph/... files as "verified-real" and "whole-read," but this repository does not contain a crates/lance-graph tree (repo-wide file search returns no such paths, and git cat-file cannot resolve the cited files at this commit). Because several downstream conclusions depend on these anchors, presenting them as verified in this repo makes the audit non-reproducible and can misdirect follow-up edits; these entries should be re-scoped to verifiable in-repo sources or explicitly linked to an external checked-out repository/commit.
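The reproducibility concern can be illustrated with a small check that splits cited anchors into those that resolve under a checkout and those that do not. This is a minimal sketch, not the method the review used: it swaps in a plain filesystem-existence test for the repo-wide file search and `git cat-file` lookup described above, and the function name `partition_anchors` is hypothetical.

```rust
use std::path::Path;

/// Splits cited anchor paths into (resolvable, missing) relative to a
/// checkout root. Hypothetical helper: it only tests that a file or
/// directory exists on disk, which is weaker than resolving the path at a
/// specific commit.
fn partition_anchors<'a>(repo_root: &Path, cited: &[&'a str]) -> (Vec<&'a str>, Vec<&'a str>) {
    cited
        .iter()
        .copied()
        .partition(|rel| repo_root.join(rel).exists())
}
```

Any anchor that lands in the second vector would need to be re-scoped or tied to an explicitly named external repository and commit before the audit can call it verified.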

@coderabbitai coderabbitai Bot left a comment

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In @.claude/PR-X12-docs-audit.md:
- Line 57: Update the BLAS-L2 method count to match the listed methods: change
the reported "8 BLAS-L2 methods" to "9 BLAS-L2 methods" in the audit text
referencing blas_level2.rs (ensure both occurrences around the current Line 57
and Line 101 are updated) so the count aligns with the methods listed
(gemv/ger/symv/trmv/trsv/syr/syr2/gbmv/sbmv).
- Line 63: Replace the instance of the word "followup" in the line containing
"Treat these as load-bearing canon for this audit and any followup:" with the
hyphenated form "follow-up" to match the document's preferred style; search for
the exact phrase "Treat these as load-bearing canon for this audit and any
followup:" and update it to "Treat these as load-bearing canon for this audit
and any follow-up:".

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro Plus

Run ID: c335f1cc-8047-4b25-b0b9-cbd9f5fb7768

📥 Commits

Reviewing files that changed from the base of the PR and between 6ebdac5 and b7eb589.

📒 Files selected for processing (1)
  • .claude/PR-X12-docs-audit.md

| `crates/bgz-tensor/src/lib.rs` | lance-graph | 122 lines | Real exports include `Codebook4096`/`CodebookIndex` (line 108), `AttentionSemiring`/`AttentionTable`/`ComposeTable` (105), `HhtlDEntry`/`HhtlDMeta`/`HhtlDTensor` (112), `Base17`/`Base17Fz` (115). |
| `src/hpc/cam_pq.rs` | ndarray | 847 lines | `CamCodebook` struct (line 67) — 6-subspace × 256-centroid CAM-PQ scheme; `encode`/`decode`/`precompute_distances`/`distance_batch` methods; `train_geometric` (373), `train_semantic` (408), `train_hybrid` (454); exposed `kmeans` (541) + `squared_l2` (473) for downstream consumers. PackedDatabase stroke cascade (HEEL/HEEL+BRANCH/full) for 99% rejection before full ADC. |
| `src/hpc/codec/` | ndarray | listed only | A1/A2/A3-intra codec foundation: `ctu.rs`, `mode.rs`, `predict.rs`, `mod.rs`. No `ndarray-codec` crate yet. |
| `src/hpc/{dn_tree.rs, merkle_tree.rs, blas_level2.rs, bf16_tile_gemm.rs}` | ndarray | blas_level2.rs whole-read (487 lines) | blas_level2 surfaces 8 BLAS-L2 methods (gemv/ger/symv/trmv/trsv/syr/syr2/gbmv/sbmv). **No `batched_ssd_search` symbol.** Other files exist; not whole-read. |

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Fix BLAS-L2 method count mismatch.

On Line 57 and Line 101, the text says “8 BLAS-L2 methods,” but the listed methods are 9 (gemv/ger/symv/trmv/trsv/syr/syr2/gbmv/sbmv). Please align the count with the list to avoid weakening audit credibility.

Also applies to: 101-101
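
The mismatch can be reproduced mechanically. A minimal sketch, assuming the slash-separated list is copied verbatim from the audit row; the helper name `count_listed_methods` and the constant `LISTED` are hypothetical and not part of the repository:

```rust
/// The method list exactly as quoted in the audit text under review.
const LISTED: &str = "gemv/ger/symv/trmv/trsv/syr/syr2/gbmv/sbmv";

/// Counts the non-empty entries in a slash-separated method list.
/// Hypothetical helper for illustration only.
fn count_listed_methods(list: &str) -> usize {
    list.split('/')
        .filter(|name| !name.trim().is_empty())
        .count()
}
```

`count_listed_methods(LISTED)` evaluates to 9, which is why the stated count of 8 cannot be reconciled with the list.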


## 2. Architectural truths (user-established ground)

Treat these as load-bearing canon for this audit and any followup:

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Use “follow-up” for consistency.

Line 63 uses “followup”; prefer “follow-up” in this doc’s style.

🧰 Tools
🪛 LanguageTool

[grammar] ~63-~63: Ensure spelling is correct
Context: ...ad-bearing canon for this audit and any followup: 1. The PR-X12 "R-*" series is **future-cond...

(QB_NEW_EN_ORTHOGRAPHY_ERROR_IDS_1)


@AdaWorldAPI AdaWorldAPI merged commit f054bc7 into master May 25, 2026
18 checks passed
AdaWorldAPI pushed a commit that referenced this pull request May 25, 2026
…idge.rs

The atom16 / precision-ladder / two-axes docs annotated BGZ17 as
"offset ≈17..27, stride {2,4}" — figures from a chat description that do
NOT appear in the source. First-party read of src/hpc/bgz17_bridge.rs:
BASE_DIM=17, GOLDEN_STEP=11 (coprime → table (i*11) mod 17 visits all 17
positions, Quintenzirkel-like phase), Base17=[i16;17], built by lossy
golden-step octave averaging of a 16384-bit plane.
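
The coprimality claim can be checked directly. A minimal sketch, assuming only the two constants quoted above (BASE_DIM=17, GOLDEN_STEP=11); the function names are hypothetical and are not taken from src/hpc/bgz17_bridge.rs:

```rust
const BASE_DIM: usize = 17;
const GOLDEN_STEP: usize = 11;

/// Builds the table (i * GOLDEN_STEP) mod BASE_DIM for i in 0..BASE_DIM.
fn golden_step_table() -> [usize; BASE_DIM] {
    let mut table = [0usize; BASE_DIM];
    for (i, slot) in table.iter_mut().enumerate() {
        *slot = (i * GOLDEN_STEP) % BASE_DIM;
    }
    table
}

/// Returns true when every position in 0..BASE_DIM appears exactly once,
/// i.e. the table is a permutation of the positions.
fn visits_all_positions(table: &[usize; BASE_DIM]) -> bool {
    let mut seen = [false; BASE_DIM];
    for &pos in table {
        if seen[pos] {
            return false;
        }
        seen[pos] = true;
    }
    true
}
```

Because 11 and 17 share no common factor, `visits_all_positions(&golden_step_table())` holds; a step that shared a factor with the dimension would revisit positions and fail the check.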

The atom's two 2-bit fields are now labelled plan-inspired placeholders,
not grounded BGZ17 parameters. Removes a fabricated constant from a pushed
branch per the PR #200 audit discipline (source files are evidence;
chat/plan descriptions are not).

Note: this module remains an unmerged sketch — flat-disk manifold (the
ASG/KHR correction makes the directional lane spherical) and built from
the LeafPlanetarium inspiration doc. Superseded by the forthcoming
ASG-leaf canon entry; not for merge as-is.

https://claude.ai/code/session_01HbqooFZHAjaUtFEzhA1R2u
AdaWorldAPI pushed a commit that referenced this pull request May 25, 2026
… truth

PR #1 of the 7-PR cross-session program. Maps the ~62 ChatGPT-seeded
.claude/plans/ docs (committed 2026-05-24/25) against evidence: real
source files, ratified standards (KHR_gaussian_splatting / OGC 3D Tiles
2.0 / Cesium HLOD / ArcGIS ESRI_crs), and the merged #200 audit
(PR-X12-docs-audit.md).

NOT AUTHORITATIVE by construction — it ranks inspiration against ground
truth and may not be cited as evidence (PR #200 Tier-4 model).

Includes:
- evidence model (evidence vs inspiration) per #200 Tier-4
- correction list (blasgraph canonical / bgz17 lossy sibling;
  ScalarCsr::spmv_min_plus is the real symbol; no batched_ssd_search;
  sigker PDE not buggy; per-arch crossovers uncalibrated; R-1..R-15
  forward-conditional; directional lane is spherical not Poincaré)
- supersedes-note recording all three sessions' in-thread errors
  (incl. this session's R-7/sigker/SSD assertions) with their fixes
- 5-bucket inventory (USE-NOW / EPIPHANY / GROUND-FIRST / ARCHIVE /
  OUT-OF-SCOPE) with hop + cross-verified/single-session + grounds-onto
- PR-X12-tensor-capstone downgraded to idea-USE-NOW / doc-RE-GROUND
  (carries #200 #1-#16 debt)
- C/HLOD headline concept (Cesium HLOD shipping + certified overlay)
- the forward 7-PR program + parking lot
- honest scope boundary for the 2 plans not whole-read this pass

No source changes. Pure .claude/knowledge addition.
AdaWorldAPI pushed a commit that referenced this pull request May 25, 2026
…; cite #200 §1

Addresses Codex review on PR #201: src/hpc/pillar/{ewa_sandwich_2d,
ewa_sandwich_3d,splat_invariants}.rs were absent from this branch's tree
because the branch was cut from a pre-pillar/ master point. The files are
real on master and independently confirmed by the merged #200 audit §1.
Merged origin/master in (non-destructive) so every cited anchor now
resolves on the branch, and added a note pointing to #200 §1 as the
branch-independent witness.
AdaWorldAPI pushed a commit that referenced this pull request May 25, 2026
…ot-checked)

Answers 'what about hallucinated reviews in #200?' #200 is a Claude-authored
doc, so it doesn't get terminal authority just because the plans were demoted.
Re-tiered the evidence model: L0 source/tests/standards (ground truth) ·
L1 verifiable claims-about-source (#200 + this triage, must be spot-checked) ·
L2 inspiration (plans/perspective-docs, not evidence).

Spot-checked #200's five load-bearing rows first-party against source this
session (lance-graph via MCP, ndarray local): spmv_min_plus-not-tropical_spmv,
sigker-PDE-not-buggy, no batched_ssd_search, no tropical_gemm, blasgraph-bit-
exact/bgz17-lossy — ALL CONFIRMED. Only cosmetic defects in #200 (off-by-one
line, 8-vs-9 method count). Flagged #200's own section-6 un-verified items as
the residual trust boundary, not to be elevated past L1.
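
The re-tiered model can be sketched as a small type. This is illustrative only: the enum, its variant names, and the `citable` method are hypothetical and do not exist in the repository, and the rule that an L1 claim becomes usable once spot-checked is an assumed reading of "must be spot-checked". Only the three tier descriptions come from the commit message above.

```rust
/// Evidence tiers as described in the commit message (names hypothetical).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum EvidenceTier {
    /// L0: source files, tests, ratified standards (ground truth).
    L0GroundTruth,
    /// L1: verifiable claims about source, such as the #200 audit.
    L1ClaimAboutSource,
    /// L2: inspiration (plans, perspective docs); not evidence.
    L2Inspiration,
}

impl EvidenceTier {
    /// Assumed policy: L0 is always citable, L1 only after a first-party
    /// spot-check against L0, and L2 never.
    fn citable(self, spot_checked: bool) -> bool {
        match self {
            EvidenceTier::L0GroundTruth => true,
            EvidenceTier::L1ClaimAboutSource => spot_checked,
            EvidenceTier::L2Inspiration => false,
        }
    }
}
```

Under this reading, the #200 audit itself sits at `L1ClaimAboutSource`, so its rows count as evidence only where a spot-check against source has been done.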
AdaWorldAPI pushed a commit that referenced this pull request May 25, 2026
- CORRECT mis-grade: 2026-05-06-splat-osint-ingestion — SPLAT-1 types are
  SHIPPED in lance-graph-contract::splat (CamPlaneSplat/SplatPlaneSet/
  CamSplatCertificate/AwarenessPlane16K/witness_to_splat, verified
  first-party), not markdown-only. PR-7 ASG-leaf must EXTEND, not reinvent.
- De-provisionalize the two rows both sessions whole-read:
  3DGS-render-depth-certification (grounded spec for PR-6 ErrorCertificate)
  and 3DGS-certified-field-kernel-substrate (added to inventory, KEEP-EPIPHANY,
  self-fences 'no abstraction until 2 consumers').
- CV ◑→✓ for 3DGS-SIMD-forward-renderer, 3DGS-error-certification-pillars,
  3DGS-EWA-SYRK-BLAS-MKL (now whole-read by >=2 sessions).
- EWA-SYRK: flag actionable backend half may DOWNGRADE after PR-3 measures
  (likely 3x3 pessimization).
- Scope boundary: provisional flags cleared; residual #200 §6 items held at L1.
AdaWorldAPI pushed a commit that referenced this pull request May 26, 2026
…idge.rs

The atom16 / precision-ladder / two-axes docs annotated BGZ17 as
"offset ≈17..27, stride {2,4}" — figures from a chat description that do
NOT appear in the source. First-party read of src/hpc/bgz17_bridge.rs:
BASE_DIM=17, GOLDEN_STEP=11 (coprime → table (i*11) mod 17 visits all 17
positions, Quintenzirkel-like phase), Base17=[i16;17], built by lossy
golden-step octave averaging of a 16384-bit plane.

The atom's two 2-bit fields are now labelled plan-inspired placeholders,
not grounded BGZ17 parameters. Removes a fabricated constant from a pushed
branch per the PR #200 audit discipline (source files are evidence;
chat/plan descriptions are not).

Note: this module remains an unmerged sketch — flat-disk manifold (the
ASG/KHR correction makes the directional lane spherical) and built from
the LeafPlanetarium inspiration doc. Superseded by the forthcoming
ASG-leaf canon entry; not for merge as-is.

https://claude.ai/code/session_01HbqooFZHAjaUtFEzhA1R2u