Add edge-predicate regression check framework #43

Open
maximusunc wants to merge 1 commit into main from claude/validate-edge-predicates-t9QXE
@maximusunc
Collaborator

Introduces test_harness/regression_checks/, a pluggable framework for TRAPI-response regression checks. The checks run alongside the existing acceptance/pathfinder pass-fail analysis and do not override genuine failures. The first check, EdgePredicateMatchCheck, verifies that each returned knowledge-graph predicate is TRAPI-compatible with the query graph, meaning it is either the queried predicate or a biolink descendant of it. This catches cases where the right output CURIE is reached via a wrong-shaped edge.
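
For illustration, the compatibility rule described above (queried predicate or a descendant of it) can be sketched as follows. This is a minimal sketch and not the code in this PR: `TOY_CHILDREN` is a hand-written stand-in for the biolink hierarchy that the real check obtains from biolink-toolkit, `descendants` and `mismatched_edges` are hypothetical names, and the sketch assumes the caller has already selected the knowledge-graph edges bound to the query edge.

```python
from typing import Dict, List, Set

# Toy stand-in for the predicate hierarchy (parent -> direct children).
# Illustrative only; the real hierarchy comes from the biolink model.
TOY_CHILDREN: Dict[str, List[str]] = {
    "biolink:related_to": [
        "biolink:affects",
        "biolink:treats_or_applied_or_studied_to_treat",
    ],
    "biolink:treats_or_applied_or_studied_to_treat": ["biolink:treats"],
}


def descendants(predicate: str, children: Dict[str, List[str]]) -> Set[str]:
    """Return the predicate itself plus every transitive child."""
    seen = {predicate}
    stack = [predicate]
    while stack:
        for child in children.get(stack.pop(), []):
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen


def mismatched_edges(
    qedge_predicates: List[str],
    kg_edges: Dict[str, dict],
    children: Dict[str, List[str]],
) -> List[str]:
    """Return ids of KG edges whose predicate is neither a queried
    predicate nor a descendant of one."""
    allowed: Set[str] = set()
    for predicate in qedge_predicates:
        allowed |= descendants(predicate, children)
    return [
        edge_id
        for edge_id, edge in kg_edges.items()
        if edge.get("predicate") not in allowed
    ]
```

With this toy hierarchy, a query for `biolink:treats_or_applied_or_studied_to_treat` accepts an edge with `biolink:treats` but flags one with `biolink:affects`, even if both edges lead to the expected output CURIE.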

Mismatches surface as a new AgentStatus.REGRESSION status so that reports can distinguish a "wrong answer" from a "wrong-shaped answer". If biolink-toolkit fails to initialize, the check degrades to SKIPPED rather than crashing the run.
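
One way the status handling described above could look is sketched below. Only `AgentStatus.REGRESSION` and the SKIPPED outcome are named in this PR; the other enum members, the `CheckUnavailable` exception, and `apply_regression_checks` are assumptions made for the example, as is the choice to record SKIPPED per check while leaving the overall status unchanged.

```python
from enum import Enum
from typing import Callable, Dict, List, Tuple


class AgentStatus(Enum):
    # Only REGRESSION and SKIPPED are named in the PR; the rest are assumed.
    PASSED = "PASSED"
    FAILED = "FAILED"
    REGRESSION = "REGRESSION"
    SKIPPED = "SKIPPED"


class CheckUnavailable(Exception):
    """Hypothetical signal that a check could not initialize its dependencies."""


# A check takes a response and returns a list of mismatch descriptions.
Check = Callable[[dict], List[str]]


def apply_regression_checks(
    base_status: AgentStatus,
    response: dict,
    checks: Dict[str, Check],
) -> Tuple[AgentStatus, Dict[str, AgentStatus]]:
    """Run every check and return (final status, per-check outcomes)."""
    final = base_status
    outcomes: Dict[str, AgentStatus] = {}
    for name, check in checks.items():
        try:
            mismatches = check(response)
        except CheckUnavailable:
            # Degrade to SKIPPED instead of crashing the run.
            outcomes[name] = AgentStatus.SKIPPED
            continue
        if mismatches:
            outcomes[name] = AgentStatus.REGRESSION
            # Only a passing result is downgraded; a genuine failure stays FAILED.
            if final is AgentStatus.PASSED:
                final = AgentStatus.REGRESSION
        else:
            outcomes[name] = AgentStatus.PASSED
    return final, outcomes
```

In this sketch a FAILED result is never rewritten to REGRESSION, which mirrors the stated goal of not overriding genuine failures.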

https://claude.ai/code/session_012pDrWMNSK3p4cjJ2yp1FjF