Snapshot record-then-replay testing #740

Merged: mateusz834 merged 1 commit into develop from dev/MP/snapshots on Apr 27, 2026

mateusz834 (Member) commented Apr 23, 2026

This change implements snapshot (record-then-replay) testing for all integration tests.
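For readers unfamiliar with the pattern, here is a minimal sketch of record-then-replay testing. It is not the code from tests/ai_testlib.py: `SnapshotReplayer`, `call_model`, and the JSON snapshot layout are all hypothetical names chosen for illustration. In record mode the real callable is invoked and its response is written to disk; in replay mode the stored response is returned and the real callable is never touched.

```python
import hashlib
import json
from pathlib import Path
from typing import Callable


class SnapshotReplayer:
    """Hypothetical helper: record responses once, then replay them from disk."""

    def __init__(self, path: Path, call_model: Callable[[str], str], record: bool) -> None:
        self._path = path
        self._call_model = call_model
        self._record = record
        # Load previously recorded responses, if a snapshot file exists.
        self._data: dict[str, str] = (
            json.loads(path.read_text()) if path.exists() else {}
        )

    @staticmethod
    def _key(prompt: str) -> str:
        # Key each snapshot entry by a stable hash of the request.
        return hashlib.sha256(prompt.encode("utf-8")).hexdigest()

    def __call__(self, prompt: str) -> str:
        key = self._key(prompt)
        if self._record:
            # Record mode: call the real model and persist the response.
            response = self._call_model(prompt)
            self._data[key] = response
            self._path.write_text(json.dumps(self._data, indent=2, sort_keys=True))
            return response
        # Replay mode: never call the model; fail loudly on a missing entry.
        if key not in self._data:
            raise LookupError(f"no snapshot recorded for prompt: {prompt!r}")
        return self._data[key]
```

Failing on a missing entry in replay mode is a deliberate choice in this sketch: a test that silently falls back to the live model would reintroduce the flakiness that snapshots are meant to remove.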

Re-enables recently skipped (flaky) tests:

  • test_agent_understands_other_agents (snapshot was edited manually)
  • test_supervisor_resumes_subagent_thread_across_invocations
  • test_supervisor_resumes_subagent_thread_across_invocations_structured

Introduces a deterministic mock thread_id generator, so that the recorded snapshots are reproducible, for:

  • test_supervisor_resumes_subagent_thread_across_invocations
  • test_supervisor_resumes_subagent_thread_across_invocations_structured
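A rough sketch of why a deterministic thread_id matters for snapshots: if the id is random, every recorded request differs and can never match a stored snapshot. The example below assumes the id is minted with `uuid.uuid4()`, which may not be how this repository does it; `new_thread_id`, `deterministic_uuids`, and `first_two_ids_under_mock` are hypothetical names.

```python
import itertools
import uuid
from unittest import mock


def new_thread_id() -> str:
    """Stand-in for code under test that mints a random thread id (hypothetical)."""
    return str(uuid.uuid4())


def deterministic_uuids():
    """Yield UUIDs with integer values 1, 2, 3, ... in a fixed order."""
    for n in itertools.count(1):
        yield uuid.UUID(int=n)


def first_two_ids_under_mock() -> list[str]:
    # While patched, every uuid.uuid4() call takes the next value from the generator.
    with mock.patch("uuid.uuid4", side_effect=deterministic_uuids()):
        return [new_thread_id(), new_thread_id()]
```

Because the sequence restarts for each patch context, two runs of the same test produce identical ids and therefore identical snapshot contents.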

Modified test_tool_execution_service_access to use tool middleware, which makes the test deterministic.
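One way a tool middleware can make such a test deterministic is to assert on the intercepted tool call itself rather than on the model's free-text answer. The sketch below is generic and does not use this project's middleware API: `RecordingToolMiddleware`, `ToolHandler`, and the `(name, args)` handler signature are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Callable

# Hypothetical handler shape: (tool name, tool arguments) -> tool result.
ToolHandler = Callable[[str, dict[str, Any]], Any]


@dataclass
class RecordingToolMiddleware:
    """Hypothetical middleware that records every tool call passing through it."""

    calls: list[tuple[str, dict[str, Any], Any]] = field(default_factory=list)

    def wrap(self, handler: ToolHandler) -> ToolHandler:
        def wrapped(name: str, args: dict[str, Any]) -> Any:
            # Run the real tool, then remember what was called and what it returned.
            result = handler(name, args)
            self.calls.append((name, args, result))
            return result

        return wrapped
```

A test can then check `middleware.calls` directly, which does not depend on how the model phrases its final reply.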

E2E tests still call the LLMs directly.

@mateusz834 mateusz834 force-pushed the dev/MP/snapshots branch 16 times, most recently from 1198d47 to d029c1d Compare April 24, 2026 06:45
@mateusz834 mateusz834 marked this pull request as ready for review April 24, 2026 06:45
@mateusz834 mateusz834 requested review from Ickerday and szykol and removed request for szykol April 24, 2026 07:16
Comment thread tests/ai_testlib.py
Comment thread tests/integration/ai/test_agent.py
Comment thread tests/ai_testlib.py
splunk-dtaborski (Contributor) left a comment

I really like the solution.

Comment thread tests/integration/ai/test_agent.py Outdated
Comment thread tests/integration/ai/test_agent_mcp_tools.py
Comment thread tests/ai_testlib.py Outdated
Comment thread tests/ai_testlib.py
Comment thread tests/ai_testlib.py Outdated
Comment thread tests/ai_testlib.py
@mateusz834 mateusz834 force-pushed the dev/MP/snapshots branch 2 times, most recently from 32aadd8 to 6aa74cd Compare April 27, 2026 08:15
This change implements snapshot (record-then-replay) testing
for all integration tests.

Re-enables recently skipped (flaky) tests:

- test_agent_understands_other_agents (snapshot was edited manually)
- test_supervisor_resumes_subagent_thread_across_invocations
- test_supervisor_resumes_subagent_thread_across_invocations_structured

Introduces a deterministic thread_id mock generator, such that
snapshots are deterministic, for:

- test_supervisor_resumes_subagent_thread_across_invocations
- test_supervisor_resumes_subagent_thread_across_invocations_structured

Modified test_tool_execution_service_access using tool middleware, to make
the test deterministic.
@mateusz834 mateusz834 merged commit 1984965 into develop Apr 27, 2026
4 of 6 checks passed
@mateusz834 mateusz834 deleted the dev/MP/snapshots branch April 27, 2026 13:14
3 participants