feat: automation script with MAB modes and context passing #4

yash-yv-verma · 2026-02-12T23:02:38Z

automation_runner.py: run test cases with multiple models, optional feedback (stationary/non-stationary MAB)
main.py: allStepsAtOnceWithoutKnowledgeAgent, return debug/verification for context, handle dict knowledge_response
agents.py: model/temperature override, tool_call_limit, timeout 180s, generic verification prompt
metrics_db.py: temperature column, attempt_context table for storing/retrieving context between attempts
.gitignore: logs, debug_logs, db, cache, env, docs

- automation_runner.py: run test cases with multiple models, optional feedback (stationary/non-stationary MAB) - main.py: allStepsAtOnceWithoutKnowledgeAgent, return debug/verification for context, handle dict knowledge_response - agents.py: model/temperature override, tool_call_limit, timeout 180s, generic verification prompt - metrics_db.py: temperature column, attempt_context table for storing/retrieving context between attempts - .gitignore: logs, debug_logs, db, cache, env, docs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: automation script with MAB modes and context passing #4

feat: automation script with MAB modes and context passing #4

Uh oh!

yash-yv-verma commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: automation script with MAB modes and context passing #4

Are you sure you want to change the base?

feat: automation script with MAB modes and context passing #4

Uh oh!

Conversation

yash-yv-verma commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant