[audit-workflows] Daily Agentic Workflow Audit — 2026-04-12 #25938
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by Agentic Workflow Audit Agent. A newer discussion is available at Discussion #26090. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Automated daily audit covering 85 workflow runs from April 12, 2026 (16:00–21:04 UTC). Overall health is good with an 88.1% success rate (37 successes, 5 failures, 39 skipped). Token usage and cost rose significantly day-over-day: 26.5M effective tokens and $15.73 today vs. 8.5M tokens / $8.44 yesterday, driven by more active PR/discussion triggers and the
Qworkflow hitting a 10M-token run.Summary
❌ Failed Workflows (5)
Notable: The
Qworkflow consumed 10.1M tokens in 103 turns on a single discussion_comment trigger — this is an outlier and likely indicates runaway exploration or an unusually complex query. TheDaily Documentation Updaterfailed at thesafe_outputsstage despite 162 turns, suggesting it produced output but the write step failed.agenticworkflows audit— permission denied when verifying repository access. This tool is legitimately needed for its DIFC gateway analysis task.📊 Top Token Consumers
Qaccounts for 50% of all token usage across 5 runs, including one catastrophic 10M-token run. This warrants investigation.🔍 Agentic Assessment Patterns (100 assessments across 85 runs)
View high-severity assessments (15 runs flagged)
Workflows flagged as
resource_heavy_for_domainat HIGH severity:Model downgrade recommendations (7 workflows)
These workflows may not need frontier models:
claude-haiku-4-5orgpt-4.1-miniPoor agentic control signals (6 workflows)
Workflows showing broad/weakly controlled exploration with zero friction:
Recommendation: tighten instructions, reduce tool breadth, delay write actions until stronger evidence.
💡 Recommendations
Auto-Triage Issues,Daily Workflow Updater,Daily Documentation Updater,PR Triage Agent, andDesign Decision Gatecould use smaller models (haiku/mini) to reduce cost.overkill_for_agenticflags — Many slash-command workflows (Issue Monster, Resource Summarizer, /cloclo, Plan Command, Archie, Scout) are running agentic inference for tasks that could be handled deterministically. Consider converting to pre-agent steps.📈 Trend Charts
Charts uploaded as workflow run artifacts.
Workflow Health (Apr 11–12): Success count tripled day-over-day (14→37) as workflow triggers became more active. The failure count also rose (1→5) but is proportionally in line with the volume increase. Skipped runs stayed roughly constant at ~40.
Token & Cost (Apr 11–12): Effective token usage increased 3× (8.5M → 26.5M) and cost nearly doubled ($8.44 → $15.73), primarily driven by the
Qworkflow and several heavy research/audit workflows running for the first time today.References:
Beta Was this translation helpful? Give feedback.
All reactions