v1.33.2.0 fix: parser kind field, PhaseKind cleanup, critical-exit lock leaks by anbangr · Pull Request #1444 · garrytan/gstack

anbangr · 2026-05-12T02:20:00Z

Summary

This PR ships parser fixes, PhaseKind cleanup, and critical-exit reliability improvements for the build orchestrator.

Parser & types

parser.ts — emit kind: p.kind ?? \"code\" on parsed phases (restores backward compat)
types.ts — narrow PhaseKind to "code"; make kind optional for strict-mode fixtures

CLI cleanup

Remove switch (phase.kind) branches for writing/experiment/research/manual in buildKindInstructions
Remove kind-conditional review-rubric line in buildCodexReviewBody
Remove exit-code 13 override for --skip-ship; origin-verified features now pause naturally

Reliability

Persist critical_exit_pending sentinel to state instead of raw process.exit(3), letting the finally block release the lock

Tests

integration.test.ts — add --skip-ship resume + origin-verification tests and critical_exit lock-release test
parser.test.ts — add kind-annotation parsing tests and ReferenceError-guard tests

Build skill

build/SKILL.md.tmpl — bump frontmatter version to 1.22.0

Repo hygiene

.gitignore — add .llm-tmp/

Test Coverage

All new code paths have test coverage.
Tests: 808 pass, 0 fail (build/orchestrator/tests/)

Pre-Landing Review

Fixed duplicate kind property merge artifact in parser.ts
Fixed test expectations for --skip-ship exit code (0, not 13)
Fixed duplicate describe block in parser.test.ts from merge

TODOS

No TODO items completed in this PR.

Test plan

All build/orchestrator tests pass (808 runs, 0 failures)

🤖 Generated with Claude Code

^{Need help on this PR? Tag @codesmith with what you need.}

Let Codesmith autofix CI failures and bot reviews

# Conflicts: # README.md

- Adds implement/SKILL.md.tmpl to execute plans in phases - Updates GSTACK_PLAYBOOK.md to include the new workflow

…g loop

…oices

…tructions

… execution

…cations

…loops

…te dispatch

…s review and ship, add implement reexamine mode

… and sonnet for review/qa

…erative fix, and deployment

… of at the end

…subagent loop

… review instead of /review

Remove the `--skip-sweep` flag and the unshipped feat/* sweep bullet from the Startup Gates section and flags table. Aligns with the code removal in 3e2b8b2. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

- Adds mock configure.cm file to prevent jq from failing in Step M3.5 mock

…11-074503-fabe4c3f-4-e2e-test-touchfile-registration'

1. plan-selection (6 tests): `defaultActiveRunRegistryDir()` hardcoded `~/.gstack/build-state/active-runs` and ignored `GSTACK_BUILD_STATE_DIR`, causing 11 real active-run records to leak into unit tests and inflate candidate counts (turning expected "selected" into "ambiguous"). Fix: honour the env var consistently, the same way `state.ts` already does. 2. integration (3 tests): plan review subprocess called `codex` with `OPENAI_API_KEY` from the inherited `process.env`, triggering a real ~30s API call against the LLM. These tests exercise feature lifecycle, not plan review. Fix: add `--no-plan-review` to each CLI invocation. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…estSpec detection Four improvements identified during code review of 3e2b8b2: - Move `extractCoverageTarget` from cli.ts to sub-agents.ts (alongside parseCoveragePercent); re-export via import in cli.ts. Eliminates the circular-import risk when phase-runner.ts calls coverage functions. - Fix decimal truncation in extractCoverageTarget: `(\d+)` only matched integers, silently returning 80 for targets like ≥90.5%. Changed to `([\d.]+)` + parseFloat. - Fix `hasTestSpec` detection in buildGeminiTestSpecPrompt: was `phase.body.includes("#### Test Spec")` (fragile string match, false negative when body text differs). Now `phase.testSpecCheckboxLine !== -1` (parser already computes this — zero extra overhead). - Wire coverage gate in RUN_TESTS handler: after GREEN tests pass and the phase has a test spec (`testSpecCheckboxLine !== -1`), call parseCoveragePercent(result.stdout, testCmd) and compare against extractCoverageTarget(phase.body). Below target → set coverageResult and route to test_fix_running. Unknown framework → log advisory, proceed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Complete the coverage gate: `injectCoverageFlags(testCmd)` appends the appropriate flag for the detected framework before the GREEN test run, so `parseCoveragePercent` reliably finds coverage data in stdout even when projects don't pre-configure coverage in their test script. Framework → flag mapping: jest → --coverage --coverageReporters text vitest → --coverage bun test → --coverage pytest → --cov --cov-report term-missing go test → -cover unknown → unchanged (advisory log, gate skips) Injection is idempotent (no-op if flag already present) and only fires when the phase has a test spec (testSpecCheckboxLine !== -1) — VERIFY_RED and legacy phases use the bare test command unchanged. 11 unit tests added covering each framework, idempotency, and unknowns. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

`phase.kind !== "code" ? "" : ""` always evaluated to "" regardless of the condition, and was silently filtered by .filter(Boolean). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…d, document sweep removal

…p (Bug D1) Two failing tests document the bug: 1. After CRITICAL verdict, state.planReview must be persisted with status "critical_exit_pending" — currently cli.ts does not persist anything before process.exit(3), so planReview stays undefined on disk. 2. On resume with the sentinel set, the plan-review gate must still fire — the current guard (!state.planReview) is false when planReview is truthy, so the gate is skipped after the sentinel is introduced. Two GREEN tests confirm baseline behavior: APPROVE verdict suppresses the gate; undefined planReview (first run) fires the gate. Tests MUST fail until Feature 4 implementation lands. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Before this fix, a CRITICAL plan-review verdict caused process.exit(3) without saving any sentinel to state. On resume, !state.planReview was true → review ran again → CRITICAL again → infinite loop. Fix: 1. Save state.planReview = { ...verdict, status: "critical_exit_pending" } before releaseLock + process.exit(3) so the sentinel survives on disk. 2. Widen the plan-review gate guard from !state.planReview to !state.planReview || state.planReview.status === "critical_exit_pending" so the gate re-fires on resume when the sentinel is present. Tests: two new tests in phase-runner.test.ts cover both the sentinel persistence and the widened gate; 90/90 passing. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…g D2) Introduces ExitError (errors.ts) — thrown instead of process.exit(N) inside try/finally blocks so the finally clause runs cleanup before the process terminates. Changes: - errors.ts: new ExitError class (instanceof Error, numeric code field) - cli.ts: import ExitError; replace critical_exit process.exit(3) with throw new ExitError(3); update main().catch to call process.exit(err.code) when err instanceof ExitError - phase-runner.test.ts: 5 new tests (ExitError shape, propagation through finally, default and custom messages); 95/95 passing Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ature 6) applyResult() now populates phaseState.coverageResult when: - action is RUN_TESTS - tests are GREEN (status = "tests_green") - extra.phaseBody is provided - parseCoveragePercent() returns a non-null value for the stdout Coverage below target emits an advisory warning but keeps status "tests_green" — not blocking. The target defaults to 80 when no "**Coverage target: ≥N%**" line appears in the phase body. 6 new tests in phase-runner.test.ts; 101/101 passing. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ics + test assertions - Add errors.ts to MODULE_TEST_OWNERS in coverage-matrix.test.ts - Fix analytics logActivity to emit "success" for exit code 13 (FINALIZATION_REQUIRED), which is a success state (pending ship), not a failure - Fix integration test assertions: --skip-ship correctly exits 13, not 0, when features reach origin_verified (pre-existing test/impl mismatch)

…d [Phase 1.1] RED phase TDD: 11 tests fail because the parser does not yet stamp kind: "code" on emitted phases, and existing Phase literal construction sites have no kind field (undefined fails the VALID_KINDS.includes runtime assertion). 11 tests pass immediately: direct Phase construction with explicit kind values, and PhaseKind union membership checks (both already exist in types.ts). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

… loop

Add required kind: PhaseKind field to the parser factory init and to every Phase literal construction site in tests/fixtures. This ensures backward-compatible default of kind: "code" for all existing phases while the type system enforces correctness going forward. - parser.ts: stamp kind: "code" on every emitted Phase - state.test.ts, cli.test.ts, phase-runner.test.ts, feature-review.test.ts, cli-guardrails.test.ts, phase-kind.test.ts: add kind: "code" to all helpers and inline literals

…tations - Fix PHASE_HEADING regex to allow optional [kind] bracket between number and colon - Add BODY_KIND_PATTERN for  HTML comment fallback - Add IMPL_LABELS_BY_KIND and REVIEW_LABELS_BY_KIND maps for all 5 PhaseKind values - Parser now stamps kind from heading bracket (primary), body comment (fallback), or defaults to "code" - Inline kind-comment detection ensures kind is set before checkbox processing - Add implCheckboxRe/reviewCheckboxRe for kind-specific checkbox matching - Add 16 new parser tests covering all bracket annotations, HTML fallback, checkbox recognition Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

- Add IMPL_MARKER_BY_KIND and REVIEW_MARKER_BY_KIND lookup tables - Update flipPhaseCheckboxes signature to accept optional kind?: PhaseKind - Derives implMarker/reviewMarker from kind ?? "code" (backward compat) - Update reconcilePhaseCheckboxes to pass phase.kind - Update both cli.ts call sites (lines ~3870, ~4282) to pass kind: phase.kind - Add 9 kind-aware mutator tests covering all 5 kinds + error cases + backward compat Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…EW gates, ship gate

- Fix critical-exit path skipping active-run registry cleanup and telemetry (cli.ts) - Remove dead phase kind branches from build instructions (cli.ts) - Hardcode phase kind to 'code' in parser to clarify intent (parser.ts) - Collapse PhaseKind union to a single literal to match parser reality (types.ts) - Add test coverage for critical_exit releasing the lock and exiting 3 (integration.test.ts) - Remove exitCode 13 override for --skip-ship to fix failing integration tests (cli.ts)

- Fix integration test mock codex script sed pattern to correctly extract output file path (was capturing trailing sentence period) - Remove stale contradictory comment block about exit code 13 in --skip-ship path - Remove dead exitCode === 13 branch in active-run registry update - Remove dead conditional for non-code phase rubric in buildCodexReviewBody All build/orchestrator tests pass (330 tests).

…e describe block

…-storm-20260511-122548-fabe4c3f-3-commit-housekeeping-parser-fix-readme-regenera # Conflicts: # CHANGELOG.md # test/gen-skill-docs.test.ts # test/helpers/touchfiles.ts

anbangr added 30 commits April 22, 2026 19:18

Add architecture-focused planning review skills

190e6c4

docs: add GStack Playbook for workflow guidance and skill reference

6638051

Merge remote-tracking branch 'upstream/main'

2ad9e73

# Conflicts: # README.md

Merge origin/main into main

946e9f5

feat: add /implement autonomous coding skill

d3b148b

- Adds implement/SKILL.md.tmpl to execute plans in phases - Updates GSTACK_PLAYBOOK.md to include the new workflow

feat(implement): add model routing discipline for gemini and sonnet

7b6bc1b

feat(implement): add living implementation plan synthesis and checkin…

073eee2

…g loop

feat(implement): add feature branching and auto-deploy

c16834b

feat(implement): add opus and codex consensus for ambiguous review ch…

6ed7d95

…oices

feat(implement): process entire plan and use proper plan naming

7039ec0

feat(implement): add verbose state narration and autonomous continuity

b15bdf2

fix(implement): enforce automatic deploy skill invocation without asking

43300a9

feat(implement): use sub-agent delegation to prevent context compaction

87040dc

feat(implement): add iterative github ci/cd checking to sub-agent ins…

318504f

…tructions

fix(implement): explicit bash tool instruction for ship skill invocation

1bacade

feat(implement): mandate autonomous execution of skills via bash tool

4d6a8a2

feat(implement): run both ship and land-and-deploy sequentially

f3c6208

feat(implement): explicitly mandate sonnet model for autonomous skill…

f517f2c

… execution

feat(implement): explicitly set sonnet model for sub-agent skill invo…

5e9df85

…cations

feat(implement): mandate /review and /qa skills during sub-agent phases

193dfa6

feat(implement): mandate agents to fix issues found during QA/review …

e1e051b

…loops

feat(implement): mandate bash tool for autonomous opus and codex deba…

2462748

…te dispatch

feat: replace AskUserQuestion with autonomous Opus/Codex debate acros…

72fc1f7

…s review and ship, add implement reexamine mode

revert(skills): restore AskUserQuestion to review and ship skills

5a0dd78

feat(implement): sync execution status back to original autoplan file

4b524b1

feat(implement): strictly enforce gemini for phase execution via bash…

bb5b1ee

… and sonnet for review/qa

feat(implement): spawn dedicated sonnet subagent for final review, it…

86d7a05

…erative fix, and deployment

feat(implement): execute continuous deployment loop per phase instead…

9b4f9fc

… of at the end

feat(implement): replace sonnet with codex for review and deployment …

d689127

…subagent loop

fix(implement): restore sonnet subagent but instruct it to use /codex…

2a09300

… review instead of /review

anbangr and others added 30 commits May 11, 2026 13:24

docs(build): remove startup sweep from README startup gates

1d79ecd

Remove the `--skip-sweep` flag and the unshipped feat/* sweep bullet from the Startup Gates section and flags table. Aligns with the code removal in 3e2b8b2. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

test(e2e): complete build fault investigator test structure

779d79f

- Adds mock configure.cm file to prevent jq from failing in Step M3.5 mock

qa(e2e): fix HOME isolation and report path in fault investigator test

523d7f8

Merge branch 'feat/gstack-gstack-now-i-want-the-virtual-minsky-202605…

4070c04

…11-074503-fabe4c3f-4-e2e-test-touchfile-registration'

chore: bump test phase timeout to 900000ms (suite grew past 5min budget)

412ade4

fix(review): remove dead-code noop in buildCodexReviewBody

4b385a4

`phase.kind !== "code" ? "" : ""` always evaluated to "" regardless of the condition, and was silently filtered by .filter(Boolean). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

chore(build): fix parser kind field, bump v1.22.0, regen host SKILL.m…

97e4569

…d, document sweep removal

fix(test): add build/orchestrator/__tests__/ to bun test path for TDD…

e093b14

… loop

feat(cli): Phase 1.4 — buildKindInstructions for kind-specific prompts

0b5388b

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

chore: regenerate SKILL.md files after Phase 1.2-1.5 template updates

f752e7e

feat(templates): Phase 1.5 — non-coding phase templates, CONTENT_REVI…

8542048

…EW gates, ship gate

merge: resolve origin/main conflicts

ddd1598

test: fix --skip-ship exit code expectations and parser test duplicat…

c094352

…e describe block

fix(parser): remove duplicate kind property from merge

bb12ac4

chore: bump version and changelog (v1.33.2.0)

5f013a0

Merge remote-tracking branch 'github/main' into feat/gstack-delegated…

e0d0192

…-storm-20260511-122548-fabe4c3f-3-commit-housekeeping-parser-fix-readme-regenera # Conflicts: # CHANGELOG.md # test/gen-skill-docs.test.ts # test/helpers/touchfiles.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v1.33.2.0 fix: parser kind field, PhaseKind cleanup, critical-exit lock leaks#1444

v1.33.2.0 fix: parser kind field, PhaseKind cleanup, critical-exit lock leaks#1444
anbangr wants to merge 214 commits into
garrytan:mainfrom
anbangr:feat/gstack-delegated-storm-20260511-122548-fabe4c3f-3-commit-housekeeping-parser-fix-readme-regenera

anbangr commented May 12, 2026 •

edited by blacksmith-sh Bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

anbangr commented May 12, 2026 • edited by blacksmith-sh Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test Coverage

Pre-Landing Review

TODOS

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

anbangr commented May 12, 2026 •

edited by blacksmith-sh Bot

Loading