Docbot Development Checklist

Single source of truth for development progress, file ownership, and task tracking. Check items off as you complete them. If swapping roles mid-sprint, the incoming dev reads the checked/unchecked state to know exactly where things stand.

File Ownership Map

Every source file has exactly one owner. No file is touched by more than one developer during a given phase. Dependencies between developers are managed through interface contracts defined upfront.

Current ownership (Phase 3, updated for planned package reorganization)

| Current file | Planned location (after 3C reorg) | Owner |
| --- | --- | --- |
| `src/docbot/cli.py` | `cli.py` (stays at top) | Dev A |
| `src/docbot/models.py` | `models.py` (stays at top) | Dev A |
| `src/docbot/llm.py` | `llm.py` (stays at top) | Dev A |
| `src/docbot/__init__.py` | `__init__.py` (stays at top) | Dev A |
| `pyproject.toml` | (root) | Dev A |
| `src/docbot/project.py` | `git/project.py` | Dev A |
| `src/docbot/scanner.py` | `pipeline/scanner.py` | Dev B |
| `src/docbot/orchestrator.py` | `pipeline/orchestrator.py` | Dev B |
| `src/docbot/git_utils.py` | `git/utils.py` | Dev B |
| `src/docbot/hooks.py` | `git/hooks.py` | Dev B |
| `src/docbot/extractors/*` | `extractors/*` (already a package) | Dev B |
| `src/docbot/explorer.py` | `pipeline/explorer.py` | Dev B |
| `src/docbot/search.py` | `web/search.py` | Dev B |
| `src/docbot/planner.py` | `pipeline/planner.py` | Dev C |
| `src/docbot/reducer.py` | `pipeline/reducer.py` | Dev C |
| `src/docbot/renderer.py` | `pipeline/renderer.py` | Dev C |
| `src/docbot/tracker.py` | `pipeline/tracker.py` | Dev C |
| `src/docbot/server.py` | `web/server.py` | Dev C |
| `src/docbot/viz_server.py` | `viz/viz_server.py` | Dev C |
| `src/docbot/_viz_html.py` | `viz/_viz_html.py` | Dev C |
| `src/docbot/mock_viz.py` | `viz/mock_viz.py` | Dev C |
| `webapp/*` | `webapp/*` | Dev D |
| `tests/*` | `tests/*` | Dev B |

New files (planned):

| Planned file | Owner | Phase |
| --- | --- | --- |
| `src/docbot/git/history.py` | Dev B | 3D |
| `src/docbot/git/diff.py` | Dev B | 3E |

Agent exploration files (LangGraph refactor):

| File | Owner | Status |
| --- | --- | --- |
| `src/docbot/exploration/__init__.py` | Dev B | Complete |
| `src/docbot/exploration/graph.py` | Dev B | Complete |
| `src/docbot/exploration/tools.py` | Dev B | Complete |
| `src/docbot/exploration/store.py` | Dev B | Complete |
| `src/docbot/exploration/prompts.py` | Dev B | Complete |
| `src/docbot/exploration/callbacks.py` | Dev B | Complete |
| `webapp/src/features/exploration/AgentExplorer.tsx` | Dev D | Complete |
| `webapp/src/features/exploration/AgentDetail.tsx` | Dev D | Complete |
| `webapp/src/features/exploration/NotepadViewer.tsx` | Dev D | Complete |
| `webapp/src/features/exploration/useAgentStream.ts` | Dev D | Complete |
| `webapp/src/features/exploration/types.ts` | Dev D | Complete |
| `docs/AGENT_ARCHITECTURE.md` | Dev B | Complete |
| `docs/MIGRATION_NOTES.md` | Dev B | Complete |
| `tests/test_exploration_graph.py` | Dev B | Complete |

Phase 1: Multi-Language Support [COMPLETE]

All items complete. Tree-sitter + LLM fallback extraction implemented across Python, TypeScript, JavaScript, Go, Rust, Java, Kotlin, C#, Swift, Ruby. Scanner generalized, explorer refactored, planner/reducer/renderer prompts updated for dynamic language info, CLI/orchestrator wired.

Phase 1 checklist (all items checked):

Phase 0: Interface Contracts -- Dev A

Add SourceFile model, FileExtraction model
Update ScanResult, ScopeResult, DocsIndex with language fields
Define Extractor protocol, review and merge

Dev A -- Core Infrastructure

Scanner generalization (LANGUAGE_EXTENSIONS, entrypoint/package detection, SKIP_DIRS)
LLM client review, pyproject.toml deps, exports update
Webapp server skeleton (FastAPI with /api/index, /api/scopes, /api/graph, /api/search, /api/files, /api/fs)

Dev B -- Extraction Engine

Extractors package (base.py, python_extractor.py, treesitter_extractor.py, llm_extractor.py)
Explorer refactor (remove AST code, use get_extractor())
Semantic search (SearchIndex class)

Dev C -- Pipeline & Presentation

Planner updates (crosscutting patterns, dynamic language prompts)
Reducer updates (generalized edge computation, dynamic language prompts)
Renderer updates (dynamic language prompts/templates)

Dev D -- Frontend Experience

React SPA scaffold (Vite + React + TypeScript + Tailwind)
Interactive system graph (ReactFlow), chat panel, code viewer, documentation browser

Phase 2: Interactive Webapp [COMPLETE]

All items complete. FastAPI backend serves analyzed data + AI chat. React frontend with interactive graph, chat panel, code viewer, guided tours, documentation browser. docbot serve launches the full experience.

Phase 2 checklist (all items checked):

Dev A -- Integration Wiring

Orchestrator adapted to source_files, languages pass-through
Server completion (source endpoint, search, chat, tours)
CLI updates (help text, serve subcommand, --no-llm behavior)

Dev B -- Extended Coverage

Additional tree-sitter grammars (Kotlin, C#, Swift, Ruby)
Test suite (test_python_extractor, test_treesitter_extractor, test_llm_extractor, test_scanner, test_explorer)

Dev C -- Webapp Backend

Serve static files from webapp/dist/

Dev D -- Webapp Integration

Switch from mocks to real API endpoints
End-to-end testing, polish loading states
Legacy viz integration decision (marked as legacy)

Phase 3: Git-Integrated CLI

Goal: Transform docbot from a standalone doc generator into a git-aware CLI tool with persistent .docbot/ project directory, incremental updates based on git diffs, documentation history with snapshots, before/after comparison, git lifecycle hooks, and a change-aware webapp.

Design decisions:
- CWD is the default target (optional path override)
- Only config.toml is git-tracked
- init and generate are separate commands
- Git hooks are opt-in via docbot hook install
- History keeps the last N snapshots (configurable, default 10)
- Updates run through both explicit docbot update and optional hooks (post-commit and post-merge)
- Change-aware chat works via context injection into the default /api/chat, plus a dedicated /api/changes endpoint

3A: Foundation (CLI + Project + Git Basics) [COMPLETE]

Owner: Dev A (CLI, models, project), Dev B (git_utils, hooks, scanner)

Models (`src/docbot/models.py`) -- Dev A

Project Module (`src/docbot/project.py`) -- Dev A

CLI Restructure (`src/docbot/cli.py`) -- Dev A

init [path] command
generate [path] command (calls run_async currently; will call generate_async after 3B)
update command (stub -- falls back to full generate; will call update_async after 3B)
status command (shows last commit, changed files, affected scopes)
config [key] [value] command (view all / get one / set one)
hook install / hook uninstall subcommands
serve [path] adapted to default to .docbot/ via find_docbot_root()
run kept as hidden alias for generate

Git Utilities (`src/docbot/git_utils.py`) -- Dev B

get_current_commit(repo_root) -- git rev-parse HEAD
get_changed_files(repo_root, since_commit) -- git diff --name-only
is_commit_reachable(repo_root, commit) -- git cat-file -t
get_repo_root(start) -- git rev-parse --show-toplevel
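
The four helpers above are thin wrappers around git commands, so a minimal sketch may help an incoming dev see the intended shape. This is not the actual `git_utils.py`: the `_git` helper and the injectable `run` parameter are assumptions added here so the functions can be exercised without a real repository. The exact arguments passed to `git diff` (here, `since_commit` against the working tree) are also a guess.

```python
import subprocess
from pathlib import Path
from typing import Callable, Optional


def _git(repo_root: Path, *args: str) -> Optional[str]:
    """Hypothetical helper: run git in repo_root, return stripped stdout or None on failure."""
    try:
        proc = subprocess.run(
            ["git", *args], cwd=repo_root, capture_output=True, text=True, check=False
        )
    except (FileNotFoundError, NotADirectoryError):
        return None  # git not installed, or repo_root is not a directory
    return proc.stdout.strip() if proc.returncode == 0 else None


# `run` is an assumed seam for testing; the checklist signatures do not include it.
Runner = Callable[..., Optional[str]]


def get_current_commit(repo_root: Path, run: Runner = _git) -> Optional[str]:
    return run(repo_root, "rev-parse", "HEAD")


def get_changed_files(repo_root: Path, since_commit: str, run: Runner = _git) -> list[str]:
    out = run(repo_root, "diff", "--name-only", since_commit)
    return [line for line in (out or "").splitlines() if line]


def is_commit_reachable(repo_root: Path, commit: str, run: Runner = _git) -> bool:
    # `git cat-file -t <sha>` prints the object type; "commit" means the object exists.
    return run(repo_root, "cat-file", "-t", commit) == "commit"


def get_repo_root(start: Path, run: Runner = _git) -> Optional[Path]:
    out = run(start, "rev-parse", "--show-toplevel")
    return Path(out) if out else None
```

Returning `None` or an empty list on failure, rather than raising, keeps callers such as the `status` command simple. The real module may handle errors differently.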

Git Hooks (`src/docbot/hooks.py`) -- Dev B

install_hook(repo_root) -- post-commit hook with sentinel comments
uninstall_hook(repo_root) -- remove docbot section, delete if empty
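
The sentinel-comment pattern described above can be sketched as follows. The sentinel strings, the hook command, and the extra `name` parameter are all hypothetical. Only the behavior (append a marked section, remove it later, delete the file if nothing else remains) comes from the checklist.

```python
import stat
from pathlib import Path

BEGIN = "# >>> docbot hook >>>"       # hypothetical sentinel text
END = "# <<< docbot hook <<<"         # hypothetical sentinel text
HOOK_BODY = "docbot update || true"   # assumed command run by the hook


def install_hook(repo_root: Path, name: str = "post-commit") -> Path:
    """Append a sentinel-wrapped docbot section to the hook file (idempotent)."""
    hook = repo_root / ".git" / "hooks" / name
    hook.parent.mkdir(parents=True, exist_ok=True)
    existing = hook.read_text() if hook.exists() else "#!/bin/sh\n"
    if BEGIN in existing:
        return hook  # already installed, leave the file untouched
    if not existing.endswith("\n"):
        existing += "\n"
    hook.write_text(existing + f"{BEGIN}\n{HOOK_BODY}\n{END}\n")
    # Hooks must be executable for git to run them.
    hook.chmod(hook.stat().st_mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)
    return hook


def uninstall_hook(repo_root: Path, name: str = "post-commit") -> None:
    """Remove the docbot section; delete the file if only a shebang is left."""
    hook = repo_root / ".git" / "hooks" / name
    if not hook.exists():
        return
    kept, inside = [], False
    for line in hook.read_text().splitlines():
        if line.strip() == BEGIN:
            inside = True
        elif line.strip() == END:
            inside = False
        elif not inside:
            kept.append(line)
    meaningful = [l for l in kept if l.strip() and not l.startswith("#!")]
    if meaningful:
        hook.write_text("\n".join(kept) + "\n")
    else:
        hook.unlink()
```

Wrapping the section in sentinels lets docbot coexist with hook content the user already had, which is presumably why the checklist calls for it.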

Scanner Update (`src/docbot/scanner.py`) -- Dev B

Add ".docbot" to SKIP_DIRS

3B: Incremental Pipeline

Owner: Dev B (orchestrator, git integration), Dev C (renderer refactor)

Depends on: 3A (complete)

Orchestrator Refactor (`src/docbot/orchestrator.py`) -- Dev B

Renderer Refactor (`src/docbot/renderer.py`) -- Dev C

Extract render_scope_doc(scope, index, out_dir, llm_client) -- single scope markdown
Extract render_readme(index, out_dir, llm_client) -- README.generated.md
Extract render_architecture(index, out_dir, llm_client) -- architecture.generated.md
Extract render_api_reference(index, out_dir) -- api.generated.md (template-only)
Extract render_html_report(index, out_dir) -- index.html
Refactor render() and render_with_llm() to call individual functions (no behavior change)

CLI Update -- Dev A

Update generate command to call generate_async() instead of run_async()
Update update command to call update_async() instead of falling back to generate

Verification

run_async() still works identically after refactor (backward compat)
generate_async() produces same output as run_async() but writes to .docbot/
generate_async() writes correct state.json with commit hash and scope_file_map
update_async() only re-explores affected scopes
update_async() falls back to generate when state is invalid
Individual render functions work standalone
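
The two `update_async()` checks above boil down to one decision: regenerate everything, or re-explore a subset of scopes. A rough sketch of that decision, assuming `state.json` has been loaded into a dict with a `commit` key (the key name is a guess) and the `scope_file_map` named above; `plan_update` itself is a hypothetical helper, not a function from the codebase:

```python
from typing import Optional


def plan_update(state: Optional[dict], changed_files: list[str]) -> tuple[str, set[str]]:
    """Return ("generate", set()) when state is unusable, else ("update", affected scope ids).

    What counts as "invalid" here (missing state or missing keys) is an assumption;
    the real check may also verify that the stored commit is still reachable.
    """
    if not state or "commit" not in state or "scope_file_map" not in state:
        return ("generate", set())
    changed = set(changed_files)
    affected = {
        scope_id
        for scope_id, files in state["scope_file_map"].items()
        if changed & set(files)  # any changed file belongs to this scope
    }
    return ("update", affected)
```

Newly added files that belong to no existing scope are ignored in this sketch. How the real pipeline assigns them to scopes is not specified here.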

3C: Src Reorganization

Owner: All devs (coordinated, touching only owned files)

Depends on: 3B (complete)

Move from 20 flat files in src/docbot/ to organized packages.

Create package structure

Update imports

Update all internal imports across the codebase
Update cli.py imports to use new package paths
Update pyproject.toml entry points if needed
Verify no import errors across the package

3D: Documentation Snapshots & History

Owner: Dev A (models), Dev B (history management)

Depends on: 3B (complete -- needs generate_async/update_async to hook into)

Models (`src/docbot/models.py`) -- Dev A

Add DocSnapshot model:
- commit_hash: str -- git commit at snapshot time
- run_id: str and timestamp: str
- scope_summaries: dict[str, ScopeSummary] -- scope_id -> { file_count, symbol_count, summary_hash }
- graph_digest: str -- hash of dependency graph edges
- doc_hashes: dict[str, str] -- doc filename -> content hash
- stats: SnapshotStats -- total files, scopes, symbols, edges
Add max_snapshots: int = 10 field to DocbotConfig
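
For reference, the fields listed above could be laid out roughly as below. This sketch uses standard-library dataclasses only to stay self-contained; the real `models.py` may use a different base class. The field names inside `SnapshotStats` and all default values are assumptions.

```python
from dataclasses import asdict, dataclass, field


@dataclass
class ScopeSummary:
    file_count: int
    symbol_count: int
    summary_hash: str


@dataclass
class SnapshotStats:
    # Field names are assumed from "total files, scopes, symbols, edges".
    total_files: int = 0
    total_scopes: int = 0
    total_symbols: int = 0
    total_edges: int = 0


@dataclass
class DocSnapshot:
    commit_hash: str                       # git commit at snapshot time
    run_id: str
    timestamp: str
    scope_summaries: dict[str, ScopeSummary] = field(default_factory=dict)
    graph_digest: str = ""                 # hash of dependency graph edges
    doc_hashes: dict[str, str] = field(default_factory=dict)  # filename -> content hash
    stats: SnapshotStats = field(default_factory=SnapshotStats)


@dataclass
class DocbotConfig:
    max_snapshots: int = 10  # other config fields omitted in this sketch


snapshot = DocSnapshot(commit_hash="abc123", run_id="run-1", timestamp="2024-01-01T00:00:00Z")
snapshot.scope_summaries["core"] = ScopeSummary(file_count=2, symbol_count=5, summary_hash="h")
as_json_ready = asdict(snapshot)  # nested dataclasses become plain dicts
```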

Snapshot Management (`src/docbot/git/history.py`) -- Dev B

save_snapshot(docbot_dir, docs_index, scope_results, run_id, commit) -- create DocSnapshot + save scope results
load_snapshot(docbot_dir, run_id) -- load a specific snapshot
list_snapshots(docbot_dir) -- list available snapshots with metadata
prune_snapshots(docbot_dir, max_count) -- remove oldest beyond limit
Snapshot storage: .docbot/history/<run_id>.json (metadata) + .docbot/history/<run_id>/ (scope results)
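
Given the storage layout above, listing and pruning can be sketched as follows. The sketch assumes each metadata file carries `run_id` and `timestamp` keys and that timestamps sort chronologically as strings (for example ISO 8601). Returning the removed run IDs is a convenience added here, not something the checklist specifies.

```python
import json
import shutil
from pathlib import Path


def list_snapshots(docbot_dir: Path) -> list[dict]:
    """Load every <run_id>.json under history/, oldest first."""
    history = docbot_dir / "history"
    metas = [json.loads(p.read_text()) for p in history.glob("*.json")]
    return sorted(metas, key=lambda m: m["timestamp"])


def prune_snapshots(docbot_dir: Path, max_count: int) -> list[str]:
    """Delete the oldest snapshots beyond max_count; return the removed run IDs."""
    history = docbot_dir / "history"
    snapshots = list_snapshots(docbot_dir)
    excess = snapshots[: max(0, len(snapshots) - max_count)]
    removed = []
    for meta in excess:
        run_id = meta["run_id"]
        (history / f"{run_id}.json").unlink(missing_ok=True)   # metadata file
        shutil.rmtree(history / run_id, ignore_errors=True)    # scope results directory
        removed.append(run_id)
    return removed
```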

Pipeline Integration -- Dev B

Hook save_snapshot() into generate_async() after state save
Hook save_snapshot() into update_async() after state save
Call prune_snapshots() after each save

3E: Before/After Comparison (`docbot diff`)

Owner: Dev A (CLI, models), Dev B (diff logic)

Depends on: 3D (complete -- needs snapshots to compare)

Models (`src/docbot/models.py`) -- Dev A

Diff Logic (`src/docbot/git/diff.py`) -- Dev B

compute_diff(snapshot_from, snapshot_to) -> DiffReport
Compare scope lists (added/removed/modified)
Per modified scope: compare file lists, symbol lists, doc hashes
Compare graph edges (added/removed)
Compute stats deltas
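
A minimal sketch of the scope-level comparison and stats deltas, working on snapshots loaded as plain dicts. The `DiffReport` shape shown here is hypothetical, since its fields are not defined in this checklist. Graph-edge and per-file comparison are left out because they would need the stored scope results, not just the snapshot metadata.

```python
from dataclasses import dataclass, field


@dataclass
class DiffReport:  # hypothetical shape, for illustration only
    added_scopes: list[str] = field(default_factory=list)
    removed_scopes: list[str] = field(default_factory=list)
    modified_scopes: list[str] = field(default_factory=list)
    stats_delta: dict[str, int] = field(default_factory=dict)


def compute_diff(snapshot_from: dict, snapshot_to: dict) -> DiffReport:
    old = snapshot_from["scope_summaries"]
    new = snapshot_to["scope_summaries"]
    report = DiffReport(
        added_scopes=sorted(new.keys() - old.keys()),
        removed_scopes=sorted(old.keys() - new.keys()),
        # A scope counts as modified when its summary hash changed.
        modified_scopes=sorted(
            sid for sid in old.keys() & new.keys()
            if old[sid]["summary_hash"] != new[sid]["summary_hash"]
        ),
    )
    old_stats = snapshot_from.get("stats", {})
    new_stats = snapshot_to.get("stats", {})
    report.stats_delta = {
        key: new_stats.get(key, 0) - old_stats.get(key, 0)
        for key in sorted(old_stats.keys() | new_stats.keys())
    }
    return report
```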

CLI Command -- Dev A

Add docbot diff [--from <commit-or-run>] [--to <commit-or-run>] command
Defaults: --from = previous snapshot, --to = current state
Output: human-readable summary of what changed

3F: Git Lifecycle Hooks

Owner: Dev B (hooks expansion), Dev A (CLI flags)

Depends on: 3B (complete -- needs working update_async)

Expand Hook Support (`src/docbot/hooks.py`) -- Dev B

Add install_post_merge_hook(repo_root) -- same pattern as post-commit
Update install_hook() to install both post-commit and post-merge by default
Add --commit-only flag to install only post-commit
Update uninstall_hook() to remove from both hook files

CLI Updates -- Dev A

Update docbot hook install to accept --commit-only flag
Update help text to describe post-merge behavior

Verification

docbot hook install creates both post-commit and post-merge hooks
docbot hook install --commit-only creates only post-commit
docbot hook uninstall removes all docbot hooks
git pull with post-merge hook triggers docbot update
(Verified via manual code review and hook installation test)

3G: Change-Aware Webapp

Owner: Dev C (server endpoints), Dev D (frontend UI)

Depends on: 3D (snapshots), 3E (diff)

API Endpoints (`src/docbot/server.py`) -- Dev C

GET /api/changes -- returns DiffReport between current and previous snapshot
GET /api/changes?from=<run_id>&to=<run_id> -- compare specific snapshots
GET /api/history -- list available snapshots with metadata
GET /api/history/<run_id> -- specific snapshot detail
Update POST /api/chat system prompt to inject recent DiffReport when available

Webapp UI (`webapp/`) -- Dev D

Changes banner -- summary banner when changes exist since last view
Architecture graph diff view -- overlay showing added (green), removed (red), modified (yellow) nodes/edges
Scope diff panel -- side-by-side or inline diff of scope documentation
Timeline view -- visual timeline of snapshots, click to compare any two
Chat change context -- suggested questions update when changes detected

Verification

/api/changes returns correct DiffReport
/api/history lists all snapshots
Changes banner appears in webapp after an update
Graph highlights changed nodes/edges
Chat can answer "what changed?" questions with accurate references

3H: Pipeline Visualization Replay

Owner: Dev C (tracker, viz_server, viz HTML), Dev B (orchestrator save), Dev A (CLI command)

Depends on: 3B (needs generate_async/update_async to save events), 3D (events stored alongside snapshots)

Event Recording (`src/docbot/tracker.py`) -- Dev C

Add _events: list[dict] and _start_time: float to PipelineTracker
Record "add" event on every add_node() call
Record "state" event on every set_state() call
Implement export_events() -> {"run_id": ..., "total_duration": ..., "events": [...]}
Add no-op export_events() to NoOpTracker
Add set_run_id() method to both tracker classes
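
The recording items above might look roughly like this. The `add_node()` and `set_state()` signatures, the event field names (`type`, `t`, `node_id`, ...), and the use of a monotonic clock are assumptions. Only `_events`, `_start_time`, `set_run_id()`, and the `export_events()` return shape come from the checklist.

```python
import time
from typing import Optional


class PipelineTracker:
    def __init__(self) -> None:
        self._events: list[dict] = []
        self._start_time: float = time.monotonic()
        self._run_id: Optional[str] = None

    def set_run_id(self, run_id: str) -> None:
        self._run_id = run_id

    def _record(self, kind: str, **payload) -> None:
        # Store time relative to the start so replay can use a virtual clock.
        self._events.append(
            {"type": kind, "t": time.monotonic() - self._start_time, **payload}
        )

    def add_node(self, node_id: str, name: str, parent_id: Optional[str] = None) -> None:
        self._record("add", node_id=node_id, name=name, parent_id=parent_id)

    def set_state(self, node_id: str, state: str) -> None:
        self._record("state", node_id=node_id, state=state)

    def export_events(self) -> dict:
        total = self._events[-1]["t"] if self._events else 0.0
        return {"run_id": self._run_id, "total_duration": total, "events": list(self._events)}


class NoOpTracker:
    """Same interface, records nothing."""

    def set_run_id(self, run_id: str) -> None:
        pass

    def add_node(self, node_id: str, name: str, parent_id: Optional[str] = None) -> None:
        pass

    def set_state(self, node_id: str, state: str) -> None:
        pass

    def export_events(self) -> dict:
        return {"run_id": None, "total_duration": 0.0, "events": []}
```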

Save Events to Disk (`src/docbot/orchestrator.py`) -- Dev B

Call tracker.export_events() at end of generate_async() and update_async()
Write to .docbot/history/<run_id>/pipeline_events.json

Replay Server (`src/docbot/viz_server.py`) -- Dev C

Implement start_replay_server(events_path):
- GET / serves replay HTML
- GET /events serves recorded event log as JSON
- Auto-open browser
- Blocking server with Ctrl+C shutdown

Replay UI (`src/docbot/_viz_html.py`) -- Dev C

Create REPLAY_HTML constant (~430 lines)
JavaScript event player: virtual clock, applies events up to current time
Play / Pause control
Speed selector (1x, 2x, 4x, 8x)
Timeline scrubber (click to seek)
Step forward / back (one event at a time)
Elapsed time display (current position / total duration)
Same D3 radial tree rendering as live mode
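
The player itself is JavaScript, but the core idea (a virtual clock that applies every event up to the current time) is small enough to sketch in Python for anyone verifying replay behavior. The event field names and the initial `"pending"` state are hypothetical, and the sketch assumes events are sorted by time.

```python
def state_at(events: list[dict], t: float) -> dict[str, dict]:
    """Rebuild the node table as it looked at virtual time t (seconds since run start)."""
    nodes: dict[str, dict] = {}
    for event in events:
        if event["t"] > t:
            break  # events are assumed sorted, so nothing later applies
        if event["type"] == "add":
            nodes[event["node_id"]] = {"name": event["name"], "state": "pending"}
        elif event["type"] == "state" and event["node_id"] in nodes:
            nodes[event["node_id"]]["state"] = event["state"]
    return nodes
```

Seeking with the scrubber and stepping one event at a time both reduce to calling this with a different `t`, so backward steps need no undo logic in this model.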

CLI Command (`src/docbot/cli.py`) -- Dev A

Add docbot replay [run_id] command
Default to most recent run if no run_id given
Start replay server + open browser
Port configuration via --port flag
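
One possible shape for the command, sketched with `argparse`. Whether `cli.py` actually uses `argparse`, the default port of 8000, and the rule for picking the "most recent" run (newest `pipeline_events.json` by modification time) are all assumptions made for illustration.

```python
import argparse
from pathlib import Path
from typing import Optional


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="docbot")
    sub = parser.add_subparsers(dest="command", required=True)
    replay = sub.add_parser("replay", help="Replay a recorded pipeline run")
    replay.add_argument("run_id", nargs="?", default=None,
                        help="Run to replay (defaults to the most recent)")
    replay.add_argument("--port", type=int, default=8000)  # default value is a guess
    return parser


def resolve_events_path(docbot_dir: Path, run_id: Optional[str]) -> Optional[Path]:
    """Find pipeline_events.json for run_id, or for the newest run when run_id is None."""
    history = docbot_dir / "history"
    if run_id is not None:
        path = history / run_id / "pipeline_events.json"
        return path if path.is_file() else None
    candidates = list(history.glob("*/pipeline_events.json"))
    return max(candidates, key=lambda p: p.stat().st_mtime, default=None)
```

The resolved path would then be handed to `start_replay_server(events_path)`.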

Verification

Live pipeline run saves pipeline_events.json to history
docbot replay opens replay of most recent run
docbot replay <run_id> replays a specific past run
Playback controls (play/pause/speed/scrub/step) work correctly
Replay visualization matches what the live view showed during the original run
NoOpTracker.export_events() returns empty data without errors
All 99 unit tests passing

End-to-End Verification

After all Phase 3 sections complete:

Role Swap Guide

If a developer needs to take over another's work mid-sprint:

Read their checklist above -- checked items are done, unchecked items remain
Check out their branch -- all their work-in-progress is there
Only touch their owned files -- the file ownership table above is the source of truth
Update this checklist as you complete items
