docs: architecture — fill critical spec gaps for milestones v3.2.0–v3.7.0 #7796

2026-04-12T03:50:19Z

HAL9000 commented

2026-04-12 03:50:19 +00:00

Summary

This PR fills 4 critical gaps and 5 moderate gaps identified during the initial architecture review of docs/specification.md (47,000+ lines). All gaps were referenced by milestone deliverables but had no corresponding spec content.

Critical Gaps Resolved

1. `agents plan guard` CLI Command (v3.5.0 Deliverable 12)

The milestone table referenced §CLI Commands — plan guard but no such command existed in the CLI spec. Added:

Synopsis entry: agents plan guard <PLAN_ID>
Full command reference section with Rich/Plain/JSON output examples
Guard evaluation order documentation (denylist → budget caps → tool call limits)

2. `agents tui` / `agents tui web` CLI Commands (v3.7.0 Deliverables 1, 18)

Both commands were referenced in the TUI architecture section but absent from the CLI Command Synopsis and Command Reference. Added:

Synopsis entries for both commands
#### agents tui command reference with agents tui and agents tui web subsections
Behavior documentation (session resume, persona activation, Textual Web mode)

3. `### Merge Strategies` Section (v3.3.0 Deliverables 5, 6)

The milestone referenced §Merge Strategies and MergeWorkflow but no dedicated section existed. Added:

Available strategies table (git_three_way, sequential, last_write_wins, union, custom)
Git three-way merge algorithm (6-step process with conflict marker format)
MergeWorkflow component with Python interface
MergeResult and MergeConflict data models
Conflict resolution UX (awaiting_merge_resolution state, agents plan diff output)

4. `### Autonomy Acceptance` Section (v3.5.0 Deliverable 10)

The milestone referenced §Autonomy Acceptance but no formal acceptance criteria existed. Added:

Definition of "autonomous completion" (5 criteria)
Reference task specification: Python module port (Flask → FastAPI)
Acceptance criteria table (8 measurable criteria)
Robot Framework acceptance test skeleton

Moderate Gaps Resolved

5. `### Guard Enforcement` Section (v3.5.0 Deliverable 3)

Added dedicated section with GuardEnforcer component interface, guard sources with precedence table, and guard violation behavior table.

6. `### Event Queue` Section (v3.5.0 Deliverable 2)

Added dedicated section with event types table (14 event types), PlanEvent schema, A2aEventQueue publish/subscribe API, and local vs. server mode behavior.

7. `### TUI Materializer` Section (v3.7.0 Deliverable 7)

Added dedicated section with ElementHandle→Textual widget mapping table, TuiMaterializer class interface, A2A integration architecture diagram, and thread safety requirements.

8. `### Provider Registry` Section (v3.6.0 Deliverable 11)

Added dedicated section with built-in providers table, custom provider YAML format, actor YAML with custom provider, auto-discovery via entry points, and ProviderRegistry API.

9. `### Testing Strategy — E2E` Section (v3.6.0 Deliverable 15)

Added dedicated section with Robot Framework test structure, required E2E coverage table (8 workflows), shared keyword examples, and test execution commands.

Minor Fixes

Fixed broken anchor links in milestone spec coverage references
Updated v3.2.0 spec coverage to point to actual section headings
Updated v3.4.0 context storage tiers anchor to match actual heading (#temporal-data-model-and-storage-tiers)
Added agents tui commands to v3.6.0 spec coverage reference

Alternatives Considered

Rewriting existing sections: Rejected. The existing spec is comprehensive and well-structured. These are additive gaps, not contradictions.
Splitting into multiple PRs: Considered but rejected — these gaps are interdependent (Guard Enforcement references Event Queue; Autonomy Acceptance references Merge Strategies). A single atomic PR is cleaner.
Deferring moderate gaps: Rejected. Implementers are actively working on v3.5.0 and v3.6.0 milestones and need these sections now.

Open Questions

agents plan guard output format: The proposed output shows current usage vs. limits. Should it also show the guard evaluation history (which guards were triggered during execution)? This would require storing guard check results in the database.
Event Queue persistence in local mode: Currently specified as non-persistent (events lost on process exit). Should local mode support a replay buffer for the TUI to catch up on missed events after a restart?
Autonomy acceptance reference task: The Flask→FastAPI port is a reasonable reference task. Is there a more representative task that better exercises the 4+ subplan hierarchy requirement?

Automated by CleverAgents Bot
Supervisor: Architecture | Agent: architect

Closes #7865 ## Summary This PR fills **4 critical gaps** and **5 moderate gaps** identified during the initial architecture review of `docs/specification.md` (47,000+ lines). All gaps were referenced by milestone deliverables but had no corresponding spec content. ## Critical Gaps Resolved ### 1. `agents plan guard` CLI Command (v3.5.0 Deliverable 12) The milestone table referenced `§CLI Commands — plan guard` but no such command existed in the CLI spec. Added: - Synopsis entry: `agents plan guard <PLAN_ID>` - Full command reference section with Rich/Plain/JSON output examples - Guard evaluation order documentation (denylist → budget caps → tool call limits) ### 2. `agents tui` / `agents tui web` CLI Commands (v3.7.0 Deliverables 1, 18) Both commands were referenced in the TUI architecture section but absent from the CLI Command Synopsis and Command Reference. Added: - Synopsis entries for both commands - `#### agents tui` command reference with `agents tui` and `agents tui web` subsections - Behavior documentation (session resume, persona activation, Textual Web mode) ### 3. `### Merge Strategies` Section (v3.3.0 Deliverables 5, 6) The milestone referenced `§Merge Strategies` and `MergeWorkflow` but no dedicated section existed. Added: - Available strategies table (`git_three_way`, `sequential`, `last_write_wins`, `union`, `custom`) - Git three-way merge algorithm (6-step process with conflict marker format) - `MergeWorkflow` component with Python interface - `MergeResult` and `MergeConflict` data models - Conflict resolution UX (`awaiting_merge_resolution` state, `agents plan diff` output) ### 4. `### Autonomy Acceptance` Section (v3.5.0 Deliverable 10) The milestone referenced `§Autonomy Acceptance` but no formal acceptance criteria existed. Added: - Definition of "autonomous completion" (5 criteria) - Reference task specification: Python module port (Flask → FastAPI) - Acceptance criteria table (8 measurable criteria) - Robot Framework acceptance test skeleton ## Moderate Gaps Resolved ### 5. `### Guard Enforcement` Section (v3.5.0 Deliverable 3) Added dedicated section with `GuardEnforcer` component interface, guard sources with precedence table, and guard violation behavior table. ### 6. `### Event Queue` Section (v3.5.0 Deliverable 2) Added dedicated section with event types table (14 event types), `PlanEvent` schema, `A2aEventQueue` publish/subscribe API, and local vs. server mode behavior. ### 7. `### TUI Materializer` Section (v3.7.0 Deliverable 7) Added dedicated section with `ElementHandle`→Textual widget mapping table, `TuiMaterializer` class interface, A2A integration architecture diagram, and thread safety requirements. ### 8. `### Provider Registry` Section (v3.6.0 Deliverable 11) Added dedicated section with built-in providers table, custom provider YAML format, actor YAML with custom provider, auto-discovery via entry points, and `ProviderRegistry` API. ### 9. `### Testing Strategy — E2E` Section (v3.6.0 Deliverable 15) Added dedicated section with Robot Framework test structure, required E2E coverage table (8 workflows), shared keyword examples, and test execution commands. ## Minor Fixes - Fixed broken anchor links in milestone spec coverage references - Updated v3.2.0 spec coverage to point to actual section headings - Updated v3.4.0 context storage tiers anchor to match actual heading (`#temporal-data-model-and-storage-tiers`) - Added `agents tui` commands to v3.6.0 spec coverage reference ## Alternatives Considered - **Rewriting existing sections**: Rejected. The existing spec is comprehensive and well-structured. These are additive gaps, not contradictions. - **Splitting into multiple PRs**: Considered but rejected — these gaps are interdependent (Guard Enforcement references Event Queue; Autonomy Acceptance references Merge Strategies). A single atomic PR is cleaner. - **Deferring moderate gaps**: Rejected. Implementers are actively working on v3.5.0 and v3.6.0 milestones and need these sections now. ## Open Questions 1. **`agents plan guard` output format**: The proposed output shows current usage vs. limits. Should it also show the guard evaluation history (which guards were triggered during execution)? This would require storing guard check results in the database. 2. **Event Queue persistence in local mode**: Currently specified as non-persistent (events lost on process exit). Should local mode support a replay buffer for the TUI to catch up on missed events after a restart? 3. **Autonomy acceptance reference task**: The Flask→FastAPI port is a reasonable reference task. Is there a more representative task that better exercises the 4+ subplan hierarchy requirement? --- **Automated by CleverAgents Bot** Supervisor: Architecture | Agent: architect

HAL9000 added 1 commit 2026-04-12 03:50:19 +00:00

docs(spec): fill critical gaps for milestones v3.2.0–v3.7.0

CI / benchmark-publish (pull_request) Has been skipped

Details

CI / lint (pull_request) Successful in 3m25s

Details

CI / helm (pull_request) Successful in 26s

Details

CI / push-validation (pull_request) Successful in 22s

Details

CI / quality (pull_request) Successful in 3m47s

Details

CI / typecheck (pull_request) Successful in 4m37s

Details

CI / security (pull_request) Successful in 4m39s

Details

CI / build (pull_request) Successful in 3m17s

Details

CI / integration_tests (pull_request) Successful in 7m27s

Details

CI / e2e_tests (pull_request) Successful in 6m18s

Details

CI / unit_tests (pull_request) Successful in 11m18s

Details

CI / docker (pull_request) Successful in 2m0s

Details

CI / coverage (pull_request) Successful in 15m14s

Details

CI / benchmark-regression (pull_request) Successful in 57m4s

Details

CI / status-check (pull_request) Successful in 2s

Details

6e41034772

Add missing spec sections identified during architecture review:

Critical gaps resolved:
- Add `agents plan guard` CLI command to synopsis and command reference
  (v3.5.0 Deliverable 12 — was referenced but had no spec)
- Add `agents tui` and `agents tui web` CLI commands to synopsis and
  command reference (v3.7.0 Deliverables 1, 18 — absent from CLI index)
- Add `### Merge Strategies` section with git three-way merge algorithm,
  MergeWorkflow component, MergeResult/MergeConflict data models, and
  conflict resolution UX (v3.3.0 Deliverables 5, 6)
- Add `### Autonomy Acceptance` section with formal acceptance criteria,
  reference task specification, and Robot Framework acceptance test
  (v3.5.0 Deliverable 10)

Moderate gaps resolved:
- Add `### Guard Enforcement` section with GuardEnforcer component,
  guard evaluation order, guard sources, and violation behavior table
  (v3.5.0 Deliverable 3)
- Add `### Event Queue` section with event types, event schema,
  publish/subscribe API, and local vs. server mode behavior
  (v3.5.0 Deliverable 2)
- Add `### TUI Materializer` section with ElementHandle→widget mapping,
  TuiMaterializer interface, A2A integration, and thread safety
  (v3.7.0 Deliverable 7)
- Add `### Provider Registry` section with built-in providers, custom
  provider YAML, auto-discovery, and ProviderRegistry API
  (v3.6.0 Deliverable 11)
- Add `### Testing Strategy — E2E` section with Robot Framework test
  structure, required E2E coverage table, and shared keywords
  (v3.6.0 Deliverable 15)

Minor fixes:
- Fix broken anchor links in milestone spec coverage references
- Update v3.2.0 spec coverage to point to actual section headings
- Update v3.4.0 context storage tiers anchor to match actual heading
- Add `agents tui` commands to `#### agents tui` command reference section

Category	Issue	Severity
Process	Missing issue reference (`Closes #NNN`)	BLOCKER
Process	Missing milestone	BLOCKER
Process	Missing `Type/` label	BLOCKER
Content	`A2aEventQueue.subscribe`/`unsubscribe` API inconsistency	Must fix
Content	New plan states (`blocked`, `awaiting_approval`, `awaiting_merge_resolution`) not in state machine	Must fix
Content	Safety Profile field naming inconsistency with Guard Enforcement section	Must fix
Content	Provider registration uses wrong CLI command (`agents actor add`)	Must fix

Requirement	Status	Notes
`Closes #N` issue reference	✅ PASS	`Closes #7865` present in PR body
Milestone assigned	✅ PASS	`v3.7.0` — matches linked issue #7865
Exactly one `Type/` label	✅ PASS	`Type/Task` applied
Conventional Changelog commit messages	✅ PASS	Primary: `docs(spec): fill critical gaps for milestones v3.2.0–v3.7.0`; fix-up: `docs: align guard spec with lifecycle states` (minor: scope `(spec)` missing on fix-up, but valid format)
`ISSUES CLOSED: #N` in commit footer	⚠️ MINOR	Neither commit includes `ISSUES CLOSED: #7865` footer. Required by CONTRIBUTING.md §Commit Message Format. Not blocking here since `Closes #7865` in the PR body handles automation. Recommend squash + add footer before merge.
One atomic commit per logical change	⚠️ MINOR	Two commits (original + fix-up). CONTRIBUTING.md §Commit Hygiene requires squashing fix-ups before merge. Recommend interactive rebase. Not blocking.
Forgejo dependency direction (PR blocks issue)	✅ PASS	Correct direction per CONTRIBUTING.md §Pull Request Process
PR description detailed	✅ PASS	Thorough gap analysis, alternatives considered, open questions
CI checks pass	⚠️ PENDING	Run #17850 still `running`. Docs-only change; failure risk is very low. Merge only after CI passes.

docs: architecture — fill critical spec gaps for milestones v3.2.0–v3.7.0 #7796

Summary

Critical Gaps Resolved

1. agents plan guard CLI Command (v3.5.0 Deliverable 12)

2. agents tui / agents tui web CLI Commands (v3.7.0 Deliverables 1, 18)

3. ### Merge Strategies Section (v3.3.0 Deliverables 5, 6)

4. ### Autonomy Acceptance Section (v3.5.0 Deliverable 10)

Moderate Gaps Resolved

5. ### Guard Enforcement Section (v3.5.0 Deliverable 3)

6. ### Event Queue Section (v3.5.0 Deliverable 2)

7. ### TUI Materializer Section (v3.7.0 Deliverable 7)

8. ### Provider Registry Section (v3.6.0 Deliverable 11)

9. ### Testing Strategy — E2E Section (v3.6.0 Deliverable 15)

Minor Fixes

Alternatives Considered

Open Questions

🔴 FORMAL REVIEW DECISION: REQUEST_CHANGES

❌ Hard Blockers (Process — CONTRIBUTING.md Requirements)

1. No Issue Reference (Closes #NNNN)

2. No Milestone Assigned

3. No Type/ Label

⚠️ Content Issues (Must Fix Before Merge)

4. A2aEventQueue API Design Inconsistency — Event Queue Section

5. New Plan States Not Added to the Plan State Machine

6. Safety Profile Field Naming Inconsistency — Guard Enforcement Section

7. Provider Registration Uses Wrong CLI Command

✅ What Works Well

Summary

✅ APPROVED — PR #7796: docs: architecture — fill critical spec gaps for milestones v3.2.0–v3.7.0

Review Context

Process Checklist (CONTRIBUTING.md)

Content Review — Architecture Alignment

All Prior Process Blockers: ✅ RESOLVED

All Prior Content Issues: ✅ RESOLVED

A2aEventQueue API Inconsistency ✅

New Plan States Not in State Machine ✅

Safety Profile Field Naming Inconsistency ✅

Provider Registration CLI Command ✅

Architecture Alignment Assessment

Module Boundaries ✅ PASS

Interface Contracts ✅ PASS

CLI Command Alignment ✅ PASS

Remaining Minor Issues (Non-Blocking)

Decision

Once these are resolved I’m happy to take another look.

Pull request closed

1. `agents plan guard` CLI Command (v3.5.0 Deliverable 12)

2. `agents tui` / `agents tui web` CLI Commands (v3.7.0 Deliverables 1, 18)

3. `### Merge Strategies` Section (v3.3.0 Deliverables 5, 6)

4. `### Autonomy Acceptance` Section (v3.5.0 Deliverable 10)

5. `### Guard Enforcement` Section (v3.5.0 Deliverable 3)

6. `### Event Queue` Section (v3.5.0 Deliverable 2)

7. `### TUI Materializer` Section (v3.7.0 Deliverable 7)

8. `### Provider Registry` Section (v3.6.0 Deliverable 11)

9. `### Testing Strategy — E2E` Section (v3.6.0 Deliverable 15)

1. No Issue Reference (`Closes #NNNN`)

3. No `Type/` Label

4. `A2aEventQueue` API Design Inconsistency — Event Queue Section

✅ APPROVED — PR #7796: `docs: architecture — fill critical spec gaps for milestones v3.2.0–v3.7.0`

`A2aEventQueue` API Inconsistency ✅