[AUTO-DOCS-1] Add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0 #9902

2026-04-16T00:03:52Z

HAL9000 commented

2026-04-16 00:03:52 +00:00

Summary

This PR adds documentation for the completed v3.0.0 and v3.1.0 milestones:

CHANGELOG.md: Added entries for v3.0.0 (Minimal Local Source-Code Workflow) and v3.1.0 (Actor Compiler + Full LLM Integration) following Keep a Changelog format
README.md: Added "Source-Code Workflow (v3.0.0+)" and "Actor System (v3.1.0+)" sections with CLI examples and links to architecture docs
docs/architecture.md: Extended with a new "Milestone History" section documenting capabilities introduced in v3.0.0 and v3.1.0

Milestones Documented

v3.0.0 — M1: Minimal Local Source-Code Workflow (COMPLETE): action create, resource add git-checkout, project create/link-resource, plan use/execute/diff/apply, git worktree sandbox, SQLite + Alembic, Pydantic v2 domain models
v3.1.0 — M2: Actor Compiler + Full LLM Integration (COMPLETE): Actor YAML v3 schema, GRAPH-type compilation to LangGraph StateGraph, agents actor add CLI, custom actor resolution, skill registry, MCP adapter, validation runner, multi-file ChangeSet

Automated by CleverAgents Bot
Supervisor: Documentation | Agent: documentation-pool-supervisor [AUTO-DOCS-1]

Automated by CleverAgents Bot
Agent: pr-creator

## Summary This PR adds documentation for the completed v3.0.0 and v3.1.0 milestones: - **CHANGELOG.md**: Added entries for v3.0.0 (Minimal Local Source-Code Workflow) and v3.1.0 (Actor Compiler + Full LLM Integration) following Keep a Changelog format - **README.md**: Added "Source-Code Workflow (v3.0.0+)" and "Actor System (v3.1.0+)" sections with CLI examples and links to architecture docs - **docs/architecture.md**: Extended with a new "Milestone History" section documenting capabilities introduced in v3.0.0 and v3.1.0 ## Milestones Documented - **v3.0.0 — M1: Minimal Local Source-Code Workflow** (COMPLETE): action create, resource add git-checkout, project create/link-resource, plan use/execute/diff/apply, git worktree sandbox, SQLite + Alembic, Pydantic v2 domain models - **v3.1.0 — M2: Actor Compiler + Full LLM Integration** (COMPLETE): Actor YAML v3 schema, GRAPH-type compilation to LangGraph StateGraph, agents actor add CLI, custom actor resolution, skill registry, MCP adapter, validation runner, multi-file ChangeSet --- **Automated by CleverAgents Bot** Supervisor: Documentation | Agent: documentation-pool-supervisor [AUTO-DOCS-1] --- **Automated by CleverAgents Bot** Agent: pr-creator

HAL9000 added 57 commits 2026-04-16 00:03:52 +00:00

Build: Made conflict resolution a more explicit part of the pr-merge agents

CI / helm (push) Successful in 35s

Details

CI / push-validation (push) Successful in 19s

Details

CI / build (push) Successful in 3m57s

Details

CI / lint (push) Successful in 4m3s

Details

CI / quality (push) Successful in 4m34s

Details

CI / typecheck (push) Successful in 4m44s

Details

CI / security (push) Successful in 4m50s

Details

CI / e2e_tests (push) Successful in 7m3s

Details

CI / integration_tests (push) Successful in 10m13s

Details

CI / unit_tests (push) Successful in 11m21s

Details

CI / docker (push) Successful in 1m30s

Details

CI / coverage (push) Successful in 10m42s

Details

CI / status-check (push) Successful in 1s

Details

a71c142854

fix(concurrency): fix SubplanExecutionService._execute_parallel() #7582 3cfa24854a

Ensure fail_fast cancels in-flight futures and reports them as CANCELLED.

Add Behave coverage that reproduces the concurrency regression.

ISSUES CLOSED: #7582

docs(changelog): add v3.3.0 changelog entry for #7582 fail_fast fix

CI / lint (pull_request) Successful in 30s

Details

CI / typecheck (pull_request) Successful in 1m6s

Details

CI / security (pull_request) Successful in 1m7s

Details

CI / quality (pull_request) Successful in 42s

Details

CI / helm (pull_request) Successful in 27s

Details

CI / build (pull_request) Successful in 34s

Details

CI / push-validation (pull_request) Successful in 23s

Details

CI / e2e_tests (pull_request) Successful in 3m17s

Details

CI / integration_tests (pull_request) Successful in 4m13s

Details

CI / unit_tests (pull_request) Successful in 5m36s

Details

CI / docker (pull_request) Successful in 1m37s

Details

CI / coverage (pull_request) Successful in 11m50s

Details

CI / status-check (pull_request) Successful in 2s

Details

CI / lint (push) Successful in 36s

Details

CI / typecheck (push) Successful in 53s

Details

CI / quality (push) Successful in 30s

Details

CI / security (push) Successful in 1m16s

Details

CI / helm (push) Successful in 22s

Details

CI / push-validation (push) Successful in 15s

Details

CI / e2e_tests (push) Successful in 3m50s

Details

CI / build (push) Successful in 3m32s

Details

CI / integration_tests (push) Successful in 6m45s

Details

CI / unit_tests (push) Successful in 7m39s

Details

CI / docker (push) Successful in 1m19s

Details

CI / coverage (push) Successful in 14m59s

Details

CI / status-check (push) Successful in 1s

Details

c11b05b773

Build: Better protection against agents editing the main working directory

CI / lint (push) Successful in 24s

Details

CI / typecheck (push) Successful in 54s

Details

CI / quality (push) Successful in 45s

Details

CI / security (push) Successful in 1m15s

Details

CI / build (push) Successful in 29s

Details

CI / push-validation (push) Successful in 30s

Details

CI / helm (push) Successful in 37s

Details

CI / e2e_tests (push) Successful in 3m39s

Details

CI / integration_tests (push) Successful in 4m28s

Details

CI / unit_tests (push) Successful in 5m22s

Details

CI / docker (push) Successful in 21s

Details

CI / coverage (push) Successful in 11m39s

Details

CI / status-check (push) Successful in 1s

Details

38bcd41338

fix(concurrency): wire LockService into plan lifecycle — guard execute_plan and apply_plan b83f575cb5

LockService was implemented but never integrated into the plan execution
path, leaving execute_plan() and apply_plan() unprotected against
concurrent calls on the same plan_id (race condition, issue #7989).

Changes:
- container.py: add _build_lock_service() factory and register
  LockService as a Singleton provider; inject it into
  PlanLifecycleService via the DI container.
- plan_lifecycle_service.py: accept optional lock_service parameter in
  __init__; in execute_plan() and apply_plan() acquire a plan-level
  advisory lock before the critical section and release it in a finally
  block so the lock is always freed even when exceptions occur.

When lock_service is None (existing tests without DI wiring) the
behaviour is unchanged — locking is silently skipped for backward
compatibility.

Closes #7989

fix(concurrency): fix lock owner identity to prevent re-entrant acquisition b1f7b51a80

The original implementation used plan_id as the owner_id when acquiring
the advisory lock. Because LockService treats owner_id as the caller
identity and allows re-entrant acquisition for the same owner, concurrent
sessions attempting to lock the same plan would all present the same
owner_id and thus silently renew the lock instead of raising
LockConflictError.

This fix generates a unique UUID for each invocation as the owner_id,
ensuring that concurrent sessions present different owners and thus
trigger LockConflictError when attempting to acquire the same plan lock.
The lock is still acquired before the phase transition and released in
a finally block to ensure cleanup even on error.

ISSUES CLOSED: #8067

docs(contributors): add HAL 9000 concurrency-fix contribution detail

CI / lint (pull_request) Successful in 39s

Details

CI / quality (pull_request) Successful in 41s

Details

CI / typecheck (pull_request) Successful in 57s

Details

CI / security (pull_request) Successful in 57s

Details

CI / build (pull_request) Successful in 45s

Details

CI / helm (pull_request) Successful in 45s

Details

CI / push-validation (pull_request) Successful in 20s

Details

CI / e2e_tests (pull_request) Successful in 4m5s

Details

CI / integration_tests (pull_request) Successful in 4m14s

Details

CI / unit_tests (pull_request) Successful in 5m30s

Details

CI / docker (pull_request) Successful in 1m33s

Details

CI / coverage (pull_request) Successful in 13m0s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / lint (push) Successful in 29s

Details

CI / quality (push) Successful in 48s

Details

CI / typecheck (push) Successful in 58s

Details

CI / security (push) Successful in 59s

Details

CI / build (push) Successful in 34s

Details

CI / push-validation (push) Successful in 29s

Details

CI / helm (push) Successful in 36s

Details

CI / e2e_tests (push) Successful in 3m22s

Details

CI / integration_tests (push) Successful in 5m46s

Details

CI / unit_tests (push) Successful in 8m50s

Details

CI / docker (push) Successful in 2m10s

Details

CI / coverage (push) Successful in 13m38s

Details

CI / status-check (push) Successful in 1s

Details

e757ca9db0

Add a Details entry for HAL 9000 describing the plan lifecycle
concurrency race-condition fix (#7989) — wiring LockService into
execute_plan/apply_plan with unique per-invocation owner identities.

ISSUES CLOSED: #7989

Build: improve grooming worker permissions, milestone enforcement, and PR merge throughput

CI / lint (push) Successful in 21s

Details

CI / quality (push) Successful in 43s

Details

CI / security (push) Successful in 51s

Details

CI / build (push) Successful in 28s

Details

CI / helm (push) Successful in 40s

Details

CI / push-validation (push) Successful in 27s

Details

CI / typecheck (push) Successful in 1m20s

Details

CI / e2e_tests (push) Successful in 3m25s

Details

CI / integration_tests (push) Successful in 3m59s

Details

CI / unit_tests (push) Successful in 5m13s

Details

CI / docker (push) Successful in 10s

Details

CI / coverage (push) Successful in 12m9s

Details

CI / status-check (push) Successful in 1s

Details

CI / lint (pull_request) Successful in 31s

Details

CI / typecheck (pull_request) Successful in 48s

Details

CI / quality (pull_request) Successful in 37s

Details

CI / security (pull_request) Successful in 58s

Details

CI / helm (pull_request) Successful in 22s

Details

CI / build (pull_request) Successful in 34s

Details

CI / push-validation (pull_request) Successful in 16s

Details

CI / e2e_tests (pull_request) Successful in 4m10s

Details

CI / integration_tests (pull_request) Successful in 4m20s

Details

CI / coverage (pull_request) Has been cancelled

Details

CI / unit_tests (pull_request) Has been cancelled

Details

CI / status-check (pull_request) Has been cancelled

Details

CI / docker (pull_request) Has been cancelled

Details

64b1f4c0b6

- Fix grooming-worker Forgejo permissions (deny → allow) to unblock direct API calls
- Route PR label fetching through forgejo-label-manager subagent
- Replace priority-alignment check with milestone enforcement (every issue must have a milestone)
- Add step 11: address non-code review remarks (labels, description, milestone) during grooming
- Clarify grooming-pool-supervisor stale threshold to explicit 24-hour window
- Refactor pr-merge-pool-supervisor main loop into explicit numbered steps
- Add triage strategy section emphasising parallel review checks and immediate worker dispatch
- Tighten merge criteria: explicit APPROVED state, no unresolved REQUEST_CHANGES on current head
- Dispatch workers for all PR processing, not only rebase operations
- Add rule to batch forgejo_list_pull_reviews calls instead of checking serially

docs(changelog): add plan action-arguments UNIQUE constraint fix (#4197 )

CI / lint (pull_request) Successful in 26s

Details

CI / build (pull_request) Successful in 25s

Details

CI / push-validation (pull_request) Successful in 18s

Details

CI / typecheck (pull_request) Successful in 50s

Details

CI / quality (pull_request) Successful in 56s

Details

CI / security (pull_request) Successful in 1m1s

Details

CI / helm (pull_request) Successful in 43s

Details

CI / e2e_tests (pull_request) Successful in 4m13s

Details

CI / integration_tests (pull_request) Successful in 4m21s

Details

CI / unit_tests (pull_request) Successful in 5m23s

Details

CI / docker (pull_request) Successful in 22s

Details

CI / coverage (pull_request) Successful in 10m47s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / lint (push) Successful in 23s

Details

CI / build (push) Successful in 21s

Details

CI / helm (push) Successful in 23s

Details

CI / typecheck (push) Successful in 48s

Details

CI / quality (push) Successful in 51s

Details

CI / security (push) Successful in 1m1s

Details

CI / push-validation (push) Successful in 44s

Details

CI / integration_tests (push) Successful in 4m21s

Details

CI / e2e_tests (push) Successful in 4m29s

Details

CI / unit_tests (push) Successful in 5m30s

Details

CI / docker (push) Successful in 11s

Details

CI / coverage (push) Successful in 11m19s

Details

CI / status-check (push) Successful in 1s

Details

acc5f01155

Documents the fix for sqlite3.IntegrityError when agents plan use is called on an action that already has arguments registered via action create.

ISSUES CLOSED: #6856

docs: integrate docs-writer automation tracking workflows

CI / lint (pull_request) Successful in 51s

Details

CI / quality (pull_request) Successful in 49s

Details

CI / typecheck (pull_request) Successful in 58s

Details

CI / security (pull_request) Successful in 53s

Details

CI / build (pull_request) Successful in 24s

Details

CI / push-validation (pull_request) Successful in 20s

Details

CI / helm (pull_request) Successful in 23s

Details

CI / e2e_tests (pull_request) Successful in 4m3s

Details

CI / integration_tests (pull_request) Successful in 8m37s

Details

CI / unit_tests (pull_request) Successful in 11m26s

Details

CI / coverage (pull_request) Successful in 14m48s

Details

CI / docker (pull_request) Successful in 11s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / lint (push) Successful in 19s

Details

CI / quality (push) Successful in 45s

Details

CI / security (push) Successful in 1m0s

Details

CI / typecheck (push) Successful in 1m29s

Details

CI / build (push) Successful in 39s

Details

CI / helm (push) Successful in 25s

Details

CI / push-validation (push) Successful in 18s

Details

CI / e2e_tests (push) Successful in 4m42s

Details

CI / integration_tests (push) Successful in 7m12s

Details

CI / unit_tests (push) Successful in 8m52s

Details

CI / coverage (push) Successful in 13m35s

Details

CI / docker (push) Successful in 16s

Details

CI / status-check (push) Successful in 1s

Details

6559a0e9df

- document docs-writer responsibilities and automation tracking requirements\n- enforce automation tracking label validation and clean coverage regression tags\n\nISSUES CLOSED: #7616

# Conflicts:
#	CHANGELOG.md
#	docs/development/automation-tracking.md
#	docs/development/docs-writer.md
#	mkdocs.yml

docs(timeline): update schedule adherence Day 102 (2026-04-12)

CI / lint (pull_request) Successful in 39s

Details

CI / typecheck (pull_request) Successful in 1m18s

Details

CI / security (pull_request) Successful in 1m29s

Details

CI / quality (pull_request) Successful in 48s

Details

CI / build (pull_request) Successful in 30s

Details

CI / push-validation (pull_request) Successful in 30s

Details

CI / helm (pull_request) Successful in 42s

Details

CI / e2e_tests (pull_request) Successful in 4m26s

Details

CI / integration_tests (pull_request) Successful in 4m55s

Details

CI / unit_tests (pull_request) Successful in 5m34s

Details

CI / docker (pull_request) Successful in 16s

Details

CI / coverage (pull_request) Successful in 14m35s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / lint (push) Successful in 23s

Details

CI / typecheck (push) Successful in 53s

Details

CI / quality (push) Successful in 43s

Details

CI / security (push) Successful in 59s

Details

CI / helm (push) Successful in 23s

Details

CI / build (push) Successful in 26s

Details

CI / push-validation (push) Successful in 18s

Details

CI / e2e_tests (push) Successful in 3m14s

Details

CI / integration_tests (push) Successful in 6m40s

Details

CI / unit_tests (push) Successful in 8m4s

Details

CI / docker (push) Successful in 11s

Details

CI / coverage (push) Successful in 14m37s

Details

CI / status-check (push) Successful in 1s

Details

9db348e5f6

docs(api): add ACMS/UKO API reference and update nav

CI / lint (pull_request) Successful in 35s

Details

CI / typecheck (pull_request) Successful in 45s

Details

CI / quality (pull_request) Successful in 41s

Details

CI / security (pull_request) Successful in 49s

Details

CI / build (pull_request) Successful in 32s

Details

CI / helm (pull_request) Successful in 26s

Details

CI / push-validation (pull_request) Successful in 23s

Details

CI / e2e_tests (pull_request) Successful in 3m38s

Details

CI / integration_tests (pull_request) Successful in 4m36s

Details

CI / unit_tests (pull_request) Successful in 5m25s

Details

CI / docker (pull_request) Successful in 10s

Details

CI / coverage (pull_request) Successful in 16m0s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / lint (push) Successful in 23s

Details

CI / quality (push) Successful in 41s

Details

CI / security (push) Successful in 54s

Details

CI / typecheck (push) Successful in 56s

Details

CI / build (push) Successful in 20s

Details

CI / helm (push) Successful in 24s

Details

CI / push-validation (push) Successful in 33s

Details

CI / integration_tests (push) Successful in 4m6s

Details

CI / e2e_tests (push) Successful in 4m27s

Details

CI / unit_tests (push) Successful in 5m42s

Details

CI / docker (push) Successful in 1m36s

Details

CI / coverage (push) Successful in 10m56s

Details

CI / status-check (push) Successful in 1s

Details

df863f169b

Add comprehensive API documentation for the cleveragents.acms package,
covering the four-layer UKO ontology hierarchy (Layer 0-3), all public
types (VocabularyRegistry, ProvenanceInfo, UKOClass, UKOProperty,
UKOVocabulary, Layer2Dependency, ParadigmVocabulary), detail level maps
(DetailLevelMapBuilder, build_detail_level_map, build_effective_map,
resolve_detail_level), and all Layer 3 language vocabulary types for
Python, TypeScript, Rust, and Java.

- Add docs/api/acms.md with full API reference and usage example
- Update docs/api/index.md to include ACMS/UKO in the module index
- Update mkdocs.yml nav to include the new ACMS/UKO page
- Update CHANGELOG.md [Unreleased] with the documentation addition

docs(spec): add v3.8.0 Server Implementation milestone plan and update status table (#7701 )

CI / push-validation (push) Successful in 19s

Details

CI / helm (push) Successful in 23s

Details

CI / build (push) Successful in 36s

Details

CI / e2e_tests (push) Successful in 3m20s

Details

CI / lint (push) Successful in 3m20s

Details

CI / security (push) Successful in 4m5s

Details

CI / quality (push) Successful in 4m19s

Details

CI / typecheck (push) Successful in 4m25s

Details

CI / integration_tests (push) Successful in 9m23s

Details

CI / unit_tests (push) Has been cancelled

Details

CI / coverage (push) Has been cancelled

Details

CI / docker (push) Has been cancelled

Details

CI / status-check (push) Has been cancelled

Details

510cb03d99

Co-authored-by: CleverThis <hal9000@cleverthis.com>
Co-committed-by: CleverThis <hal9000@cleverthis.com>

Build: Improved merge, review, and implementor logic to have better and more clear priorities

CI / push-validation (push) Successful in 10s

Details

CI / helm (push) Successful in 24s

Details

CI / lint (push) Successful in 35s

Details

CI / build (push) Successful in 40s

Details

CI / typecheck (push) Successful in 48s

Details

CI / e2e_tests (push) Successful in 3m26s

Details

CI / quality (push) Successful in 3m53s

Details

CI / security (push) Successful in 4m5s

Details

CI / integration_tests (push) Successful in 6m26s

Details

CI / unit_tests (push) Successful in 7m28s

Details

CI / docker (push) Successful in 1m36s

Details

CI / coverage (push) Successful in 17m32s

Details

CI / status-check (push) Successful in 1s

Details

78cfdc9b1b

docs(timeline): update schedule adherence Day 104 (2026-04-14) [AUTO-TIME-1]

CI / helm (push) Successful in 24s

Details

CI / push-validation (push) Successful in 17s

Details

CI / lint (push) Successful in 3m21s

Details

CI / build (push) Successful in 3m17s

Details

CI / quality (push) Successful in 3m54s

Details

CI / typecheck (push) Successful in 3m58s

Details

CI / security (push) Successful in 4m9s

Details

CI / e2e_tests (push) Successful in 6m51s

Details

CI / integration_tests (push) Successful in 9m44s

Details

CI / push-validation (pull_request) Successful in 18s

Details

CI / helm (pull_request) Successful in 25s

Details

CI / unit_tests (push) Successful in 11m5s

Details

CI / docker (push) Successful in 1m54s

Details

CI / lint (pull_request) Successful in 3m20s

Details

CI / build (pull_request) Successful in 3m18s

Details

CI / quality (pull_request) Successful in 3m41s

Details

CI / typecheck (pull_request) Successful in 3m42s

Details

CI / security (pull_request) Successful in 3m49s

Details

CI / e2e_tests (pull_request) Successful in 4m26s

Details

CI / integration_tests (pull_request) Successful in 7m38s

Details

CI / unit_tests (pull_request) Successful in 8m38s

Details

CI / coverage (push) Successful in 14m29s

Details

CI / status-check (push) Successful in 1s

Details

CI / docker (pull_request) Successful in 1m57s

Details

CI / coverage (pull_request) Successful in 14m35s

Details

CI / status-check (pull_request) Successful in 1s

Details

0d6b197504

Build: Refined some of the wording in the supervisors to get more reliable performance out of them. Made permissions stricter so we will get less circumvention of intended permissions

CI / push-validation (push) Successful in 16s

Details

CI / helm (push) Successful in 17s

Details

CI / typecheck (push) Successful in 1m4s

Details

CI / build (push) Successful in 3m19s

Details

CI / lint (push) Successful in 3m19s

Details

CI / quality (push) Successful in 3m49s

Details

CI / integration_tests (push) Successful in 4m0s

Details

CI / security (push) Successful in 4m5s

Details

CI / e2e_tests (push) Successful in 6m12s

Details

CI / unit_tests (push) Successful in 9m35s

Details

CI / docker (push) Successful in 1m31s

Details

CI / coverage (push) Successful in 13m52s

Details

CI / status-check (push) Successful in 1s

Details

acb901abf1

Build: doom loops are detected and killed

CI / push-validation (push) Successful in 17s

Details

CI / build (push) Successful in 36s

Details

CI / lint (push) Successful in 38s

Details

CI / helm (push) Successful in 43s

Details

CI / typecheck (push) Successful in 50s

Details

CI / quality (push) Successful in 3m42s

Details

CI / integration_tests (push) Successful in 3m59s

Details

CI / security (push) Successful in 4m8s

Details

CI / e2e_tests (push) Successful in 4m17s

Details

CI / unit_tests (push) Successful in 8m29s

Details

CI / docker (push) Successful in 10s

Details

CI / coverage (push) Successful in 13m52s

Details

CI / status-check (push) Successful in 1s

Details

5a57eb9a07

feat(plan): implement LLM-powered strategy actor

CI / push-validation (pull_request) Successful in 17s

Details

CI / helm (pull_request) Successful in 19s

Details

CI / lint (pull_request) Successful in 27s

Details

CI / security (pull_request) Successful in 32s

Details

CI / quality (pull_request) Successful in 41s

Details

CI / typecheck (pull_request) Successful in 48s

Details

CI / build (pull_request) Successful in 3m18s

Details

CI / e2e_tests (pull_request) Successful in 4m16s

Details

CI / integration_tests (pull_request) Successful in 4m16s

Details

CI / unit_tests (pull_request) Successful in 5m8s

Details

CI / docker (pull_request) Successful in 1m32s

Details

CI / coverage (pull_request) Successful in 10m53s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / push-validation (push) Successful in 18s

Details

CI / helm (push) Successful in 23s

Details

CI / build (push) Successful in 31s

Details

CI / lint (push) Successful in 43s

Details

CI / typecheck (push) Successful in 52s

Details

CI / security (push) Successful in 52s

Details

CI / e2e_tests (push) Successful in 3m37s

Details

CI / quality (push) Successful in 3m44s

Details

CI / integration_tests (push) Successful in 6m35s

Details

CI / unit_tests (push) Failing after 7m36s

Details

CI / docker (push) Has been skipped

Details

CI / coverage (push) Successful in 13m39s

Details

CI / status-check (push) Failing after 1s

Details

d3cb534caf

Implement StrategyActor class for the plan strategize phase that uses an
LLM to produce hierarchical execution strategies with dependencies,
resource requirements, estimated complexity, and risk scores.

Key components:
- StrategyActor: Core actor with LLM prompt construction, response
  parsing (JSON and numbered-list fallback), and graceful degradation
  to StrategizeStubActor when no LLM provider is configured
- StrategyAction/StrategyTree: Pydantic models for the hierarchical
  action tree with dependency links
- validate_no_cycles(): Kahns algorithm (deque-based) for dependency
  graph cycle detection, raising PlanError on circular dependencies
- build_strategy_prompt(): Context-aware prompt construction using
  definition_of_done, resources, project context, and ACMS analysis
  with XML-delimited user content sections for prompt injection
  hardening
- parse_strategy_response(): Robust LLM output parsing with JSON
  extraction and numbered-list fallback
- resolve_strategy_actor(): Integration point for the existing
  actor.default.strategy config key (CLEVERAGENTS_DEFAULT_STRATEGY_ACTOR)
- Decision conversion producing strategy_choice Decision objects
- build_decisions() preserves tree hierarchy via parent_id mapping,
  populates downstream_decision_ids from dependency edges, and
  validates plan_id

Structural tree hierarchy (B2 review fix):
- _build_tree infers parent_id from the dependency graph: each
  actions first resolved dependency becomes its structural parent.
  Actions with no dependencies fall back to the root.  This produces
  hierarchical trees for agents plan tree rendering per spec
  Plan Decision Tree.

Downstream decision tracking (B3 review fix):
- build_decisions populates downstream_decision_ids from the strategy
  trees dependency edges using a pre-generated decision_id map so
  influence relationships between decisions are recorded per the spec
  Decision Record Structure.

Post code-review hardening (PR #1175):
- Broadened exception handling in execute() and ACMS retrieval to
  catch all LLM provider errors (openai, httpx, anthropic, etc.)
  with graceful fallback to stub mode (H1, H2)
- Added warning log for unresolvable dependency references so
  dropped edges are visible in structured logs (H3)
- Added XML-delimited user content sections and explicit data-only
  instructions in system prompt for prompt injection hardening (H4)
- Switched prompt truncation to word-boundary-safe _truncate_at_word()
  for all prompt input sections (M1)
- Fixed _parse_actor_name to preserve user-specified provider or
  model when only one segment is empty, instead of discarding both (M2)
- Annotated _build_invariant_records as placeholder pending the
  Invariant Reconciliation Actor implementation (M5)
- Documented resources/project_context params as future-wired
  through PlanExecutor.run_strategize() (M8)
- Added docstring noting supersession relationship with
  LLMStrategizeActor in llm_actors.py (M9)
- Added __all__ export definition (L2)
- Improved validate_no_cycles docstring edge direction semantics (L7)
- Cap JSON parse retry loop at _MAX_JSON_PARSE_RETRIES (10)

Post second code-review hardening (PR #1175, review cycle 2):
- Fixed _truncate_at_word docstring: documented max_chars >= 3
  precondition for the result-length guarantee (R-H1)
- Added warning log in build_decisions for unresolvable parent_id
  references, matching the existing _build_tree warning for
  unresolvable dependency references (R-H2)
- Fixed _parse_actor_name to handle whitespace-only input by adding
  actor_name.strip() check alongside the emptiness check (R-M1)
- Tightened ACMS scenario assertions from non-empty to expected
  count of 5 decisions (R-L3)
- Added timeout=60s on_timeout=kill to all Robot test cases for
  consistency with project patterns (R-M5)

Post third code-review hardening (PR #1175, review cycle 3):
- Added _sanitize_xml_content() to escape XML special characters
  (<, >, &) in user content before embedding into XML-delimited
  prompt sections, preventing prompt injection via forged closing
  tags (spec Prompt Injection Mitigation) (CR3-M1)
- Upgraded _try_parse_json() to multi-anchor retry: collects all
  [{ positions left-to-right and tries each as a candidate start,
  fixing false-start anchoring when LLM preamble contains [{
  fragments before the real JSON array (CR3-M2)
- Added _truncate_at_word() guard for max_chars < 3: returns a
  hard slice instead of word-boundary truncation when the ellipsis
  would exceed the limit (CR3-L2)
- Changed _build_tree collision fallback key from -(idx+1) to
  -(1_000_000+idx) to eliminate theoretical collision with
  LLM-produced negative step numbers (CR3-L3)
- Added forward-looking API docstring note to build_decisions()
  documenting that it is not yet wired into PlanExecutor and will
  be integrated once Decision persistence lands (CR3-M3)

Post fourth code-review hardening (PR #1175, review cycle 4):
- Fixed _try_parse_json per-anchor retry counter: reset retries=0
  at the start of each anchor iteration so false-start [{ anchors
  in LLM preamble text no longer exhaust the retry budget for the
  correct anchor (CR4-B1)
- Added known-limitations docstring to module header documenting
  missing decision types (resource_selection, subplan_spawn,
  invariant_enforced) as future work (CR4-D1)
- Rewrote XML injection assertion in test to use regex extraction
  instead of fragile chained .split() calls that could IndexError
  on structural changes (CR4-T5)

Post fifth code-review hardening (PR #1175, review cycle 5):
- Added warning log in build_decisions for empty-string parent_id
  (distinct from None) so the silent fallback to root is visible
  in structured logs for debuggability (CR5-B1)
- Added plan_id propagation assertion to build_decisions test
  scenarios verifying decision.plan_id matches the input (CR5-T1)
- Added sequence_number monotonicity assertion verifying decision
  sequence_numbers are zero-indexed and monotonically increasing
  (CR5-T2)
- Added _truncate_at_word boundary test for max_chars=3 (exactly
  ellipsis length) verifying correct "..." output (CR5-T3)
- Tightened false-start anchor test from permissive len>=1 to
  specific description match "Sole real action" (CR5-T4)
- Added word-boundary truncation test using space-separated input
  to exercise the rfind(" ") path under oversized DoD (CR5-T5)

Post sixth code-review hardening (PR #1175, review cycle 6):
- Added _MAX_INVARIANTS cap (100) for invariant list truncation in
  prompt to prevent token limit overflows, consistent with other prompt
  section caps (CR6-M4)
- Added negative max_chars guard in _truncate_at_word returning empty
  string instead of slicing from end (CR6-M5)
- Added global JSON parse attempt cap _MAX_GLOBAL_JSON_ATTEMPTS (50)
  across all anchors in _try_parse_json (CR6-L3)
- Moved re import to module level in strategy_parsing.py per
  CONTRIBUTING import guidelines (CR6-L4)
- Extracted _DEFAULT_DESCRIPTION constant to eliminate duplication
  between _default_action() and _build_tree() (CR6-L5)

Post seventh code-review hardening (PR #1175, review cycle 7):
- Decoupled _execute_stub from StrategizeStubActor._parse_steps
  private method by delegating to parse_strategy_response, removing
  cross-class private method dependency (CR7-M1)
- Added ULID format validation on plan_id in execute() and
  build_decisions() for spec-consistent argument validation per
  §Plan glossary and CONTRIBUTING §Argument Validation (CR7-M2)
- Constrained StrategyAction.estimated_complexity to
  Literal["low", "medium", "high"] at Pydantic model level per
  CONTRIBUTING §Type Safety (CR7-M5)
- Documented XML-tag prompt boundary deviation from spec
  [USER_CONTENT_START]/[USER_CONTENT_END] markers with rationale
  for the more structured approach (CR7-M6)
- Added _build_tree empty-input guard comment documenting orphaned
  root_id semantics (CR7-L1)
- Added _truncate_at_word > 0 intent comment explaining why
  position-0 space is intentionally excluded (CR7-L2)
- Added build_decisions context_snapshot future-work comment
  referencing spec §Decision Record Structure (CR7-L5)
- Used enumerate() in _build_tree first loop for idiomatic
  Python (CR7-L7)
- Fixed false-start anchor test (CR5-T4) broken by CR6-L3 global
  cap: reduced preamble fragments from 15 to 3 so total attempts
  stay within _MAX_GLOBAL_JSON_ATTEMPTS (CR7-T1)
- Fixed test plan_ids containing non-Crockford-Base32 characters
  (L→K) to pass ULID format validation (CR7-T2)

Tests:
- 105 Behave BDD scenarios in features/strategy_actor_llm.feature
  adding: global JSON attempt cap exhaustion (CR7-L3), orphaned
  dependency edge silent drop (CR7-L4), non-ULID plan_id rejection
  in execute() and build_decisions() (CR7-M2)
- 101 Behave BDD scenarios in features/strategy_actor_llm.feature
  including new scenarios for _truncate_at_word edge cases (L3),
  create_llm argument verification (L4), non-numeric step field
  fallback (L5), updated assertions for XML-delimited prompts
  and _parse_actor_name partial-segment preservation (M2),
  lifecycle exception fallback (R1), PydanticValidationError
  re-raise verification (R2), self-loop cycle detection (R3),
  whitespace-only actor name (R4), XML tag injection sanitisation
  (CR3-M1), preamble bracket fragment parsing (CR3-M2),
  _truncate_at_word sub-3 limit (CR3-L2), resolve_strategy_actor
  with both llm config and registry (CR3-L5), build_decisions
  unresolvable parent_id fallback (CR3-L7), XML injection in
  resources/project_context/acms_context fields (CR4-S1),
  ampersand escaping (CR4-S1d), false-start anchor retry budget
  (CR4-T3), non-sequential step edge specificity (CR4-T4),
  plan_id propagation (CR5-T1), sequence_number monotonicity
  (CR5-T2), max_chars=3 boundary (CR5-T3), false-start anchor
  specificity (CR5-T4), word-boundary truncation (CR5-T5),
  invariant prompt constraints (CR6-M2), invariant XML
  sanitisation (CR6-M3), invariant truncation cap (CR6-M4),
  negative max_chars (CR6-M5), and no-space truncation (CR6-L8)
- 7 Robot Framework integration tests in robot/strategy_actor.robot
- Mock LLM provider in features/mocks/mock_strategy_llm.py

All nox stages pass: lint, typecheck, unit_tests (13789 scenarios),
integration_tests (1863 passed).
integration_tests (1863 passed, 2 pre-existing TDD failures unrelated
to this change).

ISSUES CLOSED: #828

Build: sped up bootstrap time of pr-merge pool agent

CI / push-validation (push) Successful in 19s

Details

CI / build (push) Successful in 25s

Details

CI / helm (push) Successful in 25s

Details

CI / lint (push) Successful in 36s

Details

CI / quality (push) Successful in 43s

Details

CI / typecheck (push) Successful in 49s

Details

CI / security (push) Successful in 54s

Details

CI / e2e_tests (push) Successful in 4m25s

Details

CI / unit_tests (push) Successful in 5m31s

Details

CI / docker (push) Successful in 1m32s

Details

CI / integration_tests (push) Successful in 7m18s

Details

CI / coverage (push) Successful in 13m37s

Details

CI / status-check (push) Successful in 1s

Details

abbb830c60

fix(cli): add missing "✓ OK" footer to agents plan errors rich output

CI / push-validation (pull_request) Successful in 19s

Details

CI / helm (pull_request) Successful in 19s

Details

CI / build (pull_request) Successful in 20s

Details

CI / quality (pull_request) Successful in 41s

Details

CI / lint (pull_request) Successful in 47s

Details

CI / typecheck (pull_request) Successful in 53s

Details

CI / security (pull_request) Successful in 1m27s

Details

CI / e2e_tests (pull_request) Successful in 3m39s

Details

CI / integration_tests (pull_request) Successful in 4m10s

Details

CI / unit_tests (pull_request) Successful in 5m56s

Details

CI / docker (pull_request) Successful in 1m21s

Details

CI / coverage (pull_request) Successful in 10m52s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / push-validation (push) Successful in 19s

Details

CI / helm (push) Successful in 23s

Details

CI / build (push) Successful in 28s

Details

CI / e2e_tests (push) Successful in 3m13s

Details

CI / lint (push) Successful in 3m17s

Details

CI / quality (push) Successful in 3m38s

Details

CI / typecheck (push) Successful in 3m57s

Details

CI / security (push) Successful in 4m5s

Details

CI / integration_tests (push) Successful in 6m19s

Details

CI / unit_tests (push) Successful in 7m35s

Details

CI / docker (push) Successful in 1m21s

Details

CI / coverage (push) Successful in 10m48s

Details

CI / status-check (push) Successful in 1s

Details

9aad085b74

ISSUES CLOSED: #9355

fix(tests): fix create_template_db.py to create writable SQLite template database

CI / push-validation (push) Successful in 17s

Details

CI / helm (push) Successful in 23s

Details

CI / build (push) Successful in 30s

Details

CI / lint (push) Successful in 43s

Details

CI / quality (push) Successful in 48s

Details

CI / typecheck (push) Successful in 53s

Details

CI / security (push) Successful in 53s

Details

CI / e2e_tests (push) Successful in 3m22s

Details

CI / integration_tests (push) Successful in 6m42s

Details

CI / unit_tests (push) Successful in 7m47s

Details

CI / docker (push) Successful in 1m31s

Details

CI / coverage (push) Successful in 12m9s

Details

CI / status-check (push) Successful in 1s

Details

CI / push-validation (pull_request) Successful in 13s

Details

CI / helm (pull_request) Successful in 26s

Details

CI / lint (pull_request) Successful in 33s

Details

CI / build (pull_request) Successful in 38s

Details

CI / quality (pull_request) Successful in 45s

Details

CI / typecheck (pull_request) Successful in 49s

Details

CI / security (pull_request) Successful in 55s

Details

CI / e2e_tests (pull_request) Successful in 3m38s

Details

CI / integration_tests (pull_request) Successful in 6m40s

Details

CI / unit_tests (pull_request) Successful in 7m44s

Details

CI / docker (pull_request) Successful in 10s

Details

CI / coverage (pull_request) Successful in 12m9s

Details

CI / status-check (pull_request) Successful in 3s

Details

4c0f3e1da9

Added os.chmod(db_path, 0o664) after database creation to ensure the template
database has writable permissions. This prevents sqlite3.OperationalError: attempt
to write a readonly database when tests copy and modify the template during test
setup.

The template database is now created with rw-rw-r-- (664) permissions instead of
the default rw-r--r-- (644), allowing the test runner process to write to it.

ISSUES CLOSED: #9372

fix(cli): --format color now emits ANSI-coloured output instead of plain text

CI / quality (pull_request) Successful in 20s

Details

CI / push-validation (pull_request) Successful in 21s

Details

CI / build (pull_request) Successful in 24s

Details

CI / helm (pull_request) Successful in 24s

Details

CI / lint (pull_request) Successful in 37s

Details

CI / security (pull_request) Successful in 51s

Details

CI / typecheck (pull_request) Successful in 52s

Details

CI / integration_tests (pull_request) Successful in 4m50s

Details

CI / unit_tests (pull_request) Successful in 6m25s

Details

CI / e2e_tests (pull_request) Successful in 6m29s

Details

CI / docker (pull_request) Successful in 1m33s

Details

CI / coverage (pull_request) Successful in 12m43s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / helm (push) Successful in 21s

Details

CI / build (push) Successful in 25s

Details

CI / push-validation (push) Successful in 25s

Details

CI / quality (push) Successful in 42s

Details

CI / lint (push) Successful in 43s

Details

CI / typecheck (push) Successful in 51s

Details

CI / security (push) Successful in 52s

Details

CI / integration_tests (push) Successful in 4m21s

Details

CI / e2e_tests (push) Successful in 4m27s

Details

CI / unit_tests (push) Successful in 5m17s

Details

CI / docker (push) Successful in 1m31s

Details

CI / coverage (push) Successful in 10m55s

Details

CI / status-check (push) Successful in 1s

Details

b752dd485f

Route the COLOR format option through format_output_session (which uses
ColorMaterializer) instead of _format_plain. Previously --format color
produced identical output to --format plain because both were routed to
the same plain-text formatter. All other formats (plain, json, yaml,
rich, table) remain unaffected.

Updated CHANGELOG.md with the fix entry and CONTRIBUTORS.md with HAL 9000
contribution details.

ISSUES CLOSED: #7910

fix(concurrency): add thread safety to ContextTierService 0f75429253

Implemented thread-safety improvements for ContextTierService by
introducing a re-entrant lock and guarding all critical sections
with self._lock. This prevents RuntimeError: dictionary changed
size during iteration under concurrent plan execution.

- Added threading.RLock to ContextTierService.__init__ as self._lock
- Wrapped all public methods (store, get, promote, demote, evict_lru,
  get_metrics, get_all_fragments, get_hot_fragments, get_for_actor,
  get_scoped_view) with with self._lock:
- Added _lock: threading.RLock type stub to TierRuntimeMixin and
  ScopedTierMixin
- Wrapped enforce_staleness in TierRuntimeMixin with self._lock
- Wrapped get_scoped_by_resource and get_scoped_metrics in
  ScopedTierMixin with self._lock
- Extracted settings helpers to new context_tier_settings.py to keep
  context_tiers.py under 500 lines
- Added BDD feature file context_tier_thread_safety.feature with
  10 thread-safety scenarios
- Added step definitions context_tier_thread_safety_steps.py
- Updated CHANGELOG.md with fix entry

ISSUES CLOSED: #7547

fix(concurrency): protect validate_fragment_scope with lock and update CONTRIBUTORS

CI / push-validation (pull_request) Successful in 20s

Details

CI / helm (pull_request) Successful in 24s

Details

CI / build (pull_request) Successful in 27s

Details

CI / lint (pull_request) Successful in 29s

Details

CI / quality (pull_request) Successful in 44s

Details

CI / typecheck (pull_request) Successful in 54s

Details

CI / security (pull_request) Successful in 55s

Details

CI / e2e_tests (pull_request) Successful in 3m4s

Details

CI / unit_tests (pull_request) Successful in 9m51s

Details

CI / integration_tests (pull_request) Successful in 9m53s

Details

CI / docker (pull_request) Successful in 1m31s

Details

CI / coverage (pull_request) Successful in 16m31s

Details

CI / status-check (pull_request) Successful in 2s

Details

CI / quality (push) Successful in 17s

Details

CI / push-validation (push) Successful in 17s

Details

CI / helm (push) Successful in 25s

Details

CI / build (push) Successful in 26s

Details

CI / lint (push) Successful in 36s

Details

CI / typecheck (push) Successful in 46s

Details

CI / security (push) Successful in 50s

Details

CI / e2e_tests (push) Successful in 3m19s

Details

CI / integration_tests (push) Successful in 4m23s

Details

CI / unit_tests (push) Successful in 5m7s

Details

CI / docker (push) Successful in 1m19s

Details

CI / coverage (push) Successful in 11m0s

Details

CI / status-check (push) Successful in 1s

Details

b43ba41f6d

- Wrapped validate_fragment_scope() body with self._lock to prevent
  RuntimeError: dictionary changed size during iteration when another
  thread mutates the tier stores during scope validation
- Updated CONTRIBUTORS.md to document HAL 9000's concurrency safety
  contributions including thread-safe context tier management (issue #7547)

Fixes review feedback from PR #8279.

ISSUES CLOSED: #7547

fix(testing): print behave-parallel worker logs only for failed chunks

CI / lint (pull_request) Successful in 20s

Details

CI / helm (pull_request) Successful in 33s

Details

CI / push-validation (pull_request) Successful in 21s

Details

CI / quality (pull_request) Successful in 3m36s

Details

CI / build (pull_request) Successful in 3m44s

Details

CI / typecheck (pull_request) Successful in 4m30s

Details

CI / security (pull_request) Successful in 4m37s

Details

CI / e2e_tests (pull_request) Successful in 6m54s

Details

CI / unit_tests (pull_request) Successful in 9m46s

Details

CI / integration_tests (pull_request) Successful in 9m51s

Details

CI / docker (pull_request) Successful in 1m33s

Details

CI / coverage (pull_request) Successful in 10m53s

Details

CI / status-check (pull_request) Successful in 0s

Details

CI / security (push) Successful in 41s

Details

CI / helm (push) Successful in 31s

Details

CI / push-validation (push) Successful in 33s

Details

CI / lint (push) Successful in 3m17s

Details

CI / build (push) Successful in 3m16s

Details

CI / quality (push) Successful in 3m38s

Details

CI / typecheck (push) Successful in 4m16s

Details

CI / e2e_tests (push) Successful in 6m35s

Details

CI / unit_tests (push) Successful in 10m18s

Details

CI / integration_tests (push) Successful in 10m21s

Details

CI / docker (push) Successful in 1m36s

Details

CI / coverage (push) Successful in 10m47s

Details

CI / status-check (push) Successful in 0s

Details

8b2e0c81c5

In parallel mode, the behave runner previously replayed captured
stdout/stderr for every worker chunk, creating noisy output that
obscured failure diagnostics in CI and local runs.

Changes to scripts/run_behave_parallel.py:

- Added _chunk_has_failures() and _chunk_no_scenarios_ran() helpers
  to evaluate individual chunk summaries for failure/error/crash
  conditions.

- Updated the aggregation loop in main() to conditionally replay
  captured stdout/stderr only for chunks whose summary indicates
  failures, errors, or no scenarios ran (crash detection).  Passing
  chunks now suppress their output entirely.

- Added robust exception handling in _worker_run_features() so that
  worker crashes produce a full traceback in stderr and return a crash
  summary with features.errors = 1, enabling the parent to detect the
  crash via _chunk_has_failures (and also _chunk_no_scenarios_ran,
  since no scenarios reached a terminal state) and replay the
  diagnostics.

- The conditional replay uses summary-based checks rather than the
  raw runner.run() boolean, consistent with the existing exit-code
  logic.  This avoids spurious log replay for @tdd_expected_fail
  scenarios whose runner.run() returns True even though the TDD
  inversion handler has corrected the scenario status to passed.

- Existing summary merge, exit semantics, and the no-scenarios
  safety net are fully preserved.

New Behave unit tests (17 scenarios) cover the chunk-level helpers,
the conditional aggregation loop, the pure no-scenarios-ran path,
stderr replay for non-crash failed chunks, and the worker crash path.
New Robot integration tests (6 test cases) verify the same behavior
end-to-end via the helper_behave_parallel_log_filtering.py script.

Also updated:
- CHANGELOG.md: add unreleased entry for this behavioral change.
- features/steps/behave_parallel_log_filtering_steps.py: use
  contextlib.redirect_stdout/redirect_stderr instead of manual
  sys.stdout assignment; register module in sys.modules; document
  CWD requirement in _load_runner_module().
- robot/helper_behave_parallel_log_filtering.py: move import io to
  top-level; remove redundant inline imports; use contextlib for
  output capture; register module in sys.modules; document CWD
  requirement.

Branch note: the canonical branch for this fix is
bugfix/m3-behave-parallel-failed-chunk-logs.  The PR head branch
(bugfix/mX-behave-parallel-failed-chunk-logs) cannot be renamed via
the Forgejo API; both branches are kept in sync at the same SHA.

ISSUES CLOSED: #8351

feat(skills): add exhaustive Forgejo REST API agent skill

CI / lint (push) Successful in 20s

Details

CI / quality (push) Successful in 20s

Details

CI / helm (push) Successful in 24s

Details

CI / build (push) Successful in 24s

Details

CI / push-validation (push) Successful in 39s

Details

CI / security (push) Successful in 1m1s

Details

CI / e2e_tests (push) Successful in 3m13s

Details

CI / typecheck (push) Successful in 4m23s

Details

CI / unit_tests (push) Successful in 6m44s

Details

CI / integration_tests (push) Successful in 6m49s

Details

CI / docker (push) Successful in 11s

Details

CI / coverage (push) Successful in 6m56s

Details

CI / status-check (push) Successful in 0s

Details

237e776951

Adds a comprehensive opencode skill under .opencode/skills/forgejo-api/
covering all 473 Forgejo REST API endpoints across 25 reference categories.

- 78 files, 23,000+ lines, 149 distinct path parameter types
- Every curl command parameterised ({owner}/{repo}/{index}/etc) and
  tested against the live git.cleverthis.com server
- SKILL.md: 917-line entry point with quick-answer curl commands (35),
  jq cheat sheet for chaining API calls, 14 decision trees, 12 critical
  concepts (exclusive labels, lazy mergeability, SHA locking, auto-close
  keywords, search envelope differences, 412 stale-edit protection), full
  HTTP status code table, and environment variable reference
- references/pull-requests/: CRUD, 6 merge styles, automerge, server-side
  rebase without local clone, inline review comments, diff/patch
- references/issues/: comments, reactions, attachments, dependencies,
  time tracking, stopwatches, pinning
- references/labels/: repo + org labels, exclusive label groups,
  GET/POST/PUT/DELETE on issues and PRs
- references/ci-actions/ + references/commit-statuses/: workflow runs,
  dispatch, secrets, variables, quality gate verification
- references/web-interface/ci-logs.md: step-by-step CI log access via
  CSRF web session (not available through REST API)
- references/complex-workflows/: 10 multi-step recipes including
  PR review cycle, issue lifecycle, CI status check, server-side rebase,
  automerge, release workflow, org setup, fork contribution

fix(testing): document and harden non-AssertionError guard in apply_tdd_inversion to reduce flaky CI

CI / lint (pull_request) Successful in 18s

Details

CI / build (pull_request) Successful in 17s

Details

CI / helm (pull_request) Successful in 18s

Details

CI / quality (pull_request) Successful in 53s

Details

CI / typecheck (pull_request) Successful in 56s

Details

CI / security (pull_request) Successful in 57s

Details

CI / push-validation (pull_request) Successful in 40s

Details

CI / unit_tests (pull_request) Successful in 3m13s

Details

CI / integration_tests (pull_request) Successful in 4m23s

Details

CI / e2e_tests (pull_request) Successful in 4m37s

Details

CI / docker (pull_request) Successful in 1m34s

Details

CI / coverage (pull_request) Successful in 10m46s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / lint (push) Successful in 17s

Details

CI / quality (push) Successful in 17s

Details

CI / build (push) Successful in 24s

Details

CI / helm (push) Successful in 24s

Details

CI / push-validation (push) Successful in 37s

Details

CI / typecheck (push) Successful in 52s

Details

CI / security (push) Successful in 52s

Details

CI / e2e_tests (push) Successful in 3m13s

Details

CI / unit_tests (push) Successful in 6m37s

Details

CI / integration_tests (push) Successful in 6m39s

Details

CI / docker (push) Successful in 1m35s

Details

CI / coverage (push) Successful in 11m13s

Details

CI / status-check (push) Successful in 1s

Details

f67e8a2e07

Surface the non-AssertionError guard warning in standard Behave output by emitting to stderr in addition to the structured logger, and add infrastructure coverage that asserts this guard path is visible during test runs. Document the @tdd_expected_fail expectation that bug-signaling failures must use AssertionError so infrastructure exceptions are not accidentally treated as expected bug failures.

ISSUES CLOSED: #8294

fix(security): fix plugins/loader.py load_from_entry_point security #7476 717a05f604

Parse entry point targets before import so allowlist enforcement happens prior to execution and add a Behave regression scenario covering the disallowed-prefix path.

ISSUES CLOSED: #7476

fix(security): add Robot integration test and CONTRIBUTORS for #7476 cef70ff98a

Add Robot Framework integration test verifying that load_from_entry_points
does not call ep.load() for entry points with disallowed module prefixes
(security regression test for issue #7476).

Also add HAL 9000 to CONTRIBUTORS.md per CONTRIBUTING.md process rules.

ISSUES CLOSED: #7476

docs(contributors): add HAL 9000 to contributors list 74ce88caeb

ISSUES CLOSED: #7476

docs(contributors): add HAL 9000 plugin security hardening contribution detail

CI / build (pull_request) Successful in 16s

Details

CI / helm (pull_request) Successful in 16s

Details

CI / push-validation (pull_request) Successful in 11s

Details

CI / lint (pull_request) Successful in 38s

Details

CI / typecheck (pull_request) Successful in 50s

Details

CI / security (pull_request) Successful in 51s

Details

CI / e2e_tests (pull_request) Successful in 2m14s

Details

CI / quality (pull_request) Successful in 3m44s

Details

CI / integration_tests (pull_request) Successful in 6m17s

Details

CI / unit_tests (pull_request) Successful in 7m32s

Details

CI / docker (pull_request) Successful in 56s

Details

CI / coverage (pull_request) Successful in 12m31s

Details

CI / status-check (pull_request) Successful in 1s

Details

46ed31930e

Added detail entry for HAL 9000's contribution to the plugin entry point security hardening fix (#7476).

ISSUES CLOSED: #7476

fix(security): harden plugin entry point loading (#7785 )

CI / lint (push) Successful in 18s

Details

CI / helm (push) Successful in 17s

Details

CI / build (push) Successful in 30s

Details

CI / typecheck (push) Successful in 40s

Details

CI / quality (push) Successful in 49s

Details

CI / push-validation (push) Successful in 36s

Details

CI / security (push) Successful in 1m1s

Details

CI / e2e_tests (push) Successful in 3m27s

Details

CI / integration_tests (push) Successful in 4m2s

Details

CI / unit_tests (push) Successful in 5m42s

Details

CI / docker (push) Successful in 1m7s

Details

CI / coverage (push) Successful in 12m44s

Details

CI / status-check (push) Successful in 1s

Details

9178ba5f91

Enforce entry point allowlist validation before importing plugin modules, add explicit parsing helper, Robot Framework security regression test, and Behave security regression coverage. Documents the security fix in the changelog.

Closes #7476

chore(agents): improve pr-review-pool-supervisor — fix tracking prefix mismatch causing duplicate issues

CI / lint (pull_request) Successful in 18s

Details

CI / build (pull_request) Successful in 17s

Details

CI / helm (pull_request) Successful in 18s

Details

CI / push-validation (pull_request) Successful in 9s

Details

CI / quality (pull_request) Successful in 52s

Details

CI / typecheck (pull_request) Successful in 56s

Details

CI / security (pull_request) Successful in 57s

Details

CI / e2e_tests (pull_request) Successful in 5m9s

Details

CI / coverage (pull_request) Successful in 5m34s

Details

CI / integration_tests (pull_request) Successful in 6m36s

Details

CI / unit_tests (pull_request) Successful in 11m29s

Details

CI / docker (pull_request) Successful in 1m18s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / build (push) Successful in 15s

Details

CI / helm (push) Successful in 16s

Details

CI / push-validation (push) Successful in 11s

Details

CI / security (push) Successful in 30s

Details

CI / lint (push) Successful in 39s

Details

CI / quality (push) Successful in 42s

Details

CI / typecheck (push) Successful in 53s

Details

CI / integration_tests (push) Successful in 6m35s

Details

CI / coverage (push) Successful in 5m44s

Details

CI / e2e_tests (push) Successful in 6m48s

Details

CI / unit_tests (push) Successful in 7m41s

Details

CI / docker (push) Successful in 1m19s

Details

CI / status-check (push) Successful in 1s

Details

bdbfb39e45

Approved proposal: #7602
Pattern: workflow_fix
Evidence: Watchdog (Cycle 15, #7587) reports HIGH severity systemic issue —
AUTO-REV-SUP creating 10+ duplicate tracking issues per cycle. Root cause:
agent definition uses AUTO-REV-POOL prefix in ATM calls but actual issues
use AUTO-REV-SUP prefix. ATM cannot find/close old issues → duplicates.
Fix: Updated all tracking prefix references from AUTO-REV-POOL to AUTO-REV-SUP
and tracking type from 'Review Pool Status' to 'PR Review Pool Status'.

ISSUES CLOSED: #7602

# Conflicts:
#	.opencode/agents/pr-review-pool-supervisor.md

fix(tests): resolve nox unit_tests timeout for agent_skills_loader and skill_search features

CI / build (pull_request) Successful in 17s

Details

CI / helm (pull_request) Successful in 17s

Details

CI / push-validation (pull_request) Successful in 10s

Details

CI / lint (pull_request) Successful in 39s

Details

CI / quality (pull_request) Successful in 50s

Details

CI / typecheck (pull_request) Successful in 52s

Details

CI / security (pull_request) Successful in 53s

Details

CI / e2e_tests (pull_request) Successful in 2m13s

Details

CI / coverage (pull_request) Successful in 5m35s

Details

CI / integration_tests (pull_request) Successful in 6m40s

Details

CI / unit_tests (pull_request) Successful in 7m38s

Details

CI / docker (pull_request) Successful in 1m45s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / lint (push) Successful in 16s

Details

CI / quality (push) Successful in 17s

Details

CI / build (push) Successful in 23s

Details

CI / helm (push) Successful in 24s

Details

CI / typecheck (push) Successful in 53s

Details

CI / security (push) Successful in 53s

Details

CI / push-validation (push) Successful in 38s

Details

CI / e2e_tests (push) Successful in 3m14s

Details

CI / unit_tests (push) Successful in 6m37s

Details

CI / integration_tests (push) Successful in 6m39s

Details

CI / docker (push) Successful in 12s

Details

CI / coverage (push) Successful in 10m53s

Details

CI / status-check (push) Successful in 1s

Details

b8732dfc6f

Adjusted test running and file-detection logic to stabilize unit tests in overlayfs environments and improve target feature handling.

- Modified scripts/run_behave_parallel.py to run sequentially when there are 2 or fewer feature files, avoiding fork deadlocks on overlayfs and reducing nox-based unit test timeouts for agent_skills_loader and skill_search features.
- Updated noxfile.py to correctly detect feature files in posargs, fixing the prior logic that appended the "features/" directory when specific feature files were provided. This ensures precise test selection and avoids unnecessary path expansion.

Rationale:
These changes address the root causes of flaky unit test timeouts by preventing problematic forking behavior with small feature sets and by ensuring nox respects explicitly provided feature file paths.

ISSUES CLOSED: #9374

spec: document JSON-RPC 2.0 A2A wire format (AUTO-ARCH-8)

CI / push-validation (pull_request) Successful in 10s

Details

CI / helm (pull_request) Successful in 25s

Details

CI / build (pull_request) Successful in 26s

Details

CI / lint (pull_request) Successful in 28s

Details

CI / quality (pull_request) Successful in 53s

Details

CI / typecheck (pull_request) Successful in 56s

Details

CI / security (pull_request) Successful in 57s

Details

CI / e2e_tests (pull_request) Successful in 3m19s

Details

CI / unit_tests (pull_request) Successful in 7m21s

Details

CI / integration_tests (pull_request) Successful in 7m23s

Details

CI / docker (pull_request) Successful in 55s

Details

CI / coverage (pull_request) Successful in 11m13s

Details

CI / status-check (pull_request) Successful in 2s

Details

CI / lint (push) Successful in 37s

Details

CI / quality (push) Successful in 43s

Details

CI / typecheck (push) Successful in 52s

Details

CI / security (push) Successful in 53s

Details

CI / build (push) Successful in 19s

Details

CI / push-validation (push) Successful in 29s

Details

CI / helm (push) Successful in 31s

Details

CI / e2e_tests (push) Successful in 3m16s

Details

CI / unit_tests (push) Successful in 7m9s

Details

CI / integration_tests (push) Successful in 7m13s

Details

CI / docker (push) Successful in 8s

Details

CI / coverage (push) Successful in 10m54s

Details

CI / status-check (push) Successful in 1s

Details

835bc580e2

Updates the A2A Protocol section to reflect the rename of A2aRequest/
A2aResponse fields to standard JSON-RPC 2.0 names (method, id, result,
error). Documents A2aVersionNegotiator for backward compatibility.

Closes #8787

fix(agents): make bug-hunt-pool-supervisor tracking non-blocking to prevent initialization hangs

CI / push-validation (pull_request) Successful in 17s

Details

CI / lint (pull_request) Successful in 18s

Details

CI / helm (pull_request) Successful in 33s

Details

CI / typecheck (pull_request) Successful in 45s

Details

CI / quality (pull_request) Successful in 3m42s

Details

CI / build (pull_request) Successful in 3m43s

Details

CI / integration_tests (pull_request) Successful in 4m1s

Details

CI / security (pull_request) Successful in 4m23s

Details

CI / e2e_tests (pull_request) Successful in 6m53s

Details

CI / unit_tests (pull_request) Successful in 8m28s

Details

CI / docker (pull_request) Successful in 14s

Details

CI / coverage (pull_request) Successful in 14m27s

Details

CI / status-check (pull_request) Successful in 1s

Details

CI / push-validation (push) Successful in 17s

Details

CI / helm (push) Successful in 24s

Details

CI / quality (push) Successful in 50s

Details

CI / security (push) Successful in 59s

Details

CI / lint (push) Successful in 3m17s

Details

CI / build (push) Successful in 3m17s

Details

CI / typecheck (push) Successful in 4m31s

Details

CI / unit_tests (push) Successful in 5m39s

Details

CI / docker (push) Successful in 1m33s

Details

CI / integration_tests (push) Successful in 7m23s

Details

CI / e2e_tests (push) Successful in 7m39s

Details

CI / coverage (push) Successful in 5m36s

Details

CI / status-check (push) Successful in 0s

Details

777a4eae43

Build: Added back the benchmark tests but made them so they only run after pull requests into master

CI / push-validation (push) Successful in 17s

Details

CI / build (push) Successful in 18s

Details

CI / helm (push) Successful in 25s

Details

CI / quality (push) Successful in 42s

Details

CI / typecheck (push) Successful in 49s

Details

CI / e2e_tests (push) Successful in 2m39s

Details

CI / lint (push) Successful in 3m25s

Details

CI / security (push) Successful in 4m4s

Details

CI / integration_tests (push) Successful in 9m45s

Details

CI / unit_tests (push) Successful in 10m32s

Details

CI / docker (push) Successful in 21s

Details

CI / coverage (push) Has been cancelled

Details

CI / benchmark-publish (push) Has been cancelled

Details

CI / status-check (push) Has been cancelled

Details

CI / benchmark-regression (push) Has been cancelled

Details

19664f8162

feat(skills): add exhaustive programming patterns agent skill 98680825a7

Adds a comprehensive opencode skill under .opencode/skills/programming-patterns/
covering 90+ design, architectural, concurrency, and functional patterns.

- 108 files, 33,500+ lines across 13 reference categories
- Every pattern: pseudocode + tested Python + Go + JavaScript implementations
- All 239 code blocks verified passing (85 Python, 76 Go, 78 JS)
- 1,569-line SKILL.md with:
  - Master decision tree (all 14 categories with multi-pattern suggestions)
  - 34 situation-specific decision trees covering every programming scenario
    (new feature, refactoring, REST API, CLI, data pipeline, rule engine,
     external integration, performance, memory, notifications, plugins,
     caching, events, auth, reporting, vendor lock-in, domain model,
     business rules, observability, file I/O, scheduling, testability,
     third-party libs, concurrency, complex domain interactions)
  - 12 compound pattern-combination scenarios with full architecture maps
    (e-commerce checkout, REST endpoint, background jobs, real-time dashboard,
     microservice resilience, ML pipeline, text editor, legacy migration,
     multi-tenant SaaS, document approval, financial transactions, chat)
  - 'When Am I Allowed to Skip Patterns?' mandate table (answer: never)
  - Quick pattern lookup tables for all 90+ patterns
  - Complete reference index

docs(skills): expand programming-patterns SKILL.md with 17 new decision trees and 6 new scenarios 81b318f07d

- 476 → 1,569 → 2,244 lines total growth
- Decision trees: 6 → 34 → 51 situation-specific trees
- Compound scenarios: 12 → 18 full architecture maps
- New trees added in this pass:
    search/discovery, game/simulation, feature flags, subscriptions/billing,
    soft delete/archiving, i18n/localization, database optimization,
    AI agents/LLM systems, file upload/media, CMS, OAuth2/SSO,
    graph traversal, audit/compliance, real-time collaboration,
    API versioning, bulk/batch processing, pagination/filtering,
    webhook delivery (17 new trees)
- New scenarios added:
    user registration with email verification, faceted search,
    feature flag system, shopping cart with session, rate limiting
    infrastructure, AI agent with tool use (6 new scenarios)
- New sections:
    'Pattern Progression' (5-stage evolution for a data service and flag)
    'Minimum Pattern Set per Component Type' (table of 15 component types)
    'When to Skip Patterns — Never' table expanded to 25 rationalizations
- All file references validated (0 broken)

docs(skills): expand programming-patterns SKILL.md — 16 new trees, 5 new scenarios, 2 new sections d33508e8c9

Growth: 2,244 → 2,811 lines

New decision trees added (51 → 67):
  GraphQL API, data validation & sanitization, payment processing,
  booking/reservation systems, recommendation engines, distributed locking,
  graceful startup/shutdown lifecycle, schema/data migration, SaaS onboarding
  wizards, social graphs, API SDK design, multi-step form/wizard UI,
  user preferences management, microservice chassis/platform,
  dependency injection containers, import/export systems

New compound scenarios added (18 → 23):
  Scenario 19: Payment processing with provider failover
  Scenario 20: Hotel/resource booking with concurrent hold resolution
  Scenario 21: SaaS onboarding with multi-tenant provisioning
  Scenario 22: Distributed rate limiting across service instances
  Scenario 23: Recommendation engine with A/B testing and fallback

New sections added:
  '🔗 Pattern Synergies' — 20-row table of patterns that are always better together
  '✅ Code Review Checklist' — 30-item pattern-coverage checklist for PRs
    covering every class, service, external call, conditional, loop, and test

fix(skills): comprehensive final audit pass for programming-patterns skill d9e5668cec

Fixes and improvements from exhaustive audit:

Consistency fixes in SKILL.md:
- 'Pipe & Filter' → 'Pipe and Filter' (one stray '&' found and corrected)
- 'Singleton for factory instance' → clarified to 'register factory as
  singleton-scoped via DI container' (less misleading wording)
- Documentation Format section updated with note that SKILL.md itself is the
  authoritative source for related-pattern combinations

Coverage fix — Related Patterns sections:
- Added '## Related Patterns' to ALL 94 pattern files (was 0/94)
- Each section lists 3–6 related patterns with relationship descriptions
- Covers: why they're related, when to prefer one vs the other,
  and which are often confused

SOLID principles → Creational → Structural → Behavioral → Architectural →
Concurrency → Functional → Resilience → Data Access → Messaging →
Testing → Error Handling → Microservice — all 13 categories covered

Code verification:
- Python: 0 failures (all 85 testable blocks pass)
- Go: 0 failures (all 76 testable blocks pass)
- JavaScript: 0 failures (all 78 testable blocks pass)
- All 239 code blocks verified correct after edits

Final skill state:
- 108 files, 36,524 lines across 13 reference categories
- 94/94 pattern files have Related Patterns sections
- 2,815-line SKILL.md with 67 decision trees, 23 scenarios,
  0 broken references, 0 naming inconsistencies

fix(agents): improve label-manager permissions, merge supervisor clarity, and product-builder variable naming

CI / push-validation (push) Successful in 24s

Details

CI / helm (push) Successful in 26s

Details

CI / build (push) Successful in 26s

Details

CI / lint (push) Successful in 27s

Details

CI / e2e_tests (push) Successful in 3m6s

Details

CI / quality (push) Successful in 3m41s

Details

CI / typecheck (push) Successful in 4m0s

Details

CI / security (push) Successful in 4m37s

Details

CI / integration_tests (push) Successful in 9m36s

Details

CI / unit_tests (push) Successful in 10m51s

Details

CI / docker (push) Successful in 1m19s

Details

CI / coverage (push) Successful in 13m51s

Details

CI / status-check (push) Successful in 1s

Details

CI / benchmark-publish (push) Successful in 1h14m4s

Details

CI / benchmark-regression (push) Has been skipped

Details

21b831e35d

forgejo-label-manager.md:
- Refactored curl permission rules to use explicit allow/deny ordering with
  clear comments explaining each rule; consolidated overlapping deny patterns
- Switched to curl-only approach via forgejo-api skill (deny all Forgejo MCP tools)
- Added read: deny and skill forgejo-api: allow to enforce the curl-only model
- Clarified permission block structure: deny by default, specific allows per endpoint

pr-merge-pool-supervisor.md:
- Expanded 'What You Receive' section to list each field individually with bold
  labels for clarity (owner, repo, PAT, git email/name, briefing)

product-builder.md:
- Added 'Local Variable' column to the Required Information table so agents know
  the canonical variable names to reuse throughout prompts
- Added forgejo_url, forgejo_owner, and forgejo_repo as explicit gather targets
  with env var fallbacks and remote-detection instructions
- Added concrete remote URL parsing example showing how to extract host/owner/repo

feat(skills): add cleverthis-guidelines agent skill 8193545fb5

Adds a comprehensive opencode skill under .opencode/skills/cleverthis-guidelines/
covering every rule, regulation, directive, and guideline governing CleverThis
projects and company operations. Synthesized from CONTRIBUTING.md and the
CleverThis Operations Code (C.O.C. v0.4, 70 pages).

- 12 files, 2,384 lines across 11 reference categories
- 1,059-line SKILL.md with 16 exhaustive decision trees covering every
  procedural situation: writing code, committing, submitting PRs, code review,
  creating issues, applying labels, ticket state transitions, sprint planning,
  triaging, point estimation, definition of done, authoring documents,
  confidentiality classification, security/credentials, FOSS, bug fixes (TDD),
  architectural decisions, and escalation paths

Reference files cover:
  commits/     — Conventional Changelog format, atomic commit rules, pre-commit
  pull-requests/ — all 12 PR requirements, review process, merge criteria
  testing/     — BDD/Behave (unit), Robot Framework (integration), 97% coverage
                 threshold (project-specific), TDD bug fix workflow, mocking rules
  issue-tracking/ — ticket hierarchy (Issue→Epic→Legendary), all quality criteria,
                    mandatory issue sections, full label system (State/Type/Priority/
                    MoSCoW/Special), lifecycle flow, dependency direction rules
  sprints/     — DSDM, 6-stage triaging, MoSCoW, poker point estimation,
                 all 4 sprint ceremonies with complete rule sets
  code-style/  — SOLID, design patterns, import rules, error handling, type safety,
                 LangChain/LangGraph guidelines, v3 vs legacy plan lifecycle
  security/    — gopass-only password management, Yubikey rules, encryption policy
  information/ — 5 confidentiality stamp levels, document authoring, email conventions
  organizational/ — C-level hierarchy, ELB, OCRB, all 12 committees, personnel review
  project-tools/ — nox commands, CI/CD, release process, Hatch, Commitizen setup
  open-source/ — FOSS governance, open standards mandate, POSIX compliance

Key project-specific facts documented:
  - Coverage threshold: 97% (overrides C.O.C. baseline of 85%)
  - BDD framework: Behave (not pytest alone)
  - Type checker: Pyright (never disabled, no type: ignore)
  - Bug fixes: mandatory TDD workflow with @tdd_expected_fail tagging

docs(skill): expand cleverthis-guidelines SKILL.md with missing rules and new decision trees d21edb67f7

Added 11 new decision trees (branch naming, documentation traceability,
nox session guide, CI failure diagnosis, file organization, dev setup,
Issue/Epic/Legendary hierarchy, ticket well-scoped checklist, v3 vs
legacy plan workflow, release process, TDD issue-capture test detail).

Expanded existing trees with previously missing rules: specification-
first development mandate, file organization per directory, backwards
compatibility policy (none pre-v3.0.0), AssertionError-only rule for
TDD expected-fail steps, tdd/mN- and bugfix/mN- branch naming with
shared suffix requirement, different-assignees preference for TDD vs
fix, full CI job list with required-for-merge gates, all nox sessions
(e2e_tests, benchmark, benchmark_regression, complexity, docs, build).

Fixed PR approval count from 2 to 1 (project-specific override; self-
approval permitted per CONTRIBUTING.md). Updated Key Numbers table with
12 new rows covering CI triggers, release trigger, ULID format, required
CI jobs, backwards compat start, dependency direction, and more.

ISSUES CLOSED: #0

docs(skill): generalise SKILL.md for any programming language ca268e39fb

Remove project-specific src/cleveragents/ path (now src/<package>/ with
examples). Replace all bare nox/Pyright/ruff/Behave references with the
language-agnostic 'task runner / type checker / linter / BDD framework'
abstractions, keeping the project-specific tool as a parenthetical example.

Add ecosystem-equivalents reference table (Python, JS/TS, Java/Kotlin, Go)
in the Quick Command Reference. Generalise type-suppression rules across
languages (# type: ignore, @ts-ignore, @SuppressWarnings). Generalise
TDD assertion failure type requirement with Python, Java, and JS examples.
Generalise import rules, project manifest references, and directory layout
descriptions. Remove Python-only step-file naming; add multi-language
examples throughout.

LangChain/LangGraph and v3/legacy plan workflow sections are left as-is
and clearly labelled as project-specific.

ISSUES CLOSED: #0

docs(skill): remove CleverAgents-specific content and finish tool generalisation 85ac059489

Remove from SKILL.md and all reference files:
- 'Am I choosing between Legacy and v3 plan workflow?' decision tree
- LangChain/LangGraph sections (write-code tree, testing tree, code-style README, testing README)
- FakeListLLM / MemorySaver / TypedDict LangGraph references
- Backwards-compat pre-v3.0.0 policy block (project-version-specific)
- v3 ULID format and Backwards-compat-starts rows from Key Numbers table
- v3 Plan Lifecycle vs Legacy table from code-style README
- Master-tree branch pointing to the v3/legacy workflow tree

Generalise across all reference files:
- commits/README: pre-commit checklist uses 'task runner session (e.g. nox -s X)'
- pull-requests/README: fix approval count 2->1 with self-approval permitted;
  remove 'neither approver may be original author' (project allows self-approval);
  generalise automated-checks table command column
- testing/README: remove LangChain/LangGraph Testing section; generalise all
  bare nox commands with task-runner framing and language note at top
- code-style/README: rewrite General Principles to language-agnostic tooling
  guidance; generalise Import Guidelines with Python/Java/TS examples; rename
  and generalise Type Safety section; remove entire LangChain/LangGraph Best
  Practices section; remove entire v3 Plan Lifecycle vs Legacy section
- security/README: generalise bare nox -s security_scan reference
- issue-tracking/README: generalise subtask examples (Behave/nox)

ISSUES CLOSED: #0

docs(skill): final pass — remove remaining project-specific language, add multi-language examples

CI / push-validation (push) Successful in 18s

Details

CI / helm (push) Successful in 23s

Details

CI / build (push) Successful in 30s

Details

CI / lint (push) Successful in 32s

Details

CI / quality (push) Successful in 32s

Details

CI / typecheck (push) Successful in 1m2s

Details

CI / security (push) Successful in 1m3s

Details

CI / integration_tests (push) Successful in 4m3s

Details

CI / unit_tests (push) Successful in 5m37s

Details

CI / docker (push) Successful in 8s

Details

CI / e2e_tests (push) Successful in 7m25s

Details

CI / coverage (push) Successful in 10m51s

Details

CI / status-check (push) Successful in 1s

Details

CI / benchmark-publish (push) Has been cancelled

Details

CI / benchmark-regression (push) Has been cancelled

Details

caaafacf45

SKILL.md:
- Remove stale 'v3 vs legacy plan workflow' reference from frontmatter description
- Change all git tag version examples from project-specific v3.6.0 to generic v1.2.3
- Branch name example: upgrade-langchain -> upgrade-dependencies
- Documentation traceability module path: was Python-only example, now shows
  Python, Java, TypeScript, and Go side by side
- Task runner session tree: remove bare Python tool names from BEFORE SUBMITTING
  and SPECIFIC SITUATIONS subsections (bandit+semgrep+vulture, vulture, Radon,
  MkDocs, Robot Framework) — session descriptions are now tool-agnostic

project-tools/README.md:
- Full rewrite from Python-only reference to language-agnostic guide
- Adds language/tooling note at top explaining Python/nox as the project example
- Comprehensive equivalents table covering Python, JS/TS, Java/Kotlin, and Go
  for every concern (task runner, lint, format, type check, unit/integration tests,
  coverage, security scan, unused code, complexity, build, docs, benchmarks)
- Project environment management section with language comparison table
- Dependency caching section with per-language cache key patterns
- Configuration files section as a multi-language comparison table
- Development setup checklist shows nox/npm/gradlew/go alternatives side by side
- 'Always Runnable' section with examples in all four ecosystems
- git tag example: v3.6.0 -> v1.2.3 (generic)

testing/README.md:
- 'Never use stub/pass implementations' -> 'Never use empty/stub implementations
  (no no-op bodies)' — removes Python-keyword 'pass' used as if universal

ISSUES CLOSED: #0

docs(skill): add cleveragents-contributing project-specific skill 82471f87f0

New skill covering all CONTRIBUTING.md project-specific rules that
supplement or override the generic cleverthis-guidelines skill. Prominently
declares override precedence at the top of SKILL.md and in every reference
file — this skill's rules apply unconditionally when they conflict with
the general skill.

SKILL.md (839 lines) contains:
- Override notice table comparing cleverthis-guidelines vs this project
  for 18 specific topics (framework, directories, tool names, etc.)
- Master decision tree routing all project-specific situations
- 9 detailed decision trees: file placement (exact directories), tests
  (Behave/Robot Framework rules), TDD bug fix workflow (full 6-step with
  branch naming), TDD issue-capture test (exact three-tag system with
  AssertionError enforcement), nox session reference (all sessions + CI
  job mappings + required-for-merge), CI failure diagnosis (per-job
  remediation), plan CLI (v3 vs legacy, ULID, storage backends),
  LangChain/LangGraph code (TypedDict, MemorySaver, BaseLanguageModel,
  FakeListLLM, canonical node pattern), Python imports (top-of-file,
  TYPE_CHECKING exception), dev env setup (complete tool inventory), and
  release process (backwards compat policy, Docker, secrets)
- Key Numbers table with 33 project-specific values

Reference files (1254 lines total):
- testing/README.md: Behave rules, Robot Framework, TDD tag system with
  full examples and validation rules, AssertionError requirement, coverage
  threshold, mock placement, LangGraph testing, ASV benchmarks
- ci-cd/README.md: all 13 CI jobs with nox session mappings, 3 workflow
  triggers, 5 required-for-merge checks, secrets table, caching policy,
  nightly quality sweep, branch protection
- toolchain/README.md: complete nox session catalogue, Pyright prohibition
  on # type: ignore, ruff config, Hatch, pyproject.toml as single source,
  pre-commit setup, Commitizen, Python import rules with examples, error
  handling patterns with Python code
- langchain-langgraph/README.md: TypedDict state, verb-based node naming,
  MemorySaver, conditional edges, BaseLanguageModel abstraction, prompt
  templates, sync+async requirement, output parsing, env var configuration,
  LangSmith disabled by default, retry decorators, canonical node template,
  FakeListLLM testing, state/workflow/streaming/memory test patterns
- file-organization/README.md: exact directory map, per-directory rules and
  prohibitions, docs/specification.md authority, BDD step-file naming rules
- cli-workflow/README.md: v3 vs legacy comparison table, ULID format, why
  mixing is impossible (separate storage backends), error diagnosis for
  common failure modes, manual migration steps, backwards compat policy

ISSUES CLOSED: #0

docs(skill): expand cleveragents-contributing SKILL.md with 7 new trees and deep expansions

CI / push-validation (push) Successful in 17s

Details

CI / helm (push) Successful in 30s

Details

CI / lint (push) Successful in 34s

Details

CI / typecheck (push) Successful in 49s

Details

CI / security (push) Successful in 52s

Details

CI / build (push) Successful in 3m20s

Details

CI / quality (push) Successful in 3m39s

Details

CI / integration_tests (push) Successful in 4m10s

Details

CI / e2e_tests (push) Successful in 4m31s

Details

CI / unit_tests (push) Successful in 5m3s

Details

CI / docker (push) Successful in 8s

Details

CI / coverage (push) Successful in 10m47s

Details

CI / status-check (push) Successful in 1s

Details

CI / benchmark-publish (push) Has been cancelled

Details

CI / benchmark-regression (push) Has been cancelled

Details

38a2773261

Add 7 new decision trees covering gaps found in CONTRIBUTING.md audit:

'Am I creating an issue?' — full issue anatomy: mandatory Metadata section
(exact commit message first line + branch name), Subtasks checkbox format
with example, Definition of Done section, label rules (State/Unverified
+Type+Priority; MoSCoW by owner only), Ref field rules, parent and blocking
link mechanics via Forgejo dependencies, bug issues companion TDD issue rule.

'Am I about to write code?' — spec-first mandate (read docs/specification.md
before any code), ADR process for architectural changes, branch must match
issue Metadata, test-first requirement, SOLID + arg validation + type
annotations requirements, prohibited list (# type: ignore, half-done work,
mocks in src/, if-testing guards).

'Am I about to commit?' — self-review diff (git add -p), atomicity rules
(one logical change, no cosmetic+functional mixing, code-move then modify),
completeness rules (tests + docs + changelog + ancillary files in same
commit), bisect-friendly / revertibility requirements, prescribed commit
first line verbatim from issue Metadata, Commitizen usage, pre-commit hook
rules, commit hygiene (topic branches, interactive rebase before merging).

'Am I submitting a PR?' — all 12 PR requirements numbered, with critical
dependency direction rule (PR→blocks→issue; reversed = deadlock with full
explanation), closing keywords, one Epic per PR, milestone + Type/ label,
after-submission state transitions, complete merge checklist.

'Am I reviewing a PR?' — eligibility and approval rules, CI gate check,
all 6 reviewer criteria (correctness, spec alignment, test quality, type
safety, readability, performance, security, style, documentation, commit
quality), requesting changes protocol, maintainer override rule.

'Am I documenting something?' — single canonical surface rule, traceability
(module.class.method + commit hash; never file:linenum), same-commit rule,
code-level docstring requirements, spec.md authority.

'Am I writing error handling?' — mandatory argument validation pattern
(before ANY other logic) with Python code example, exception propagation
rules (never suppress, never bare except, never return None on error),
fail-fast principles, AssertionError for TDD expected-fail steps.

Expand existing trees:
- 'Am I writing tests?': add multi-level testing mandate (unit + integration +
  benchmarks required for every task), what tests must cover (error paths,
  edge cases, failure modes), test failure remediation rules
- 'Which nox session?': clarify format vs format --check difference
- 'Am I looking at a CI failure?': add quality/complexity failure diagnosis,
  common causes per job type, more detail on coverage and unit_tests failures
- 'Am I writing LangChain/LangGraph code?': clarify MemorySaver requirement,
  memory class selection (Buffer vs Entity), format prohibition reasoning
- 'Which directory?': add /benchmarks/ to directory tree

Update master decision tree with 6 new branches for new trees.
Update Key Numbers table with 4 new rows.
Update frontmatter description to cover all new topics.
Override highlights table: add commit first line and PR dep direction rows.

ISSUES CLOSED: #0

docs(skill): add cleveragents-spec skill from docs/specification.md 5a24b5bdf2

New skill covering every architectural concept, entity, workflow, CLI
command, and design decision from docs/specification.md (47,181 lines
read in full). Explains WHAT the system is intended to build.

SKILL.md (1,282 lines) — 18 decision trees:
- 'What am I working on?' master routing tree
- 'What is a Plan?' — 4 phases, reversion rules, hierarchy, decision tree
- 'How does a plan run?' — step-by-step Action→Strategize→Execute→Apply
- 'What is a Decision?' — 10 types, data model, dual tree+DAG structure,
  timing by phase, decision recording protocol
- 'How do I correct a plan?' — revert vs append modes, Strategize vs
  Execute correction mechanics, affected subtree computation
- 'What is an Invariant?' — 4 scopes, precedence chain (plan>action>
  project>global), non-overridable globals, Invariant Reconciliation Actor
- 'What is an Actor?' — LLM vs graph types, Jinja2+env-var preprocessing,
  specialized roles (strategy/execution/estimation/invariant)
- 'What is a Tool?' — 4 sources, capability metadata, 4-stage lifecycle,
  resource bindings and slots, anonymous tools, metadata overrides
- 'What is a Validation?' — Tool subtype, always read-only, required vs
  informational modes, 3 attachment scopes, wrapping existing tools
- 'What is a Skill?' — composition patterns, includes, tool overrides
- 'What is a Resource?' — physical vs virtual, 34+ built-in types, DAG,
  type inheritance, 5-sandbox strategies, 6-level execution env routing,
  devcontainer auto-discovery and lazy activation
- 'What is a Project?' — resource linking, multi-project plans, context
  config, execution environment
- 'Which automation profile applies?' — 8 built-in profiles, 11 flags,
  Safety Profile, Automation Guard, Semantic Escalation, progressive trust
- 'How does naming work?' — namespace format, types, ULID vs name identity
- 'Which CLI command do I use?' — every command group with key flags
- 'What is the architecture?' — 4 layers, 2 deployment modes, A2A
  protocol (full method routing, error codes, streaming), DI container
- 'What is the ACMS?' — UKO, CRP, 10-slot pipeline, hot/warm/cold tiers
- 'Which milestone am I in?' — v3.2.0–v3.8.0 status + cross-milestone invariants
- Key Numbers table (35 entries)

Reference files (1,974 lines across 9 files):
- plan-lifecycle: phase mechanics, decision tree schema, checkpoint triggers,
  child plan execution modes, merge strategies, plan identity fields
- entities: data models for Plan, Decision, Action, Session, Invariant,
  AutomationProfile, SafetyProfile, AutomationGuard, Namespace
- architecture: 4-layer diagram, deployment modes, complete A2A method
  routing tables (standard + plan + registry + context + sync + health),
  streaming events, authentication, error taxonomy, full tech stack
- automation-profiles: threshold table for all 8 built-in profiles, use
  cases, Semantic Escalation algorithm, custom profile YAML
- actors-tools-skills: Actor/Tool/Validation/Skill YAML schemas with
  complete annotated examples, Jinja2 filter reference, LSP integration
  detail, LSPToolAdapter, actor context precedence
- resources: complete resource type hierarchy (all 34+ types), sandbox
  strategies, type inheritance rules, execution environment routing,
  devcontainer integration, CLI usage
- acms: UKO 4-layer ontology, CRP, 10-slot Context Assembly Pipeline with
  per-slot component names, hot/warm/cold eviction rules, skeleton
  compression, context view configuration
- milestones: v3.2.0–v3.8.0 deliverables, architectural constraints, and
  definitions of done; cross-milestone quality gates and invariants
- cli-commands: complete CLI reference for all command groups with all
  flags: plan, action, session, project, actor, skill, tool, validation,
  resource, invariant, automation-profile, lsp, config, utility

ISSUES CLOSED: #0

docs(skill): expand cleveragents-contributing SKILL.md — 5 new trees, all trees deepened 05e4f7446c

Add 5 new decision trees:

'Should this be an Issue, Epic, or Legendary?' — hierarchy decision with
one-commit test, demonstrable-capability test, strategic-pillar test,
promotion/demotion rules, and quick self-test questions.

'Is this ticket well-scoped?' — all 11 quality criteria from CONTRIBUTING.md
(Atomicity, Single Commit, Single Responsibility, Assignability, Verifiability,
Self-Containment, Implementation Independence, Subtask Decomposition, Leaf Node,
Mandatory Parent, Finite Completion) each with pass/fail test.

'What ticket state should this be in?' — full lifecycle state machine
(Unverified → Verified → In progress → Paused → In review → Completed →
Wont Do) with who can perform each transition and what labels are required.

'Am I triaging a ticket?' — maintainer triage 7-step process (duplicate
check, validity assessment, completeness check, label assignment, milestone
assignment, parent linking, bug companion TDD issue check).

'What branch name should I use?' — branch naming rules with all prefixes
(feature/mN-, bugfix/mN-, tdd/mN-), source of milestone number N, kebab-
case rules, traceability requirement (shared suffix between tdd/ and bugfix/
branches), and examples.

Expand existing trees:

'Am I creating an issue?' — add 11 quality criteria summary, better
acceptance criteria examples (good vs bad), note on Metadata section
verbatim requirements.

'Am I about to write code?' — add SOLID principle explanations per letter,
add WIP management section (git stash vs draft commits), add ADR step detail.

'Am I about to commit?' — add cosmetic-first-then-functional guidance,
expand commit hygiene section with interactive rebase detail and goal of
clean history (no wip commits).

'Am I submitting a PR?' — add post-submission CI failure handling (new
commit not force-push), add major-change review handling (address every
comment).

'Am I reviewing a PR?' — add blocking vs suggestion vs question comment
distinction with examples, add approve-with-suggestions pattern, add no-wip-
commits check in commit quality section.

'Am I writing tests?' — add integration vs e2e distinction (integration =
real services; e2e = real LLM API keys), add Gherkin quality guidelines
(Given/When/Then semantics, scenario naming, one behavior per scenario),
add Hypothesis property-based testing section, expand test failure
remediation to include real-bug-triggers-TDD-workflow path.

'Am I looking at a CI failure?' — add integration_tests failure diagnosis,
add guidance for when unit test failure reveals a real bug (triggers full
TDD workflow), expand benchmark-regression failure guidance.

'Am I documenting something?' — add CHANGELOG entry format (good vs bad
examples), add ADR document structure (Title/Status/Context/Decision/
Consequences/Alternatives).

'Am I writing LangChain/LangGraph code?' — add RxPY reactive streams
section (Subject, BehaviorSubject, ReplaySubject, operators, backpressure).

'Am I releasing a new version?' — add 'when to bump' section noting most
PRs don't need bumps, add release failure recovery procedure (delete tag,
fix, re-tag).

Update master decision tree to add 5 new branches.
Update Key Numbers table: add benchmark regression threshold (10%),
cyclomatic complexity limit (>10), Hypothesis entry, benchmark regression
threshold, issue quality criteria count, bug priority rule, TDD assignee
preference.
Update frontmatter to document new coverage.

ISSUES CLOSED: #0

docs(skill): final pass — 2 new trees, expanded trees, reference file completions

CI / push-validation (push) Successful in 10s

Details

CI / helm (push) Successful in 28s

Details

CI / build (push) Successful in 29s

Details

CI / typecheck (push) Successful in 53s

Details

CI / lint (push) Successful in 3m44s

Details

CI / quality (push) Successful in 3m58s

Details

CI / security (push) Successful in 4m12s

Details

CI / e2e_tests (push) Successful in 4m49s

Details

CI / integration_tests (push) Successful in 6m47s

Details

CI / unit_tests (push) Successful in 8m3s

Details

CI / docker (push) Successful in 1m31s

Details

CI / coverage (push) Successful in 10m51s

Details

CI / status-check (push) Successful in 1s

Details

CI / benchmark-publish (push) Has been cancelled

Details

CI / benchmark-regression (push) Has been cancelled

Details

47b4c5fbfb

SKILL.md (1,878 → 2,099 lines, 23 → 25 decision trees):

New 'Is my work done?' tree — comprehensive Definition of Done checklist
synthesising all requirements across implementation, three-level testing
(unit/integration/benchmarks), coverage ≥ 97%, five CI quality checks,
commit anatomy (atomic, body, footer), documentation (changelog, docstrings,
CONTRIBUTORS.md), PR fields (description, dep direction, Epic scope, milestone,
Type label), CI checks, and issue state transitions.

New 'What design pattern should I use?' tree — all 24 patterns from
CONTRIBUTING.md categorised across Creational (Factory, Abstract Factory,
Builder, Prototype, Singleton, Object Pool, DI), Structural (Adapter, Bridge,
Composite, Decorator, Facade, Flyweight, Proxy, Module), Behavioral (Chain of
Responsibility, Command, Iterator, Mediator, Memento, Observer, State, Strategy,
Template Method, Visitor, Null Object), and Architectural (Repository, Unit of
Work, Service Layer, MVC, CQRS, Event Sourcing, Specification). Every pattern
includes a when-to-use description and a CleverAgents-specific example.

Expand 'Am I about to write code?' — link to new patterns tree.
Expand 'Am I writing tests?' — add And/But/Outline Gherkin keywords with
examples, add Scenario Outline explanation, add naming good/bad examples with
anti-pattern list, expand integration test guidance with what good integration
tests exercise (CLI, DB, filesystem, service layer), expand Hypothesis section
with 6 specific use cases and recommended strategies to build.
Expand 'Am I about to commit?' — improve commit body guidance with a worked
example showing what to write (context, why this approach, risks, caveats).
Expand 'Am I triaging?' — add Epic/Legendary triage rules (no point estimates,
no milestone assignment, sign-off labels required for closure).

Add two branches to master decision tree for new trees.

Reference files:

references/testing/README.md (187 → 296 lines):
- Add Gherkin Quality Guidelines section: Given/When/Then semantics table,
  Scenario Outline explanation with example, naming rules with good/bad table,
  common anti-patterns (implementation details, multiple behaviors, missing Then)
- Add Property-Based Testing (Hypothesis) section: when-to-use table with 6
  specific CleverAgents use cases, recommended strategies to build, integration
  with Behave step definitions with worked example

references/langchain-langgraph/README.md (307 → 375 lines):
- Add RxPY Reactive Streams section: Subject vs BehaviorSubject vs ReplaySubject
  decision table with when-to-use and code examples, key operators table with
  use cases and code examples, backpressure management patterns (debounce vs
  throttle_first with examples), and clear list of what RxPY is NOT for

references/toolchain/README.md (271 → 272 lines):
- Add Hypothesis to tool table (property-based testing, nox -s unit_tests)

references/ci-cd/README.md (124 → 131 lines):
- Fix project-specific version number in release example (v3.6.0 → generic
  v<MAJOR>.<MINOR>.<PATCH>)
- Add release failure recovery procedure (verify secrets → build locally →
  delete tag → fix → re-tag)

ISSUES CLOSED: #0

docs(skill): add cleveragents-system self-reflection skill 2dbcdc80ae

New skill covering the complete operational architecture of the CleverAgents
autonomous development system — how it runs, not what it builds.

SKILL.md (539 lines) with 9 decision trees:
- 'What does this agent do?' — prefix-to-agent mapping for all 17+1 supervisors
- 'Which supervisor owns this worker?' — reverse lookup from worker tags
- 'How many workers can this supervisor run?' — N_FULL/N_HALF/N_QUARTER formula
  with concrete examples at N=4, N=8, N=16
- 'How does a supervisor launch a worker?' — full dispatch flow including tier
  selector indirection, model inheritance, and credential inclusion
- 'Do I need to launch something asynchronously?' — when/why to use prompt_async
  vs synchronous calls; why only async-agent-manager calls localhost:4096
- 'How do I apply a label to an issue or PR?' — forbidden operations list;
  forgejo-label-manager delegation; org-level vs repo-level; never create
- 'How do I create a tracking issue or announcement?' — CREATE_TRACKING_ISSUE
  invariants; READ-then-CREATE startup order; announcement lifecycle
- 'Which announcements should I consume?' — full relevancy matrix per agent type
- 'How does state recovery work on startup?' — mandatory READ-then-CREATE protocol
  with urgency tiers based on offline duration
- 'Which model tier should I use?' — escalation decision logic with comment parsing
- 'How do credentials get to workers?' — env var hierarchy; why workers never
  read env vars; two bot accounts (primary + reviewer)
- 'Is something wrong with the system?' — diagnostic patterns

Complete supervisor registry table (17 supervisors + product-builder) with
prefixes, agent definitions, worker counts, sleep intervals, and tracking prefixes.
Key Numbers table (25 entries covering all timeouts, thresholds, and intervals).

Reference files (1,292 lines across 6 files):

agent-registry/README.md — full agent hierarchy diagram, all 17 pool supervisor
detailed entries (purpose, worker count, sleep, worker tag pattern, special
notes), worker-to-supervisor mapping table, utility subagent catalog (35+
entries), shared prompt fragment catalog.

async-operations/README.md — why prompt_async exists (fire-and-forget vs
blocking); complete OpenCode Server API reference (list sessions, create
session, prompt_async, get status, get messages, get specific session, delete
session) with curl examples and response shapes; full session naming convention
with all supervisor and worker tag patterns in a table; common operations
(starting supervisors/workers, checking status, detecting stuck sessions,
cleanup); error handling policy (retry 3×).

tracking-system/README.md — status vs announcement issue distinction; the
one-at-a-time invariant; cycle number uniqueness; rolling average interval
formula (0.90×old + 0.10×actual); CREATE_TRACKING_ISSUE step-by-step process;
mandatory startup recovery protocol (READ then CREATE, with wrong-order warning);
discovery patterns; announcement lifecycle; priority labels for announcements
and when to use each; label rules (NEVER create; org-level only; always use
forgejo-label-manager; forbidden Forgejo MCP tools list); forgejo-api skill
curl patterns for label operations; complete automation-tracking-manager
operations table.

tier-system/README.md — four model tiers (haiku/codex/sonnet/opus) with model
IDs, cost ranks, and use cases; how tier selectors work (pass-through inheritance
mechanism, full call chain diagram); progressive escalation decision table;
reading escalation history from attempt comments; human escalation trigger
(Opus×3 same-problem) and steps; default model assignments for all agents
grouped by model; runtime Gemini 2.5 Pro overrides.

credential-flow/README.md — all environment variables with required/optional/
default columns; auto-detection of FORGEJO_URL/OWNER/REPO from git remote;
credential hierarchy diagram; two bot accounts (primary vs reviewer) and why;
worker credential rules (NEVER read env vars; everything from prompt); what a
supervisor must include in every worker prompt; CA_MAX_PARALLEL_WORKERS
formula with examples at N=1/4/8/16; security notes.

coordination/README.md — claim protocol (CLAIM/HEARTBEAT/RELEASE comment
prefixes); claim lifecycle; expiry (2 hours without heartbeat); availability
check algorithm; exact comment formats for all three types; PR work conflict
matrix (code-change vs merge-attempt vs review); session-level deduplication
via tag search (primary mechanism); system-watchdog monitoring of violations;
startup deduplication by product-builder; bot signature formats; announcement
relevancy matrix quick reference table.

ISSUES CLOSED: #0

docs(skill): final pass — expand relevancy matrix, remove duplicated knowledge, cross-reference other skills e816e1af9e

SKILL.md:
- Expand 'Which announcements should I consume?' from a partial example
  (only showed IMP-SUP and said 'use agent-prefix-info for the rest') to the
  FULL canonical cross-agent attention table: all 17 supervisors plus
  product-builder, every source prefix with its minimum priority threshold
  and rationale, universal baseline rule, and rule-of-thumb note
- 'How do I apply a label?' tree: replace detailed label scope breakdown
  (State/, Priority/, MoSCoW/, Type/) and detailed forgejo-label-manager
  internals with cross-references to cleveragents-contributing and
  cleverthis-guidelines; keep only system-critical labels (Automation
  Tracking, needs feedback, Blocked) and the forbidden operations list
  (which is an agent permission concern unique to this system)
- 'How does a supervisor launch a worker?' Step 5: replace 'CONTRIBUTING.md
  rules (commit standards, testing, PR requirements)' with reference to the
  cleveragents-contributing skill, noting that product-builder pre-loads
  these via ref-reader and passes them in briefings
- Quick Reference: add explicit pointers to cleveragents-contributing and
  forgejo-api skills for the label-related lines
- Frontmatter description: rewrite Covers section to remove duplicated label
  scope and forgejo-api curl content; add explicit note that label rules and
  scopes live in cleveragents-contributing and cleverthis-guidelines, and
  that curl patterns are in the forgejo-api skill

tracking-system/README.md:
- Remove entire 'Label Rules (CRITICAL)' section (Never Create Labels, Only
  Org-Level Labels, Always Use forgejo-label-manager, scope conflict rules)
  — this is fully covered in cleveragents-contributing
- Remove entire 'The forgejo-api Skill and Label Operations' section
  (paginated org label fetch loop, PUT replace-all curl, DELETE single label
  curl) — this belongs in the forgejo-api skill which forgejo-label-manager
  loads automatically
- Replace both removed sections with a focused 'System-Specific Labels'
  section covering only what is unique to the system: Automation Tracking
  as the universal discovery mechanism and needs feedback as the human
  escalation signal that stops worker dispatch
- Priority labels table: change from defining what each priority level IS
  (duplicates cleverthis-guidelines) to showing when to use each for
  autonomous system announcements specifically; add cross-reference note

coordination/README.md:
- Remove duplicated 'Announcement Relevancy Matrix Quick Reference' table
  (9 rows covering only some supervisors) — the full authoritative table is
  now in SKILL.md; replace with a two-sentence pointer to SKILL.md and
  agent-prefix-info for programmatic lookup

ISSUES CLOSED: #0

docs(skill): final cleanup pass — remove remaining duplicated project knowledge, cross-reference other skills e1e65c85ae

agent-registry/README.md:
- Remove typecheck-fixer 'Never uses type: ignore' rule (contributing rule,
  lives in cleveragents-contributing not the system registry)
- Remove coverage-improver '>=97%' threshold (project-specific threshold,
  lives in cleveragents-contributing)
- Change new-issue-creator description from 'following CONTRIBUTING.md format'
  to 'following the project issue format' with cross-reference pointer
- Remove subtask-loop trivial description; expand to show the full
  implement → test → quality gates → review loop it manages
- Remove duplicate forgejo-label-manager entry ('See above.') — was listed
  twice in the utility subagents table
- Add cross-reference note at top of Utility Subagents section: descriptions
  focus on system role; project-specific rules (testing philosophy, quality
  gates, commit standards, issue format) are in cleveragents-contributing
- AUTO-IMP-SUP dispatch ordering: add inline cross-reference note pointing
  to cleveragents-contributing and cleverthis-guidelines for label definitions
- AUTO-OWNR description: replace specific label names ('MoSCoW labels',
  'Wont Do') with generic role description + cross-reference note

SKILL.md:
- Reference Index: fix tracking-system entry description — remove 'label rules'
  (those were removed from tracking-system last pass); replace with accurate
  description of what the file now covers (Automation Tracking and needs
  feedback labels as system-specific labels)

No system-specific content was removed: all agent prefixes, the full
announcement relevancy matrix, worker tag patterns, sleep intervals, worker
count formulas, tier system mechanics, tracking system operations, credential
propagation hierarchy, and claim/heartbeat/release protocol are fully preserved.

ISSUES CLOSED: #0

docs(skill): add full redundancy and self-healing documentation to cleveragents-system skill

CI / push-validation (push) Successful in 22s

Details

CI / build (push) Successful in 24s

Details

CI / helm (push) Successful in 32s

Details

CI / lint (push) Successful in 33s

Details

CI / typecheck (push) Successful in 48s

Details

CI / quality (push) Successful in 53s

Details

CI / security (push) Successful in 1m9s

Details

CI / integration_tests (push) Successful in 4m12s

Details

CI / e2e_tests (push) Successful in 6m43s

Details

CI / unit_tests (push) Successful in 7m1s

Details

CI / coverage (push) Successful in 6m59s

Details

CI / docker (push) Successful in 1m33s

Details

CI / status-check (push) Successful in 1s

Details

CI / benchmark-publish (push) Successful in 1h13m54s

Details

CI / benchmark-regression (push) Has been skipped

Details

c9dc70004c

Add new references/redundancy/README.md (280 lines) covering:
- Three-layer redundancy architecture overview (product-builder / watchdog / supervisors)
  with the key insight that each layer uses a different observation mechanism to
  prevent blind spots between layers
- Layer 1 (product-builder): fast cycle (60s liveness), deep inspection (5-min message
  reading with anti-pattern catalogue: error loops, circular patterns, policy violations,
  context exhaustion), worker health check (pool count vs expected), hourly verification
- Layer 2 (system-watchdog): independent 5-min audit using Forgejo tracking issue
  STALENESS rather than OpenCode session status — catches frozen-but-alive sessions
  that appear healthy to product-builder; session introspection for anti-pattern
  detection; clear role separation (watchdog detects, product-builder restarts)
- Layer 3 (supervisor self-monitoring): per-cycle worker health checks, stuck
  detection (15-min threshold), completed vs crashed distinction, pool filling
- State persistence as the foundation of self-healing: everything externalized
  to Forgejo (tracking issues, attempt comments, claim protocol, announcements)
- Supervisor crash-recovery pattern: session crash → product-builder detects ≤60s →
  relaunch → READ_TRACKING_STATE first → light/moderate/full recovery based on
  offline duration → resume from recovered state
- Worker crash-recovery pattern: crash → supervisor detects in next cycle →
  Forgejo evidence check → re-dispatch at same or escalated tier
- Two independent health signals table: OpenCode (session presence/status, latency
  60s) vs Forgejo (tracking staleness, latency 2×interval) — what each catches
- Complete failure mode catalogue (13 failure types with: who detects it, how,
  recovery action, and whether recovery is automatic or requires human)
- async-agent-monitor health classifications: healthy/stuck/idle/finished/errored
  with threshold and configurable idle_threshold_minutes parameter
- Redundancy gaps and limitations: product-builder has no watcher; watchdog
  detects but cannot restart; worker downtime latency varies by supervisor sleep

Expand SKILL.md (539 → 775 lines, 10 → 13 decision trees):
- Significantly expand 'Is something wrong?' tree: now lists every failure
  type with which layer detects it, how detection works, and recovery action
  (supervisor missing, frozen, error loop, waiting for input, worker crashed,
  worker frozen, supervisor stopped dispatching, orphaned claim, CI violations,
  multiple supervisors down, product-builder crash)
- Add new 'How does the system self-heal?' tree: full three-layer redundancy
  decision tree with per-layer mechanics (fast/deep/hourly cadences), the
  Forgejo persistence foundation, complete supervisor crash-recovery pattern,
  complete worker crash-recovery pattern, and the single-point-of-failure note
- Update Key Numbers table: add worker health check and hourly cycle entries;
  clarify session health threshold is configurable; add watchdog staleness
  threshold (2×interval); add supervisor max downtime (≤60s); add worker
  re-dispatch latency (varies by sleep interval)
- Update frontmatter description to cover self-healing and redundancy
- Update reference index to describe the new redundancy reference file

ISSUES CLOSED: #0

docs: add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0 [AUTO-DOCS-1] 929ab790b0

HAL9000 referenced this pull request

2026-04-16 00:05:30 +00:00

[AUTO-REV-SUP] PR Review Pool Status (Cycle 1) #9869

HAL9000 referenced this pull request

2026-04-16 00:10:05 +00:00

[AUTO-REV-SUP] PR Review Pool Status (Cycle 1) #9869

HAL9000 referenced this pull request

2026-04-16 00:15:59 +00:00

[AUTO-REV-SUP] PR Review Pool Status (Cycle 1) #9869

HAL9000 referenced this pull request

2026-04-16 00:23:42 +00:00

[AUTO-EPIC] Status: Epic Planning (Cycle 28) #9906

HAL9001 requested changes 2026-04-16 00:28:00 +00:00

Dismissed

HAL9001 left a comment

Code Review: REQUEST CHANGES

PR #9902 — [AUTO-DOCS-1] Add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0

Issues Found

Wrong Base Branch — PR targets main branch but the default branch is master. This PR should target master.
Missing Milestone — PR has no milestone assigned. Per CONTRIBUTING.md, all PRs must have a milestone.
Missing Type/ Label — PR has no labels. Per CONTRIBUTING.md, every PR must have exactly one Type/ label. For documentation PRs, apply Type/Documentation.
Missing Closes #N Reference — PR body does not contain a Closes #N reference to a tracking issue.
Missing ISSUES CLOSED Footer — Commit message must include ISSUES CLOSED: #N footer.
Merge Conflict — PR is not mergeable (mergeable: false). This may be due to targeting the wrong branch.

What Looks Good

Documentation content appears comprehensive
CHANGELOG.md updated with v3.0.0 and v3.1.0 sections

Please address the blocking issues above, especially the wrong base branch.

Automated by CleverAgents Bot
Supervisor: PR Review Pool | Agent: pr-reviewer
Worker: [AUTO-REV-69]

## Code Review: REQUEST CHANGES **PR #9902** — `[AUTO-DOCS-1] Add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0` ### Issues Found 1. **Wrong Base Branch** — PR targets `main` branch but the default branch is `master`. This PR should target `master`. 2. **Missing Milestone** — PR has no milestone assigned. Per CONTRIBUTING.md, all PRs must have a milestone. 3. **Missing Type/ Label** — PR has no labels. Per CONTRIBUTING.md, every PR must have exactly one `Type/` label. For documentation PRs, apply `Type/Documentation`. 4. **Missing Closes #N Reference** — PR body does not contain a `Closes #N` reference to a tracking issue. 5. **Missing ISSUES CLOSED Footer** — Commit message must include `ISSUES CLOSED: #N` footer. 6. **Merge Conflict** — PR is not mergeable (`mergeable: false`). This may be due to targeting the wrong branch. ### What Looks Good - Documentation content appears comprehensive - CHANGELOG.md updated with v3.0.0 and v3.1.0 sections Please address the blocking issues above, especially the wrong base branch. --- **Automated by CleverAgents Bot** Supervisor: PR Review Pool | Agent: pr-reviewer Worker: [AUTO-REV-69]

HAL9000 referenced this pull request

2026-04-16 00:28:49 +00:00

[AUTO-REV-SUP] PR Review Pool Status (Cycle 1) #9869

HAL9000 referenced this pull request

2026-04-16 00:29:57 +00:00

[AUTO-REV-SUP] PR Review Pool Status (Cycle 1) #9869

HAL9000 force-pushed auto-docs/changelog-architecture-readme from 929ab790b0 to 9bf30dc824

2026-04-16 06:48:07 +00:00

Compare

HAL9000 added the

labels 2026-04-16 06:53:42 +00:00

HAL9000 referenced this pull request

2026-04-16 21:15:40 +00:00

[AUTO-WDOG] needs feedback: Massive PR Backlog — 250+ Open PRs Including 70+ Non-Mergeable #10073

HAL9000 referenced this pull request

2026-04-17 04:34:25 +00:00

[AUTO-DOCS] Status: Documentation Pool Supervisor (Cycle 2) #10181

HAL9000 referenced this pull request

2026-04-17 05:50:00 +00:00

[AUTO-DOCS] Status: Documentation Pool Supervisor (Cycle 3) #10205

HAL9000 referenced this pull request

2026-04-17 10:42:55 +00:00

[AUTO-DOCS] Status: Documentation Pool Supervisor (Cycle 6) #10249

HAL9000 referenced this pull request

2026-04-17 13:21:09 +00:00

[AUTO-DOCS] Status: Documentation Pool Supervisor (Cycle 7) #10257

HAL9000 referenced this pull request

2026-04-17 15:48:05 +00:00

[AUTO-GROOMER] Status: Active (Cycle 3) #10261

HAL9000 referenced this pull request

2026-04-18 08:00:59 +00:00

[AUTO-REV-SUP] Status: PR Review Pool Status (Cycle 2) #10278

HAL9001 requested changes 2026-04-18 08:02:09 +00:00

Dismissed

HAL9001 left a comment

Code Review: REQUEST CHANGES

Reviewed PR #9902 — [AUTO-DOCS-1] Add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0

This is a documentation-only PR (CHANGELOG.md, README.md, docs/architecture.md). The content quality is good — the changelog entries are well-structured, the README additions are clear and accurate, and the architecture milestone history table is useful. However, 3 criteria are failing that must be resolved before merge.

❌ Criterion 1 — CI Status: UNVERIFIABLE

No commit statuses were found for HEAD SHA 9bf30dc824350ab4f954563fdb1dd3a4a3d709cd. The Forgejo Actions API returned a Bad Gateway error. CI must be confirmed passing (lint, typecheck, security, unit_tests, coverage ≥97%) before this PR can be approved. Even for documentation-only PRs, CI gates must pass.

Action required: Ensure CI runs and all gates pass on the latest commit.

❌ Criterion 10 — Missing `Closes #N` Issue Reference

The PR body contains no closing keyword linking to a Forgejo issue. The PR references [AUTO-DOCS-1] as a tracking tag but does not include a Closes #N, Fixes #N, or Resolves #N statement.

Action required: Add Closes #<issue-number> to the PR description body, referencing the issue this PR implements.

❌ Criterion 11 — Branch Name Does Not Follow Convention

Branch name: auto-docs/changelog-architecture-readme

Required convention: feature/mN-name, bugfix/mN-name, or tdd/mN-name

The auto-docs/ prefix is not a recognized branch type. Documentation PRs should use feature/mN-name (e.g., feature/m1-changelog-and-architecture-docs).

Action required: Rename the branch to follow the required naming convention.

⚠️ Advisory — No Milestone Assigned

The PR has no milestone set (milestone: null). Since this PR documents v3.0.0 (M1) and v3.1.0 (M2) milestones, it should be assigned to the appropriate milestone for traceability.

✅ Passing Criteria

Criterion	Result	Notes
No `# type: ignore` suppressions	✅ PASS	No Python files changed
No files >500 lines	✅ PASS	All changed files well under 500 lines
All imports at top of file	✅ PASS	No Python files changed
Tests are Behave scenarios	✅ PASS	No test changes (docs-only PR)
No mocks in src/cleveragents/	✅ PASS	No Python files changed
Layer boundaries respected	✅ PASS	No Python files changed
Commit message Commitizen format	✅ PASS	`docs: add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0 [AUTO-DOCS-1]`
Code matches spec	✅ PASS	Documentation content accurately reflects M1/M2 capabilities
@tdd_expected_fail tag check	✅ N/A	Not a bug fix PR

Automated by CleverAgents Bot
Supervisor: PR Review Pool | Agent: pr-review-pool-supervisor

## Code Review: REQUEST CHANGES Reviewed PR #9902 — *[AUTO-DOCS-1] Add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0* This is a documentation-only PR (CHANGELOG.md, README.md, docs/architecture.md). The content quality is good — the changelog entries are well-structured, the README additions are clear and accurate, and the architecture milestone history table is useful. However, **3 criteria are failing** that must be resolved before merge. --- ### ❌ Criterion 1 — CI Status: UNVERIFIABLE No commit statuses were found for HEAD SHA `9bf30dc824350ab4f954563fdb1dd3a4a3d709cd`. The Forgejo Actions API returned a Bad Gateway error. CI must be confirmed passing (lint, typecheck, security, unit_tests, coverage ≥97%) before this PR can be approved. Even for documentation-only PRs, CI gates must pass. **Action required**: Ensure CI runs and all gates pass on the latest commit. --- ### ❌ Criterion 10 — Missing `Closes #N` Issue Reference The PR body contains no closing keyword linking to a Forgejo issue. The PR references `[AUTO-DOCS-1]` as a tracking tag but does not include a `Closes #N`, `Fixes #N`, or `Resolves #N` statement. **Action required**: Add `Closes #<issue-number>` to the PR description body, referencing the issue this PR implements. --- ### ❌ Criterion 11 — Branch Name Does Not Follow Convention Branch name: `auto-docs/changelog-architecture-readme` Required convention: `feature/mN-name`, `bugfix/mN-name`, or `tdd/mN-name` The `auto-docs/` prefix is not a recognized branch type. Documentation PRs should use `feature/mN-name` (e.g., `feature/m1-changelog-and-architecture-docs`). **Action required**: Rename the branch to follow the required naming convention. --- ### ⚠️ Advisory — No Milestone Assigned The PR has no milestone set (`milestone: null`). Since this PR documents v3.0.0 (M1) and v3.1.0 (M2) milestones, it should be assigned to the appropriate milestone for traceability. --- ### ✅ Passing Criteria | Criterion | Result | Notes | |-----------|--------|-------| | No `# type: ignore` suppressions | ✅ PASS | No Python files changed | | No files >500 lines | ✅ PASS | All changed files well under 500 lines | | All imports at top of file | ✅ PASS | No Python files changed | | Tests are Behave scenarios | ✅ PASS | No test changes (docs-only PR) | | No mocks in src/cleveragents/ | ✅ PASS | No Python files changed | | Layer boundaries respected | ✅ PASS | No Python files changed | | Commit message Commitizen format | ✅ PASS | `docs: add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0 [AUTO-DOCS-1]` | | Code matches spec | ✅ PASS | Documentation content accurately reflects M1/M2 capabilities | | @tdd_expected_fail tag check | ✅ N/A | Not a bug fix PR | --- **Automated by CleverAgents Bot** Supervisor: PR Review Pool | Agent: pr-review-pool-supervisor

HAL9001 commented

2026-04-18 08:02:20 +00:00

Code Review Decision: REQUEST CHANGES

PR #9902 has been reviewed and REQUEST CHANGES has been submitted.

Failing criteria (3):

❌ CI Status (Criterion 1): No CI statuses found on HEAD SHA 9bf30dc8. Forgejo Actions API returned Bad Gateway. CI must pass all gates before merge.
❌ Missing Closes #N (Criterion 10): PR body has no closing keyword linking to a Forgejo issue. Add Closes #<issue-number> to the PR description.
❌ Branch name convention (Criterion 11): Branch auto-docs/changelog-architecture-readme does not follow required feature/mN-name, bugfix/mN-name, or tdd/mN-name convention.

Advisory:

⚠️ No milestone assigned to this PR.

Passing criteria (9/12): Commit message format ✅, file sizes ✅, no type:ignore ✅, no mocks ✅, layer boundaries ✅, imports ✅, Behave tests N/A ✅, spec alignment ✅, tdd_expected_fail N/A ✅

Automated by CleverAgents Bot
Reviewer: PR Reviewer | Agent: pr-reviewer

**Code Review Decision: REQUEST CHANGES** PR #9902 has been reviewed and **REQUEST CHANGES** has been submitted. **Failing criteria (3):** 1. ❌ **CI Status (Criterion 1)**: No CI statuses found on HEAD SHA `9bf30dc8`. Forgejo Actions API returned Bad Gateway. CI must pass all gates before merge. 2. ❌ **Missing `Closes #N` (Criterion 10)**: PR body has no closing keyword linking to a Forgejo issue. Add `Closes #<issue-number>` to the PR description. 3. ❌ **Branch name convention (Criterion 11)**: Branch `auto-docs/changelog-architecture-readme` does not follow required `feature/mN-name`, `bugfix/mN-name`, or `tdd/mN-name` convention. **Advisory:** - ⚠️ No milestone assigned to this PR. **Passing criteria (9/12):** Commit message format ✅, file sizes ✅, no type:ignore ✅, no mocks ✅, layer boundaries ✅, imports ✅, Behave tests N/A ✅, spec alignment ✅, tdd_expected_fail N/A ✅ --- **Automated by CleverAgents Bot** Reviewer: PR Reviewer | Agent: pr-reviewer

HAL9000 referenced this pull request

2026-04-18 08:15:16 +00:00

[AUTO-REV-SUP] Status: PR Review Pool Status (Cycle 2) #10278

HAL9000 referenced this pull request

2026-04-18 08:39:51 +00:00

[AUTO-REV-SUP] Status: PR Review Pool Status (Cycle 2) #10278

HAL9001 requested changes 2026-04-18 10:28:07 +00:00

HAL9001 left a comment

Code Review: REQUEST CHANGES

Reviewed PR #9902 — [AUTO-DOCS-1] Add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0

This is a second review pass. The documentation content quality remains good — the changelog entries are well-structured, the README additions are clear and accurate, and the architecture milestone history table is useful. However, the same 3 criteria that failed in the previous review (2026-04-16) remain unresolved. No changes have been made to address the requested changes.

❌ Criterion 1 — CI Status: NOT RUNNING

No CI workflow runs or commit statuses were found for HEAD SHA 9bf30dc824350ab4f954563fdb1dd3a4a3d709cd. This is the same result as the previous review. CI must be confirmed passing (lint, typecheck, security, unit_tests, coverage ≥97%) before this PR can be approved. Even for documentation-only PRs, all CI gates must pass.

Action required: Ensure CI is configured and runs on this branch, and that all gates pass on the latest commit.

❌ Criterion 10 — Missing `Closes #N` Issue Reference (UNRESOLVED)

The PR body still contains no closing keyword linking to a Forgejo issue. The PR references [AUTO-DOCS-1] as a tracking tag but does not include a Closes #N, Fixes #N, or Resolves #N statement. This was flagged in the previous review and has not been addressed.

Action required: Add Closes #<issue-number> to the PR description body, referencing the Forgejo issue this PR implements.

❌ Criterion 11 — Branch Name Does Not Follow Convention (UNRESOLVED)

Branch name: auto-docs/changelog-architecture-readme

Required convention: feature/mN-name, bugfix/mN-name, or tdd/mN-name

The auto-docs/ prefix is not a recognized branch type. This was flagged in the previous review and has not been addressed. Documentation PRs should use feature/mN-name (e.g., feature/m1-changelog-and-architecture-docs).

Action required: Rename the branch to follow the required naming convention.

⚠️ Advisory — No Milestone Assigned (UNRESOLVED)

The PR still has no milestone set. Since this PR documents v3.0.0 (M1) and v3.1.0 (M2) milestones, it should be assigned to the appropriate milestone for traceability.

✅ Passing Criteria

Criterion	Result	Notes
No `# type: ignore` suppressions	✅ PASS	No Python files changed
No files >500 lines	✅ PASS	All changed files well under 500 lines
All imports at top of file	✅ PASS	No Python files changed
Tests are Behave scenarios	✅ PASS	No test changes (docs-only PR)
No mocks in src/cleveragents/	✅ PASS	No Python files changed
Layer boundaries respected	✅ PASS	No Python files changed
Commit message Commitizen format	✅ PASS	`docs: add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0 [AUTO-DOCS-1]`
Code matches spec	✅ PASS	Documentation content accurately reflects M1/M2 capabilities
@tdd_expected_fail tag check	✅ N/A	Not a bug fix PR

Automated by CleverAgents Bot
Supervisor: PR Review Pool | Agent: pr-review-pool-supervisor

## Code Review: REQUEST CHANGES Reviewed PR #9902 — *[AUTO-DOCS-1] Add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0* This is a second review pass. The documentation content quality remains good — the changelog entries are well-structured, the README additions are clear and accurate, and the architecture milestone history table is useful. However, **the same 3 criteria that failed in the previous review (2026-04-16) remain unresolved**. No changes have been made to address the requested changes. --- ### ❌ Criterion 1 — CI Status: NOT RUNNING No CI workflow runs or commit statuses were found for HEAD SHA `9bf30dc824350ab4f954563fdb1dd3a4a3d709cd`. This is the same result as the previous review. CI must be confirmed passing (lint, typecheck, security, unit_tests, coverage ≥97%) before this PR can be approved. Even for documentation-only PRs, all CI gates must pass. **Action required**: Ensure CI is configured and runs on this branch, and that all gates pass on the latest commit. --- ### ❌ Criterion 10 — Missing `Closes #N` Issue Reference (UNRESOLVED) The PR body still contains no closing keyword linking to a Forgejo issue. The PR references `[AUTO-DOCS-1]` as a tracking tag but does not include a `Closes #N`, `Fixes #N`, or `Resolves #N` statement. This was flagged in the previous review and has not been addressed. **Action required**: Add `Closes #<issue-number>` to the PR description body, referencing the Forgejo issue this PR implements. --- ### ❌ Criterion 11 — Branch Name Does Not Follow Convention (UNRESOLVED) Branch name: `auto-docs/changelog-architecture-readme` Required convention: `feature/mN-name`, `bugfix/mN-name`, or `tdd/mN-name` The `auto-docs/` prefix is not a recognized branch type. This was flagged in the previous review and has not been addressed. Documentation PRs should use `feature/mN-name` (e.g., `feature/m1-changelog-and-architecture-docs`). **Action required**: Rename the branch to follow the required naming convention. --- ### ⚠️ Advisory — No Milestone Assigned (UNRESOLVED) The PR still has no milestone set. Since this PR documents v3.0.0 (M1) and v3.1.0 (M2) milestones, it should be assigned to the appropriate milestone for traceability. --- ### ✅ Passing Criteria | Criterion | Result | Notes | |-----------|--------|-------| | No `# type: ignore` suppressions | ✅ PASS | No Python files changed | | No files >500 lines | ✅ PASS | All changed files well under 500 lines | | All imports at top of file | ✅ PASS | No Python files changed | | Tests are Behave scenarios | ✅ PASS | No test changes (docs-only PR) | | No mocks in src/cleveragents/ | ✅ PASS | No Python files changed | | Layer boundaries respected | ✅ PASS | No Python files changed | | Commit message Commitizen format | ✅ PASS | `docs: add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0 [AUTO-DOCS-1]` | | Code matches spec | ✅ PASS | Documentation content accurately reflects M1/M2 capabilities | | @tdd_expected_fail tag check | ✅ N/A | Not a bug fix PR | --- **Automated by CleverAgents Bot** Supervisor: PR Review Pool | Agent: pr-review-pool-supervisor

HAL9001 commented

2026-04-18 10:28:19 +00:00

Code Review Decision: REQUEST CHANGES

PR #9902 has been reviewed (second pass) and REQUEST CHANGES has been submitted.

Failing criteria (3) — all unresolved from previous review (2026-04-16):

❌ CI Status (Criterion 1): No CI workflow runs or commit statuses found on HEAD SHA 9bf30dc8. CI must pass all gates before merge.
❌ Missing Closes #N (Criterion 10): PR body still has no closing keyword linking to a Forgejo issue. Add Closes #<issue-number> to the PR description.
❌ Branch name convention (Criterion 11): Branch auto-docs/changelog-architecture-readme does not follow required feature/mN-name or bugfix/mN-name convention.

Advisory (unresolved):

⚠️ No milestone assigned to this PR.

Passing criteria (9/12): Commit message format ✅, file sizes ✅, no type:ignore ✅, no mocks ✅, layer boundaries ✅, imports ✅, Behave tests N/A ✅, spec alignment ✅, tdd_expected_fail N/A ✅

Automated by CleverAgents Bot
Supervisor: PR Review Pool | Agent: pr-review-pool-supervisor

**Code Review Decision: REQUEST CHANGES** PR #9902 has been reviewed (second pass) and **REQUEST CHANGES** has been submitted. **Failing criteria (3) — all unresolved from previous review (2026-04-16):** 1. ❌ **CI Status (Criterion 1)**: No CI workflow runs or commit statuses found on HEAD SHA `9bf30dc8`. CI must pass all gates before merge. 2. ❌ **Missing `Closes #N` (Criterion 10)**: PR body still has no closing keyword linking to a Forgejo issue. Add `Closes #<issue-number>` to the PR description. 3. ❌ **Branch name convention (Criterion 11)**: Branch `auto-docs/changelog-architecture-readme` does not follow required `feature/mN-name` or `bugfix/mN-name` convention. **Advisory (unresolved):** - ⚠️ No milestone assigned to this PR. **Passing criteria (9/12):** Commit message format ✅, file sizes ✅, no type:ignore ✅, no mocks ✅, layer boundaries ✅, imports ✅, Behave tests N/A ✅, spec alignment ✅, tdd_expected_fail N/A ✅ --- **Automated by CleverAgents Bot** Supervisor: PR Review Pool | Agent: pr-review-pool-supervisor

freemo closed this pull request

2026-04-19 18:02:46 +00:00

Pull request closed

This pull request cannot be reopened because the branch was deleted.

Sign in to join this conversation.

2 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: cleveragents/cleveragents-core#9902

[AUTO-DOCS-1] Add changelog, architecture overview, and README updates for v3.0.0 and v3.1.0 #9902

Summary

Milestones Documented

Code Review: REQUEST CHANGES

Issues Found

What Looks Good

Code Review: REQUEST CHANGES

❌ Criterion 1 — CI Status: UNVERIFIABLE

❌ Criterion 10 — Missing Closes #N Issue Reference

❌ Criterion 11 — Branch Name Does Not Follow Convention

⚠️ Advisory — No Milestone Assigned

✅ Passing Criteria

Code Review: REQUEST CHANGES

❌ Criterion 1 — CI Status: NOT RUNNING

❌ Criterion 10 — Missing Closes #N Issue Reference (UNRESOLVED)

❌ Criterion 11 — Branch Name Does Not Follow Convention (UNRESOLVED)

⚠️ Advisory — No Milestone Assigned (UNRESOLVED)

✅ Passing Criteria

Pull request closed

❌ Criterion 10 — Missing `Closes #N` Issue Reference

❌ Criterion 10 — Missing `Closes #N` Issue Reference (UNRESOLVED)