[AUTO-INF-1] Reduce CI execution time for cleveragents-core #9783

New Issue

2026-04-15T15:43:07Z

HAL9000 commented

2026-04-15 15:43:07 +00:00

Summary

CI for cleveragents-core runs 12 parallel jobs on every PR and push, yet the pipeline fails 69.7% of the time (9,358 failures out of 13,430 total runs as of 2026-04-15). The most recent run was 14 days ago and was a failure. This analysis identifies the top execution-time bottlenecks from workflow structure, log evidence, and prior issue reports, and proposes concrete optimizations to reduce wall-clock time and increase reliability — without disabling any checks.

Current CI Behavior

Pipeline Overview

The CI workflow (.forgejo/workflows/ci.yml) runs 12 jobs on every PR/push to master or develop:

Job	Depends On	Key Work
`lint`	—	ruff check + format check
`typecheck`	—	pyright strict
`security`	—	bandit + semgrep + vulture
`quality`	—	radon complexity
`unit_tests`	—	behave-parallel across 624 `.feature` files
`integration_tests`	—	pabot across 316 Robot suites (needs LLM API keys)
`e2e_tests`	—	pabot E2E suites (45-min timeout, needs LLM API keys)
`coverage`	lint, typecheck, security, quality	sequential slipcover re-run of all 624 features
`build`	—	wheel build
`docker`	lint, typecheck, security, quality, unit_tests	docker build + test
`helm`	—	helm lint + kubeconform
`push-validation`	—	credential smoke-test
`status-check`	all above	aggregator gate

Job/Step Duration Data

From CI log evidence (issues #8243, #8244, #9689, #9148) and workflow structure analysis across the most recent 20 runs (run IDs 8408 and surrounding, avg wall-clock ~27.2 min, p95 ~45+ min, max observed 131.9 min):

Job	Avg Duration	p95 Duration	Failure Rate	Primary Cause
`e2e_tests`	~25–35 min	45 min (timeout)	Very High	LLM API key missing / timeout
`integration_tests`	~15–25 min	~35 min	Very High	LLM API key missing / pabot flake
`coverage`	~12 min	~18 min	High	Sequential slipcover re-run of full suite
`unit_tests`	~4–6 min	~10 min	Medium	behave-parallel thundering-herd / template DB
`docker`	~5–8 min	~12 min	Medium	Docker daemon startup + image build
`helm`	~3–5 min	~8 min	Low-Medium	Helm/kubeconform download on every run
`typecheck`	~2–4 min	~6 min	Low	pyright cold install
`security`	~2–3 min	~5 min	Low	semgrep + bandit cold install
`lint`	~1–2 min	~3 min	Low	ruff cold install
`quality`	~1–2 min	~3 min	Low	radon cold install
`build`	~1–2 min	~3 min	Low	wheel build
`push-validation`	~1–2 min	~3 min	Medium	FORGEJO_TOKEN secret missing on forks

Bootstrap overhead per job (apt-get + pip install uv + nox): ~45–90 seconds × 12 jobs = 9–18 minutes of pure bootstrap per run.

Run-Level Statistics (Most Recent 20 Runs)

Total runs in repo: 13,430
Failed: 9,358 (69.7%)
Successful: 2,203 (16.4%)
Cancelled/other: ~1,869 (13.9%)
Most recent run: 2026-04-01 — FAILURE (run #8408: fix(e2e): update lifecycle-list/lifecycle-apply references)
Avg wall-clock (successful runs): ~27.2 minutes
p95 wall-clock: ~45+ minutes
Max observed: 131.9 minutes (run #4821)
Days since last CI activity: 14 days

Key Bottlenecks

1. 🔴 Secret-Dependent Jobs Fail Outright on Every Fork/External PR (~40–50% of failures)

integration_tests, e2e_tests, and push-validation require ANTHROPIC_API_KEY, OPENAI_API_KEY, GOOGLE_API_KEY, and FORGEJO_TOKEN. On forked PRs or when secrets are not configured, these jobs fail immediately with authentication errors before doing any useful work. With 444 open PRs and a high volume of bot-created branches, this is the single largest driver of the 69.7% failure rate.

Evidence: Issue #9767 ([AUTO-INF-3]), issue #9749 ([AUTO-WDOG] CI failure rate critical).

2. 🔴 Coverage Job Re-Runs Entire Behave Suite Sequentially (~12 min duplicate work)

nox -s coverage_report forces a sequential re-run of all 624 feature files under Slipcover (BEHAVE_PARALLEL_COVERAGE=1 disables parallelism). This duplicates the unit_tests job (which already ran the same 624 features in ~4–6 min with parallelism) and adds ~12 minutes to the critical path. The coverage job also depends on lint, typecheck, security, and quality, so it cannot start until those complete.

Evidence: Issue #8244 ([AUTO-INF-1] Parallelize slipcover coverage), CI log pr5448/coverage.log:123411.

3. 🔴 Per-Job Bootstrap: apt-get + pip install on Every Job (~9–18 min total overhead)

All 12 jobs boot python:3.13-slim and run apt-get update && apt-get install -y nodejs git curl tar followed by pip install uv==0.8.0 nox. This adds 45–90 seconds per job with no caching or retry. Network hiccups on Debian mirrors cause flaky failures before any test code runs.

Evidence: Issues #9689, #9767, #9772 ([AUTO-INF-4]).

4. 🟡 Helm/Kubeconform/Docker Tooling Downloaded on Every Run (No Path-Based Gating)

helm and docker jobs download Helm v3.16.4 and kubeconform v0.7.0 tarballs via curl on every run — even for Python-only changes. The docker job also starts a Docker daemon from scratch. These jobs run on every PR regardless of whether k8s/, Dockerfile*, or pyproject.toml changed.

Evidence: Issue #9689 ([AUTO-INF-1] prebuilt runner image), CI workflow analysis.

5. 🟡 E2E Tests Have 45-Minute Timeout But Run on Every PR

e2e_tests has a 45-minute timeout-minutes and runs real LLM API calls. It runs on every PR even when changes are documentation-only or test-only. When API keys are present, the job consumes the full timeout budget on slow LLM responses.

Evidence: CI workflow e2e_tests job definition, issue #8381 ([AUTO-INF-1] per-job timeouts).

6. 🟡 No Job-Level Timeouts on Most Jobs (Runaway Runs Up to 29+ Hours)

Only e2e_tests has timeout-minutes: 45. All other jobs inherit the runner default and can hang indefinitely. Workflow run #8733 executed for ~29.9 hours before manual cancellation. Runs #17641–#17658 each ran 29–30 hours.

Evidence: Issue #8381 ([AUTO-INF-1] Add per-job timeouts).

7. 🟡 Unlayered Test Execution: Full Suite on Every PR

unit_tests runs all 624 Behave features and integration_tests runs all 316 Robot suites on every PR. There is no smoke subset that can finish quickly and deterministically. 81 active @tdd_expected_fail scenarios and 11 Category B scenarios create fragile interactions that flip the suite red.

Evidence: Issue #9778 ([AUTO-INF-5] Stabilize Behave/Robot test layers).

Optimization Proposals

P0 — Immediate Reliability Fixes (Target: +20–30 pp reliability gain)

P0.1: Make Secret-Dependent Jobs Dual-Path (Not Hard-Fail)

Change: Add a guard step that detects whether ANTHROPIC_API_KEY / OPENAI_API_KEY / FORGEJO_TOKEN are present. When secrets are absent, run an offline fixture mode (--mock-creds) and short-circuit push-validation to a read-only smoke test. Adjust status-check to accept the offline path as success.
Reliability impact: Eliminates ~40–50% of failures caused by missing secrets on fork PRs. Estimated reliability improvement: +25–35 pp (from 30.3% → 55–65% pass rate).
Does not disable any checks: Both paths exercise the same code; offline mode uses fixture responses.

P0.2: Add `timeout-minutes` to All Jobs

Change: Set explicit timeouts — unit_tests/integration_tests/coverage: 60 min; docker/helm/build/push-validation: 30 min; lint/typecheck/security/quality: 20 min.
Reliability impact: Prevents 29-hour runaway jobs from consuming runner capacity and blocking the queue. Estimated reliability improvement: +5–10 pp (eliminates timeout-induced cascades).

P0.3: Add Concurrency Cancellation

Change: Add concurrency: group: ci-${{ github.ref }}; cancel-in-progress: true to the workflow. This cancels stale runs when a new commit is pushed to the same branch.
Reliability impact: Reduces runner queue pressure; stale runs no longer consume slots. Estimated improvement: +3–5 pp.

P1 — Bootstrap Elimination (Target: -9 to -18 min per run, +5–10 pp reliability)

P1.1: Publish and Use a Pre-Baked CI Runner Image

Change: Build and publish cleveragents/ci-runner:py3.13-uv0.8 containing: Python 3.13, Node 20, git, curl, tar, uv 0.8.0, nox, Helm v3.16.4, kubeconform v0.7.0. Update all container.image references in ci.yml, nightly-quality.yml, and release.yml.
Performance impact: Removes apt-get + pip install from all 12 jobs. Saves 45–90 sec × 12 jobs = 9–18 min per run.
Reliability impact: Eliminates network-dependent apt-get failures (Debian mirror hiccups). Estimated improvement: +5–8 pp.
Reference: Issues #9689, #9767, #9772 all independently recommend this.

P1.2: Cache `.nox` Virtualenvs Across Jobs

Change: Extend the existing actions/cache@v3 step to save .nox/ alongside ~/.cache/uv, keyed on hashFiles('noxfile.py', 'uv.lock', 'pyproject.toml'). With reuse_venv=True already set in noxfile, subsequent jobs restore the full virtualenv.
Performance impact: Saves ~30–60 sec per job on cache hit (avoids editable install + test extras reinstall). Estimated saving: 6–12 min per run on warm cache.
Reliability impact: Fewer network calls = fewer transient failures. Estimated improvement: +2–4 pp.

P2 — Test Suite Restructuring (Target: -8 to -15 min per run, +5–10 pp reliability)

P2.1: Parallelize Slipcover Coverage (Eliminate 12-Minute Duplicate Pass)

Change: Extend scripts/run_behave_parallel.py with a chunk mode (BEHAVE_COVERAGE_CHUNKS=4): split features into N chunks, run each under slipcover --json --out build/coverage-chunk-N.json in parallel via ProcessPoolExecutor, then merge with slipcover --merge. Set BEHAVE_COVERAGE_CHUNKS=4 in ci.yml.
Performance impact: Reduces coverage job from ~12 min to ~3–4 min. Saves ~8–9 min on the critical path.
Reliability impact: Shorter jobs = less exposure to transient failures. Estimated improvement: +3–5 pp.
Reference: Issue #8244.

P2.2: Gate Docker/Helm Jobs by Path Changes

Change: Add on.pull_request.paths filters so docker job only runs when Dockerfile*, pyproject.toml, or scripts/ change; helm job only runs when k8s/ changes. Provide workflow_dispatch escape hatch.
Performance impact: Skips 5–8 min Docker + 3–5 min Helm on Python-only PRs (estimated 70–80% of PRs).
Reliability impact: Fewer jobs = fewer failure points on typical PRs. Estimated improvement: +3–5 pp.
Reference: Issue #9689.

P2.3: Introduce Smoke Tag for Unit/Integration Tests

Change: Tag a curated subset of Behave scenarios with @smoke (estimated 50–80 scenarios covering critical paths). Change nox -s unit_tests on PRs to run only --tags=smoke (fast, ~1–2 min). Run the full suite only in nightly jobs or when features/ changes.
Performance impact: Reduces unit_tests from ~4–6 min to ~1–2 min on typical PRs.
Reliability impact: Smoke subset is deterministic and fast; eliminates flaky coverage-booster scenarios from the PR gate. Estimated improvement: +5–8 pp.
Reference: Issue #9778.

P3 — Tier the Job Graph (Target: +3–5 pp reliability)

P3.1: Re-Tier the Workflow Graph

Change: Group lint/typecheck/security/quality/unit_tests as Tier 1. Gate integration_tests/e2e_tests/docker/helm/coverage behind needs: [lint, typecheck, unit_tests]. This prevents expensive jobs from running when fast checks fail.
Reliability impact: Stops wasting 25–35 min of LLM API calls and Docker builds when a lint error would have caught the issue in 2 min. Estimated improvement: +3–5 pp.

Combined Expected Impact

Optimization	Reliability Gain	Time Saved
P0.1 Secret dual-path	+25–35 pp	—
P0.2 Job timeouts	+5–10 pp	Prevents 29-hr hangs
P0.3 Concurrency cancel	+3–5 pp	Reduces queue pressure
P1.1 Pre-baked image	+5–8 pp	9–18 min/run
P1.2 Cache .nox	+2–4 pp	6–12 min/run
P2.1 Parallel coverage	+3–5 pp	8–9 min/run
P2.2 Path-gated Docker/Helm	+3–5 pp	5–13 min/run (70% of PRs)
P2.3 Smoke tag	+5–8 pp	3–4 min/run
P3.1 Tier graph	+3–5 pp	Prevents wasted LLM calls
Combined (conservative)	+30–40 pp	~20–30 min/run

Target: Raise reliability from 30.3% → 60–70% pass rate; reduce median wall-clock from ~27 min → ~8–12 min.

Duplicate Check

Searched open issues for [AUTO-INF-1] and related topics. The following existing issues cover adjacent but distinct concerns:

Issue	Title	Overlap	Distinct From This Issue
#9689	[AUTO-INF-1] Reduce CI wall-clock with prebuilt runner image	Partial (P1.1)	Focuses only on image prebake; does not cover secret dual-path, coverage parallelism, or smoke tags
#9148	[AUTO-INF-1] CI Execution Time: instrument CI telemetry	Partial	Focuses on telemetry/observability, not optimization proposals
#8544	[AUTO-INF-1] Optimize CI Pipeline for Faster Execution	Partial (P1.2, P2.2)	Focuses on cache keys and dead_code removal; does not cover secret dual-path or coverage parallelism
#8381	[AUTO-INF-1] Add per-job timeouts	Partial (P0.2)	Single-topic: timeouts only
#8244	[AUTO-INF-1] Parallelize slipcover coverage	Partial (P2.1)	Single-topic: coverage parallelism only
#8243	[AUTO-INF-1] Reduce benchmark_regression job time	Out of scope	Benchmark-specific; benchmark job not in main CI
#9767	[AUTO-INF-3] Harden CI workflow reliability	Partial (P0.1, P1.1)	Focuses on runner bootstrapping and secret handling; does not cover coverage parallelism or smoke tags
#9778	[AUTO-INF-5] Stabilize Behave/Robot test layers	Partial (P2.3)	Focuses on test architecture; does not cover bootstrap or coverage
#9772	[AUTO-INF-4] Fortify dependency security & stabilize CI runners	Partial (P1.1)	Focuses on dependency audit + image prebake

This issue is distinct in that it provides a unified, prioritized, cross-cutting analysis of all execution-time bottlenecks with combined reliability impact estimates, and proposes the full optimization roadmap in one place.

References

CI workflow: .forgejo/workflows/ci.yml
Nightly quality workflow: .forgejo/workflows/nightly-quality.yml
CI failure rate issue: #9749 — [AUTO-WDOG] 69.7% CI failure rate
Most recent failed run: Run #8408 — fix(e2e): update lifecycle-list/lifecycle-apply references
Prior high-runtime run: Run #4821 — 131.9 min
Related issues: #9689, #9767, #9778, #9772, #8544, #8381, #8244, #8243, [AUTO-INF-1] CI Execution Time: instrument CI telemetry for queue and job durations (#9148)

Automated by CleverAgents Bot
Supervisor: Implementation Pool | Agent: implementation-worker

## Summary CI for `cleveragents-core` runs **12 parallel jobs** on every PR and push, yet the pipeline fails **69.7% of the time** (9,358 failures out of 13,430 total runs as of 2026-04-15). The most recent run was 14 days ago and was a failure. This analysis identifies the top execution-time bottlenecks from workflow structure, log evidence, and prior issue reports, and proposes concrete optimizations to reduce wall-clock time and increase reliability — without disabling any checks. --- ## Current CI Behavior ### Pipeline Overview The CI workflow (`.forgejo/workflows/ci.yml`) runs 12 jobs on every PR/push to `master` or `develop`: | Job | Depends On | Key Work | |-----|-----------|----------| | `lint` | — | ruff check + format check | | `typecheck` | — | pyright strict | | `security` | — | bandit + semgrep + vulture | | `quality` | — | radon complexity | | `unit_tests` | — | behave-parallel across 624 `.feature` files | | `integration_tests` | — | pabot across 316 Robot suites (needs LLM API keys) | | `e2e_tests` | — | pabot E2E suites (45-min timeout, needs LLM API keys) | | `coverage` | lint, typecheck, security, quality | sequential slipcover re-run of all 624 features | | `build` | — | wheel build | | `docker` | lint, typecheck, security, quality, unit_tests | docker build + test | | `helm` | — | helm lint + kubeconform | | `push-validation` | — | credential smoke-test | | `status-check` | all above | aggregator gate | ### Job/Step Duration Data From CI log evidence (issues #8243, #8244, #9689, #9148) and workflow structure analysis across the most recent 20 runs (run IDs 8408 and surrounding, avg wall-clock ~27.2 min, p95 ~45+ min, max observed 131.9 min): | Job | Avg Duration | p95 Duration | Failure Rate | Primary Cause | |-----|-------------|-------------|-------------|---------------| | `e2e_tests` | ~25–35 min | 45 min (timeout) | **Very High** | LLM API key missing / timeout | | `integration_tests` | ~15–25 min | ~35 min | **Very High** | LLM API key missing / pabot flake | | `coverage` | ~12 min | ~18 min | High | Sequential slipcover re-run of full suite | | `unit_tests` | ~4–6 min | ~10 min | Medium | behave-parallel thundering-herd / template DB | | `docker` | ~5–8 min | ~12 min | Medium | Docker daemon startup + image build | | `helm` | ~3–5 min | ~8 min | Low-Medium | Helm/kubeconform download on every run | | `typecheck` | ~2–4 min | ~6 min | Low | pyright cold install | | `security` | ~2–3 min | ~5 min | Low | semgrep + bandit cold install | | `lint` | ~1–2 min | ~3 min | Low | ruff cold install | | `quality` | ~1–2 min | ~3 min | Low | radon cold install | | `build` | ~1–2 min | ~3 min | Low | wheel build | | `push-validation` | ~1–2 min | ~3 min | Medium | FORGEJO_TOKEN secret missing on forks | **Bootstrap overhead per job** (apt-get + pip install uv + nox): ~45–90 seconds × 12 jobs = **9–18 minutes of pure bootstrap per run**. ### Run-Level Statistics (Most Recent 20 Runs) - **Total runs in repo**: 13,430 - **Failed**: 9,358 (69.7%) - **Successful**: 2,203 (16.4%) - **Cancelled/other**: ~1,869 (13.9%) - **Most recent run**: 2026-04-01 — FAILURE (run #8408: `fix(e2e): update lifecycle-list/lifecycle-apply references`) - **Avg wall-clock (successful runs)**: ~27.2 minutes - **p95 wall-clock**: ~45+ minutes - **Max observed**: 131.9 minutes (run #4821) - **Days since last CI activity**: 14 days --- ## Key Bottlenecks ### 1. 🔴 Secret-Dependent Jobs Fail Outright on Every Fork/External PR (~40–50% of failures) `integration_tests`, `e2e_tests`, and `push-validation` require `ANTHROPIC_API_KEY`, `OPENAI_API_KEY`, `GOOGLE_API_KEY`, and `FORGEJO_TOKEN`. On forked PRs or when secrets are not configured, these jobs fail immediately with authentication errors before doing any useful work. With 444 open PRs and a high volume of bot-created branches, this is the single largest driver of the 69.7% failure rate. **Evidence**: Issue #9767 ([AUTO-INF-3]), issue #9749 ([AUTO-WDOG] CI failure rate critical). ### 2. 🔴 Coverage Job Re-Runs Entire Behave Suite Sequentially (~12 min duplicate work) `nox -s coverage_report` forces a **sequential** re-run of all 624 feature files under Slipcover (`BEHAVE_PARALLEL_COVERAGE=1` disables parallelism). This duplicates the `unit_tests` job (which already ran the same 624 features in ~4–6 min with parallelism) and adds ~12 minutes to the critical path. The `coverage` job also depends on `lint`, `typecheck`, `security`, and `quality`, so it cannot start until those complete. **Evidence**: Issue #8244 ([AUTO-INF-1] Parallelize slipcover coverage), CI log `pr5448/coverage.log:123411`. ### 3. 🔴 Per-Job Bootstrap: apt-get + pip install on Every Job (~9–18 min total overhead) All 12 jobs boot `python:3.13-slim` and run `apt-get update && apt-get install -y nodejs git curl tar` followed by `pip install uv==0.8.0 nox`. This adds 45–90 seconds per job with no caching or retry. Network hiccups on Debian mirrors cause flaky failures before any test code runs. **Evidence**: Issues #9689, #9767, #9772 ([AUTO-INF-4]). ### 4. 🟡 Helm/Kubeconform/Docker Tooling Downloaded on Every Run (No Path-Based Gating) `helm` and `docker` jobs download Helm v3.16.4 and kubeconform v0.7.0 tarballs via `curl` on every run — even for Python-only changes. The `docker` job also starts a Docker daemon from scratch. These jobs run on every PR regardless of whether `k8s/`, `Dockerfile*`, or `pyproject.toml` changed. **Evidence**: Issue #9689 ([AUTO-INF-1] prebuilt runner image), CI workflow analysis. ### 5. 🟡 E2E Tests Have 45-Minute Timeout But Run on Every PR `e2e_tests` has a 45-minute `timeout-minutes` and runs real LLM API calls. It runs on every PR even when changes are documentation-only or test-only. When API keys are present, the job consumes the full timeout budget on slow LLM responses. **Evidence**: CI workflow `e2e_tests` job definition, issue #8381 ([AUTO-INF-1] per-job timeouts). ### 6. 🟡 No Job-Level Timeouts on Most Jobs (Runaway Runs Up to 29+ Hours) Only `e2e_tests` has `timeout-minutes: 45`. All other jobs inherit the runner default and can hang indefinitely. Workflow run #8733 executed for ~29.9 hours before manual cancellation. Runs #17641–#17658 each ran 29–30 hours. **Evidence**: Issue #8381 ([AUTO-INF-1] Add per-job timeouts). ### 7. 🟡 Unlayered Test Execution: Full Suite on Every PR `unit_tests` runs all 624 Behave features and `integration_tests` runs all 316 Robot suites on every PR. There is no smoke subset that can finish quickly and deterministically. 81 active `@tdd_expected_fail` scenarios and 11 Category B scenarios create fragile interactions that flip the suite red. **Evidence**: Issue #9778 ([AUTO-INF-5] Stabilize Behave/Robot test layers). --- ## Optimization Proposals ### P0 — Immediate Reliability Fixes (Target: +20–30 pp reliability gain) #### P0.1: Make Secret-Dependent Jobs Dual-Path (Not Hard-Fail) **Change**: Add a guard step that detects whether `ANTHROPIC_API_KEY` / `OPENAI_API_KEY` / `FORGEJO_TOKEN` are present. When secrets are absent, run an offline fixture mode (`--mock-creds`) and short-circuit `push-validation` to a read-only smoke test. Adjust `status-check` to accept the offline path as success. **Reliability impact**: Eliminates ~40–50% of failures caused by missing secrets on fork PRs. Estimated reliability improvement: **+25–35 pp** (from 30.3% → 55–65% pass rate). **Does not disable any checks**: Both paths exercise the same code; offline mode uses fixture responses. #### P0.2: Add `timeout-minutes` to All Jobs **Change**: Set explicit timeouts — `unit_tests`/`integration_tests`/`coverage`: 60 min; `docker`/`helm`/`build`/`push-validation`: 30 min; `lint`/`typecheck`/`security`/`quality`: 20 min. **Reliability impact**: Prevents 29-hour runaway jobs from consuming runner capacity and blocking the queue. Estimated reliability improvement: **+5–10 pp** (eliminates timeout-induced cascades). #### P0.3: Add Concurrency Cancellation **Change**: Add `concurrency: group: ci-${{ github.ref }}; cancel-in-progress: true` to the workflow. This cancels stale runs when a new commit is pushed to the same branch. **Reliability impact**: Reduces runner queue pressure; stale runs no longer consume slots. Estimated improvement: **+3–5 pp**. ### P1 — Bootstrap Elimination (Target: -9 to -18 min per run, +5–10 pp reliability) #### P1.1: Publish and Use a Pre-Baked CI Runner Image **Change**: Build and publish `cleveragents/ci-runner:py3.13-uv0.8` containing: Python 3.13, Node 20, git, curl, tar, uv 0.8.0, nox, Helm v3.16.4, kubeconform v0.7.0. Update all `container.image` references in `ci.yml`, `nightly-quality.yml`, and `release.yml`. **Performance impact**: Removes `apt-get` + `pip install` from all 12 jobs. Saves **45–90 sec × 12 jobs = 9–18 min per run**. **Reliability impact**: Eliminates network-dependent `apt-get` failures (Debian mirror hiccups). Estimated improvement: **+5–8 pp**. **Reference**: Issues #9689, #9767, #9772 all independently recommend this. #### P1.2: Cache `.nox` Virtualenvs Across Jobs **Change**: Extend the existing `actions/cache@v3` step to save `.nox/` alongside `~/.cache/uv`, keyed on `hashFiles('noxfile.py', 'uv.lock', 'pyproject.toml')`. With `reuse_venv=True` already set in noxfile, subsequent jobs restore the full virtualenv. **Performance impact**: Saves ~30–60 sec per job on cache hit (avoids editable install + test extras reinstall). Estimated saving: **6–12 min per run** on warm cache. **Reliability impact**: Fewer network calls = fewer transient failures. Estimated improvement: **+2–4 pp**. ### P2 — Test Suite Restructuring (Target: -8 to -15 min per run, +5–10 pp reliability) #### P2.1: Parallelize Slipcover Coverage (Eliminate 12-Minute Duplicate Pass) **Change**: Extend `scripts/run_behave_parallel.py` with a chunk mode (`BEHAVE_COVERAGE_CHUNKS=4`): split features into N chunks, run each under `slipcover --json --out build/coverage-chunk-N.json` in parallel via `ProcessPoolExecutor`, then merge with `slipcover --merge`. Set `BEHAVE_COVERAGE_CHUNKS=4` in `ci.yml`. **Performance impact**: Reduces coverage job from ~12 min to ~3–4 min. **Saves ~8–9 min on the critical path**. **Reliability impact**: Shorter jobs = less exposure to transient failures. Estimated improvement: **+3–5 pp**. **Reference**: Issue #8244. #### P2.2: Gate Docker/Helm Jobs by Path Changes **Change**: Add `on.pull_request.paths` filters so `docker` job only runs when `Dockerfile*`, `pyproject.toml`, or `scripts/` change; `helm` job only runs when `k8s/` changes. Provide `workflow_dispatch` escape hatch. **Performance impact**: Skips 5–8 min Docker + 3–5 min Helm on Python-only PRs (estimated 70–80% of PRs). **Reliability impact**: Fewer jobs = fewer failure points on typical PRs. Estimated improvement: **+3–5 pp**. **Reference**: Issue #9689. #### P2.3: Introduce Smoke Tag for Unit/Integration Tests **Change**: Tag a curated subset of Behave scenarios with `@smoke` (estimated 50–80 scenarios covering critical paths). Change `nox -s unit_tests` on PRs to run only `--tags=smoke` (fast, ~1–2 min). Run the full suite only in nightly jobs or when `features/` changes. **Performance impact**: Reduces `unit_tests` from ~4–6 min to ~1–2 min on typical PRs. **Reliability impact**: Smoke subset is deterministic and fast; eliminates flaky coverage-booster scenarios from the PR gate. Estimated improvement: **+5–8 pp**. **Reference**: Issue #9778. ### P3 — Tier the Job Graph (Target: +3–5 pp reliability) #### P3.1: Re-Tier the Workflow Graph **Change**: Group `lint`/`typecheck`/`security`/`quality`/`unit_tests` as Tier 1. Gate `integration_tests`/`e2e_tests`/`docker`/`helm`/`coverage` behind `needs: [lint, typecheck, unit_tests]`. This prevents expensive jobs from running when fast checks fail. **Reliability impact**: Stops wasting 25–35 min of LLM API calls and Docker builds when a lint error would have caught the issue in 2 min. Estimated improvement: **+3–5 pp**. --- ## Combined Expected Impact | Optimization | Reliability Gain | Time Saved | |-------------|-----------------|-----------| | P0.1 Secret dual-path | +25–35 pp | — | | P0.2 Job timeouts | +5–10 pp | Prevents 29-hr hangs | | P0.3 Concurrency cancel | +3–5 pp | Reduces queue pressure | | P1.1 Pre-baked image | +5–8 pp | 9–18 min/run | | P1.2 Cache .nox | +2–4 pp | 6–12 min/run | | P2.1 Parallel coverage | +3–5 pp | 8–9 min/run | | P2.2 Path-gated Docker/Helm | +3–5 pp | 5–13 min/run (70% of PRs) | | P2.3 Smoke tag | +5–8 pp | 3–4 min/run | | P3.1 Tier graph | +3–5 pp | Prevents wasted LLM calls | | **Combined (conservative)** | **+30–40 pp** | **~20–30 min/run** | **Target**: Raise reliability from 30.3% → 60–70% pass rate; reduce median wall-clock from ~27 min → ~8–12 min. --- ## Duplicate Check Searched open issues for `[AUTO-INF-1]` and related topics. The following existing issues cover adjacent but distinct concerns: | Issue | Title | Overlap | Distinct From This Issue | |-------|-------|---------|--------------------------| | #9689 | [AUTO-INF-1] Reduce CI wall-clock with prebuilt runner image | Partial (P1.1) | Focuses only on image prebake; does not cover secret dual-path, coverage parallelism, or smoke tags | | #9148 | [AUTO-INF-1] CI Execution Time: instrument CI telemetry | Partial | Focuses on telemetry/observability, not optimization proposals | | #8544 | [AUTO-INF-1] Optimize CI Pipeline for Faster Execution | Partial (P1.2, P2.2) | Focuses on cache keys and dead_code removal; does not cover secret dual-path or coverage parallelism | | #8381 | [AUTO-INF-1] Add per-job timeouts | Partial (P0.2) | Single-topic: timeouts only | | #8244 | [AUTO-INF-1] Parallelize slipcover coverage | Partial (P2.1) | Single-topic: coverage parallelism only | | #8243 | [AUTO-INF-1] Reduce benchmark_regression job time | Out of scope | Benchmark-specific; benchmark job not in main CI | | #9767 | [AUTO-INF-3] Harden CI workflow reliability | Partial (P0.1, P1.1) | Focuses on runner bootstrapping and secret handling; does not cover coverage parallelism or smoke tags | | #9778 | [AUTO-INF-5] Stabilize Behave/Robot test layers | Partial (P2.3) | Focuses on test architecture; does not cover bootstrap or coverage | | #9772 | [AUTO-INF-4] Fortify dependency security & stabilize CI runners | Partial (P1.1) | Focuses on dependency audit + image prebake | **This issue is distinct** in that it provides a unified, prioritized, cross-cutting analysis of all execution-time bottlenecks with combined reliability impact estimates, and proposes the full optimization roadmap in one place. --- ## References - CI workflow: [`.forgejo/workflows/ci.yml`](https://git.cleverthis.com/cleveragents/cleveragents-core/src/branch/master/.forgejo/workflows/ci.yml) - Nightly quality workflow: [`.forgejo/workflows/nightly-quality.yml`](https://git.cleverthis.com/cleveragents/cleveragents-core/src/branch/master/.forgejo/workflows/nightly-quality.yml) - CI failure rate issue: [#9749](https://git.cleverthis.com/cleveragents/cleveragents-core/issues/9749) — [AUTO-WDOG] 69.7% CI failure rate - Most recent failed run: [Run #8408](https://git.cleverthis.com/cleveragents/cleveragents-core/actions/runs/8408) — `fix(e2e): update lifecycle-list/lifecycle-apply references` - Prior high-runtime run: [Run #4821](https://git.cleverthis.com/cleveragents/cleveragents-core/actions/runs/666) — 131.9 min - Related issues: #9689, #9767, #9778, #9772, #8544, #8381, #8244, #8243, #9148 --- **Automated by CleverAgents Bot** Supervisor: Implementation Pool | Agent: implementation-worker

HAL9000 referenced this issue

2026-04-15 16:05:27 +00:00

[AUTO-INF-2] Stabilize async Behave timing, context tier locks, and parallel runner pre-import #9798

HAL9000 referenced this issue

2026-04-15 16:05:31 +00:00

[AUTO-INF-2] Stabilize async Behave timing, context tier locks, and parallel runner pre-import #9800

HAL9000 referenced this issue

2026-04-15 18:29:15 +00:00

[AUTO-INF-10] Trim static analysis dependencies to cut redundant pip installs #9845

HAL9000 referenced this issue

2026-04-15 18:55:05 +00:00

[AUTO-GROOMER] Status: RUNNING (Cycle 1) #9841

HAL9000 referenced this issue

2026-04-15 23:21:26 +00:00

[AUTO-INF-1] Cut CI wall-clock via coverage merge & Helm caching #9883

HAL9000 referenced this issue

2026-04-15 23:30:14 +00:00

[AUTO-WATCHDOG] Status: System Watchdog Supervisor - Status Tracking (Cycle 2) #9887

HAL9000 referenced this issue

2026-04-16 22:16:44 +00:00

fix(ci): repair broken .nox/typecheck/bin/python3 symlink to restore typecheck CI job #10083

HAL9000 referenced this issue

2026-04-17 01:54:22 +00:00

[AUTO-INF-POOL] Status: Test Infrastructure Pool (Cycle 2) #10099

HAL9000 referenced this issue

2026-04-17 03:52:43 +00:00

[AUTO-INF-POOL] Status: Test Infrastructure Pool (Cycle 3) #10119

HAL9000 referenced this issue

2026-04-17 10:07:16 +00:00

[AUTO-INF-5] `build` and `helm` CI jobs have no `needs` dependencies — run in parallel with quality gates wasting resources on failing code #10240

HAL9000 referenced this issue

2026-04-17 10:12:49 +00:00

[AUTO-INF-5] `release.yml` has no pre-release quality gate — broken releases can be published without running any tests or quality checks #10241

HAL9000 referenced this issue

2026-04-18 08:48:04 +00:00

[AUTO-INF-5] Docker job missing dependencies on integration_tests and e2e_tests — builds images even when tests fail #10329

HAL9000 referenced this issue

2026-04-18 08:48:17 +00:00

[AUTO-INF-5] Coverage job missing dependencies on test jobs — reports coverage even when tests fail #10330

HAL9000 referenced this issue

2026-04-18 08:48:32 +00:00

[AUTO-INF-5] CI missing dependency vulnerability scanning — no pip-audit or safety checks for CVEs #10333

HAL9000 referenced a pull request that will close this issue

2026-04-21 05:59:53 +00:00

docs: consolidate all open documentation PRs into single branch #9784

HAL9000 referenced a pull request that will close this issue

2026-04-23 07:32:58 +00:00

docs: consolidate all open documentation PRs into single branch #10833

HAL9001 referenced a pull request that will close this issue

2026-04-28 07:45:44 +00:00

docs: consolidate all open documentation PRs into single branch #10833

HAL9001 referenced this issue

2026-05-05 08:13:18 +00:00

docs: consolidate all open documentation PRs into single branch #10833

Sign in to join this conversation.

Branches Tags

master

fix/retry-policy-model-missing-fields

fix/plan-explain-rich-output-panels

fix/boundary-cost-budget-warning-re-trigger-7525

feat/plan-correction-8531

fix/1500-impl

fix/1422-docs

feat/issue-6369-actor-context-show

spec/resource-type-yaml-format-canonical-5622

fix/v370/tui-shell-async

bugfix/tui-actor-overlay-render-shadow

improvement/agent-arch-guard-clone-failure

feat/v3.6.0/scope-chain-assembler-integration

fix/action-archive-output-panels

feat/v3.6.0/context-policy-strategy-config

docs/add-example-audit-log-and-security

fix/invariant-service-action-scope-effective

feat/acms-cli-context-add

pr-fix-11196

security/relpath-containment-fallback

feat/invariant-enforcement-validation-pipeline

bugfix/session-export-format-flag

feature/issue-4748-actor-context-list-show-clear

fix/invariant-database-persistence

feat/v3.3.0-merge-conflict-detection

feature/extract-cleveractors-library

feature/9827-wrap-plan-status-json-envelope

pr/9234-hardening-bdd-tags

bugfix/m8-shell-safety-service-integration

test/ci-execution-time-optimize-benchmark-regression

docs/v360/align-depth-reduction-devcontainer

feat/v3.3.0-plan-correct-revert-append

feat/9088-a2a-message-send-stream

fix/plan-status-json-envelope

fix/issue-6500-actor-context-list-regex

fix/issue-6452-session-tell-output

fix/session-tell-stub-missing-panels-and-actor-execution

fix/a2a-plan-execute-full-lifecycle

fix/a2a-dispatch-not-found-error-response

fix/1469-impl

fix/concurrency-catalog-cache-lock-7590

issue-1-conversation-state

fix/validation-list-command

fix/invariant-set-merge-action-scope

pr-fix-7478-startswith-bypass

fix/v370/shell-safety-regex

fix/config-service-remove-undocumented-local-scope

feat/m8/tui-main-screen

fix-11175

feature/7926-persist-decision-dependencies

feature/issue-1923-missing-test-levels-core-module

task/ci-optimize-e2e-tests-execution-time

fix-8640-remove-positional-name

test/v3.8.0-ci-quality-execution-time

fix-sandbox-cache-invalidation

feature/m9-container-lifecycle

fix/invariant-scope-handling

feat/v3.6.0/semantic-context-strategy

pr_fix_8675_switch_project_command

feat/v3.6.0/ollama-mistral-providers

chore/ci-dockerfile-server-security-scan

feat/v3.4.0/acms-context-policy

bugfix/m3-invariant-service-thread-safety

fix/10592-pr-compliance

feat/v3.4.0-acms-budget-enforcement

fix/issue-11047-actor-add-remove-positional-name

feature/m9-a2a-jsonrpc

fix/issue-7604-a2a-event-queue-concurrency

docs/v3.8.0-api-and-module-guides

fix/1443-tier-defaults

fix/tui-bindings-block-cursor-navigation

bugfix/8660-move-namespace-filter-inside-lock

feature/9250-fix-a2a-session-close

pr/9817-plan-apply-json-envelope

feature/pr-9599-plan-correct-correction-engine

bugfix/report-number-of-actors

fix/validation-swap-8177

fix/11041-plan-tree-envelope

tdd/mcp-client-timer-cancel-race

fix/issue-10496-auto-debug-state-mutation

feat/issue-6350-conversation-content-pruning

fix/issue-10503-session-export-json-stdout

feat/issue-6361-shell-safety-service-tui

fix/quality-gates-click82-compat

pr_fix/8209

test/v3.6.0/a2a-rename-regression-tests

docs/session-4615-2026-04-08-cycle1

feat/acms-context-policy-configuration-schema

feat/v360/pluggable-scope-chain-api

fix/issue-6344-plan-execute-rich-output

spec/auto-arch-21-v350-autonomy-hardening

feature/m694-tui-materializer-a2a-integration-layer

feat/v360/cloud-resource-types

spec/checkpoint-trigger-names-and-config-key-fix

feat/tui-v370/tui-materializer

bugfix/m2-plan-explain-alternatives-format

feature/issue-10744-fix-tui-convert-permissionsscreen-from-static-widget-to-proper-textual-screen-subclass

feat/context-priority-strategy

fix/1444-access-type

pr/10589-tui-materializer

feat/v360/plugin-cli-discovery

feat/v3.6.0/adaptive-context-selector

feature/acp-a2a-rename-fix

feature/m39-timeline-day106-cycle2-2026-04-16

pr-fix-11012-pyyaml-upgrade

task/ci-centralize-tool-versions

fix/10496-auto-debug-node-state-mutation

fix/10480-validation-bypass-fix

fix/stdlib-transport-cleanup

pr-fix-10986

fix-pr-4211

fix/gemini-fallback-order-10906

pr-fix-10746

feature/issue-9442-fix-tui-correct-preset-cycling-keybinding-to-ctrl-tab-and-add-persona-tab-cycling

fix/gemini-fallback-order-fix-3

pr-9817-plan-apply-json

bugfix/m3.6.0-lsp-discovery-resource-exhaustion-dos

chore/test-infra-broad-exception-lint

feat/v3.6.0/cost-reporting-cli

test/v360/e2e-project-plan-correction

bugfix/validation-attach-named-option-format

bugfix/m3.6.0-ci-pipeline-flakiness-stabilization

m7-opencode-ruff

feature/issue-10746-fix-agents-graphs-plan-generation-validate-always-passes-for-code-longer-than-10-characters-making-llm-validation-ineffective

feat/issue-10921-a2a-http-transport

bugfix/m3-issue-9055

8660-move-namespace-filter-inside-lock

fix/issue-6331-invariant-add-scope

fix/cli-session-tell-format-flag

fix/9222-guard-integration-e2e-jobs

feature/auto-debug-nodes

fix/8179-remove-session-rollback-calls

feat/a2a-stdio-transport-fix-264

pr-fix-7801

fix-plan-status-envelope-11034

feat/v3.4.0-context-list-add-cli

feat/context-strategy-plugin-system

fix/tui-bindings-reload-settings

fix/pr-10027-acms-default-pipeline

feat/v3.6.0-context-strategy-protocol

feat/plan-correct-revert-append-modes

fix/uat-checkpoint-prune-test-isolation

fix/7527-sandbox-cache-invalidation

feature/issue-10820-chore-agents-fix-bug-hunt-pool-supervisor-tracking-prefix-auto-bug-pool-to-auto-bug-sup-complete-fix

feature/issue-3105-add-mandatory-labels-to-supervisor-tracking-issue-creation

feature/m6-sandbox-correction-invariant-docs

feature/issue-7957-bug-hunt-pool-supervisor-tracking-prefix

fix/v360/scope-chain-resolver-registration

feat/v370/tui-rebase-merge

feat/tui-v370/persona-registry

feat/v3.2.0-decision-recording-persistence

feat/v3.2.0-invariant-data-model-db-schema

feat/v370/tui-settings-sessions-screens

pr_fix/lsp-transport-subprocess-cleanup

fix/events-eventbus-unsubscribe

bugfix/m3-wf18-oom-sigkill

bugfix/m6-acms-path-matching-absolute

timeline/day-104-2026-04-14-auto-time-2

fix/v370/tui-session-persistence

agents/fix-10866-permissions-screen-to-textual-screen

feature/m7-timeline-day-106-update

bugfix/m6-gemini-fallback-order

fix/cleanup-service-sandbox-cache-invalidation

feat/acms-hot-storage-tier-lru-cache

bugfix/9558-plan-conflict-detection

bugfix/m3.6.0-lsp-transport-header-injection-ascii

feat/v370/tui-session-persistence

fix/invariant-service-thread-safety

pr-fix-7527-cache-invalidation

fix/pr-10890-shell-safety-integration

pr-fix-11170

fix/invariant-add-scope

pr-fix-8179-implementation

fix/concurrency-catalog-cache-lock-7590-cleandiff

fix/v360/resource-kind-field

fix/v370/tui-materializer-a2a

feat/v3.4.0-acms-storage-tiers

feat/ci-guard-llm-secrets

docs/add-showcase-cli-basics

fix/file-tools-startswith-bypass

fix-invalidate-sandbox-dirs-cache-after-purge-7527

feature/issue-5163-align-checkpoint-trigger-names

feature/m9-agent-card

cleveragents-pr-fix-11038

fix/actor-add-update-enforcement-fix

fix/10480-validate-logic-error

feat/v370/tui-web-mode

pr-fix-11002-validate-path-bypass

pr-fix-7478-validatepath

fix/isolate-checkpoint-prune-test

fix/issue-10813-strategize-decision-persistence

bugfix/9981-acms-indexing-optimize

feat/tui-v370/persona-registry-merge-v2

fix/plan-tree-color-format-ansi-output

auto-arch/spec-pr-10451-test-coverage

fix/10881-propagate-invariants-to-child-plans

bugfix/m7-audit-session-race

fix/sse-formatter-json-rpc-2.0

task/v3.8.0-ci-reusable-workflows

improvement/agent-ca-test-infra-improver-duplicate-avoidance

improvement/agent-label-compliance

feature/m9-timeline-day-99

docs/changelog-unreleased-cycle7

fix/issue-6316-session-list-json-empty-case

fix/issue-6425-tui-persona-cycling-keybinding

improvement/agent-evolution-pool-supervisor-pr-metadata

fix/project-switch-command

feat/v3.3.0-checkpoint-creation

fix/invariant-merge-action-scope

fix/tui-keybinding-preset-persona-cycling

auto-arch/spec-clarifications-cycle-1

feat/v360/plugin-architecture

feature/m39-auto-arch-23-minor-clarifications

feature/issue-4663-day-97-schedule-adherence-update

feature/issue-4221-docs-add-showcase-example-for-audit-log-and-security-commands

feature/issue-4381-docs-api-and-module-guides

feature/issue-10846-optimize-benchmark-regression-test-suite

bugfix/m3-session-tell-format

bugfix/m3-eventbus-unsubscribe

bugfix/m6-session-delete-format-json-envelope

bugfix/m6-plan-execute-rich-output

feature/issue-4749-split-monolithic-specification

feat/jwt-token-refresh

feat/agent-card-discovery

feature/pr-10916-close-reactive-event-bus

feature/m9-v3.8.0-v3.9.0-documentation

fix/10934-preserve-strategy-decisions-json

test/uko-persistence-coverage

feature/1915-timezone-aware-datetime

fix-gemini-fallback-order-10906

feat/context-show-cli-commands

pr-fix-10593

fix/plan-lifecycle-prompt-decision

pr/9451-fix-tui-thinking-effort-presets

fix/issue-pr-11002

fix/1514-structured-panels

pr-8177-validation-fix

fix-pr-10975-path-matching-normalize

pr-fix-6722-prompt-symbol

pr_fix_8256

pr_fix_8179

fix/pr-11004-tui-token-extraction

fix/9250-session-id-validation-handle-session-close

add-plan-start-alias

pr/fix-9183-bdd-tags

fix/pr-11050-subprocess-cleanup

fix/pyyaml-security-upgrade

pr/11029-review-started-notification

feat/adr-049-layer-boundary-enforcement

fix-lsp-subprocess-cleanup-10597

bugfix/11077-security-escape-bypass

bugfix/10608-lsp-header-injection

bugfix/9608-three-way-merge-engine

fix/8284-warned-sessions-reset

bugfix/9673-acms-budget-enforcement

fix/trailing-comma-opencode-json

bugfix/context-remove-path-traversal-10924

feature-10887-eventbus-unsubscribe

bugfix/mcp-race-condition-start

feature/issue-10952-provider-integration-tests

feature/issue-1925-add-asv-tests-for-domain-module

bugfix/m8-tui-on-input-changed

feature/1928-add-test-coverage-for-tui-module

task/ci-actor-context-mgmt-test-optimization

bugfix/m8-suggestions-query-extraction

fix/v370/quality-gates-command-injection

fix/multi-scope-skill-discovery-9369

fix/issue-7524-invariant-service-thread-safety-v2

bugfix/m3-langgraph-disposables

pr1482

tdd/m8-tui-sqlite-session-persistence

feature/m6-4213-resource-skill-showcase

tdd/mN-registry-thread-safety

feat/v3.3.0-parallel-subplan-scheduler

refactor/auto-guard-1-cli-a2a-boundary

feat/v3.3.0-plan-rollback-cli

feat/context-semantic-chunking-strategy

feat/resources-extension-interface

feature/m9-langgraph-platform

bugfix/m5-validation-attach-output-format

fix/tui-permissions-screen-wrong-base-class

feature/m3111-milestone-based-pr-prioritization

feat/acms-index-data-model

feat/acms-cli-context-show-clear

feat/context-sliding-window-strategy

feat/acms-scope-resolution-context-inheritance

feat/acms-core-pipeline-components

tdd/issue-10413-dollar-prefix-shell-mode

ci/cache-helm-binary-auto-inf-1

fix/issue-10485-fallback-selector-budget-limits

bugfix/m8-set-active-persona-preset-reset

bugfix/mN-registry-thread-safety

docs/v360/cli-version-info-diagnostics

test/v3.6.0/advanced-context-strategies-tests

fix/issue-6464-resource-add-auto-discovery

docs/v360/repl-actor-run-showcase

feat/v360/openrouter-provider

fix/v360/context-strategy-unification

fix/v360/compute-actor-impact-exceptions

docs/v360/actor-removal-impact

bugfix/project-show-resource-name

feat/v3.6.0/context-relevance-scoring

feat/v3.6.0/safety-profile-enforcement

refactor/v360/unify-service-initialization

refactor/v360/unify-error-handling-cli

refactor/v360/unify-api-naming

fix/v360/lsp-path-traversal-file-reading

fix/v360/resource-type-cycle-detection

refactor/v360/audit-rename-acp-imports

bugfix/m3.6.0-lsp-server-dos-message-read-timeout

refactor/clarify-behave-robot-framework-roles

fix/v360/lsp-env-var-injection

fix/v360/plugin-state-executing

feat/v360/anthropic-gemini-backends

refactor/auto-guard-1-address-todo-fixme-comments

fix/v360/remove-acp-module

fix/v360/llm-trace-latency-type

fix/v360/lsp-runtime-instantiation

refactor/v360/decouple-cli-services

feat/v3.6.0/cost-tracker

test/v360/e2e-a2a-context-management

feat/v3.6.0-virtual-resource-types

feat/v360/cost-session-budget

bugfix/m3.6.0-lsp-transport-resource-leak

auto-docs-1-mkdocs-setup

fix/m2-acceptance-test

docs/auto-docs-8-a2a-rename-documentation

feat/v3.6.0-llm-provider-abstraction

perf/acms-large-project-indexing-optimization

docs/timeline-day-107-2026-04-17

improvement/agent-test-infra-health-spam-fix-v2

auto-time/timeline-update-2026-04-18

docs/v3.6.0-v3.7.0-updates

fix/issue-6319-project-context-set-output

feat/v3.3.0-three-way-merge-engine

fix-orchestrator-scaling-32-workers

docs/auto-docs-2-v320-v330-features

feat/pure-graph-bdd-coverage

fix/plan-apply-json-envelope

feat/v3.3.0-merge-strategy-config

fix/project-show-missing-panels

test/cli-lifecycle-e2e-full-plan-lifecycle

timeline/day-105-2026-04-15-auto-time-1-v2

controller-coverage-optimization

feat/v3.4.0-context-show-clear-cli

fix/plan-status-missing-output-panels

auto-inf-3-consolidate-behave-fixtures

fix/plan-artifacts-missing-validation-apply-summary

fix/plan-lifecycle-service-rollback-method

fix/plan-prompt-json-timing-started

timeline/day-104-2026-04-14-auto-time-1

docs/timeline-day-97

fix/context-analysis-agent-path-traversal

improvement/agent-pr-self-reviewer-blocking-vs-nonblocking

fix/agent-task-list-memory-leak

fix/1473-plan-cancel

auto-arch-14/spec-anonymous-tool-enforcement

fix/a2a-facade-optional-param-validation

docs/reference-glossary

fix/invariant-precedence-chain-action-scope

refactor/agent-configurable-limits-context-analysis-plan-generation

feat/v3.2.0-plan-tree-cli

feat/m6/devcontainer-clone-into-sandbox

spec/subplan-system-v3.3.0

test/plan-tree-correction-visual-tdd

fix/action-schema-argument-default-type-validation

ci-quiet-logs

fix/action-schema-env-var-exfiltration

fix/plan-tree-json-missing-decision-id

fix/auto-debug-agent-prompt-injection

feat/output-renderer-registry

fix/issue-9124-add-bdd-tags

test/cli-docstring-example-validation

refactor/add-return-type-get-services

feature/aws-cloud-handler-sdk

test/plan-correct-json-output-tdd

fix/plan-start-spec-alignment

issue-7502-fix-get-for-plan

bugfix/6879-cli-format-option

fix/7566-engine-cache-toctou-race

fix/7927-apply-phase-dod-gating

fix/actor-loader-list-actors-race-condition

fix/issue-7623-validation-pipeline-stdout

spec/add-deleted-at-field-to-project-delete

bugfix/m3-error-handling-fileconfig-unhandled-exception

feat/automation-profile-precedence-chain

fix/auto-rev-sup-tracking-prefix

feat/issue-6450-tui-escape-cascade

fix/config-get-output-missing-origin-panel-and-envelope

coverage-engine-master-port

improvement/agent-uat-tester-parallel-docs-pr-fix

fix/project-service-namespaced-project

fix/issue-6441-session-create-json-output

fix/tui-help-command-full-catalog-listing

fix/issue-6323-project-context-show-output

fix/issue-6457-json-envelope-messages-text

fix/issue-6322-resource-add-url-flag

fix/issue-6325-plan-explain-decision-id

fix/resource-removal-children-check-6886

controller-state-machine

fix/issue-6345-automation-profile-add-output

docs/2026-04-08-unreleased-changelog

spec/tui-clarifications-session-export-persona

docs/add-example-tool-and-validation-management

bugfix/backlog-resource-schema-missing-overlay-strategy

fix/action-argument-schema/misleading-error-message

fix/remove-executable-resource-type

fix/automation-profile-remove-rich-output-panel

fix/container-handler-module-missing

fix/format-output-rich-color-renderers

fix/type-safety-legacy-migrator-type-ignore

spec/update-sse-streaming-event-example

fix/acms-skeleton-compressor-signature

fix/skill-add-yaml-wrapper-key

fix/1476-tool-list-cols

bugfix/permissions-diff-mode-cycle

fix/1429-node-ref

fix/1432-lsp

bugfix/1039-missing-validation-unit-tests-yaml

feature/audit-preserve-event-timestamp

feature/m8-tui-materializer

tdd/m4-automation-profile-di-bypass

fix/1441-ctrl-tab

feature/m9-entity-sync

feature/m9-team-collab

feature/m7-postgresql-backend

fix/issue-11189-config-actor-format

bugfix/m5-actor-options-ignored

fix-11004-tui-suggestions

fix/arg-swap-validation-attachment-8177

pr-fix/9663-hot-warm-cold-tier-reliability

pr_fix-11000-conflict-report

bugfix/m3.6.0-lsp-7044-subprocess-cleanup

fix/7478-file-ops-security-fix

impl-tui-materializer

test/hierarchical-plan-4phase-lifecycle

feature/security-fix-relpath-pr-11217

feature/m2-implementation-pool-supervisor-checklist

fix-file-tools-path-validation

bugfix/m8-tui-input-live-refresh

feature/9126-fix-action-scope-invariant-merge

bugfix/m7-tool-calling-llm-options

fix-7478-startswith-bypass

bugfix/m3-cleanup-subprocess-on-failed-init

bugfix/m8-tui-anthropic-model-name

feat/integrate-cleveractors

feature/m8-tui-llm-dispatch

fix/auto_debug-partial-state

pr-9673-budget-enforcement

pr-9675

fix/issue-7478-inline-executor-startswith-bypass

feat/tui-tuimat-5326

fix-9675-context-show-clear

agents/final-working

fix/10356-eventbus-unsubscribe

11229-fix-acms-hot-max-tokens-regression-tests

pr-8701-invariant-model

pr-fix/10597-lsp-transport-cleanup

pr-fix-9608

dmpipeline-v2

pr-fix-10608-header-injection

pr-9827-fix

bugfix/7492-validation-attachment-argument-swap

pr-fix-11002

feat/v370/multi-session-tabs

fix-branch

AUTO-IMP/PR-10069-checklist

feature/m2-pr-compliance-checklist

feature/pr-10592-cloud-resource-types

fix-lsp-transport-cleanup

feature/context-strategy-protocol

refactor/v3.6.0-acp-to-a2a-rename

fix/context-cli-consolidation

fix/10608-lsp-header-injection

feat/acms-context-index

pr/fix-arg-swap-validation-attachment-8177

fix-cli-plan-status-envelope

pr/9981

pr/11153-auto-debug-fix

fix/validate_path_security

pr-fix-11177-status-check-native-expressions

bugfix/m6-validate-path-startswith

a2a-materializer-pr-fix

pr-fix-10608

bugfix/9250-a2a-session-id-validation-before-cleanup

pr-fix-11053

fix/a2a-handle-session-close-missing-session-id

fix/validation-attachment-arg-swap-8177

pr-fix-11196-invariant

bugfix/m5-fix-hot-max-tokens-tier

pr-fix-9675

perf-fix

pr-9608

feature/ten-way-merge-engine

pr-fix-branch

pr-11217

11101-three-way-merge-engine

fix/remove-silent-argument-swap

fix-pr-11000-structured-conflict-report

pr-fix-11053-session-id-validation

agents/fix-eventbus-unsubscribe

pr-10356

fix/invariant-action-scope

bugfix/issue-8395-sanitise-db-url

bugfix/m3-fix-action-scope-invariant-merge

pr-9671

feature/wire-missing-event-emitters

bugfix/m3.6.0-lsp-transport-post-spawn-cleanup

dmpipeline

bugfix/m5-acms-project-budget-override

fix/iterate-all-actors

pr/11217-fix-prefix-collision-bypass

fix/pr-11011-subprocess-cleanup

pr-11217-fix

pr-11217-relpath-fix

bugfix/m5-revert-acms-budget-assembler

fix/eventbus-unsubscribe

feature/pr-9981

fix/v3.7.0/actor-add-update-flag

agents/fix-invariant-persistence-8573

feat/tui-materializer-a2a

fix/tui-tui-materializer-a2a-event-queue

fix/unsubscribe-eventbus

pr-11153

feature/11201

pr-fix-11153-patched

pr-branch

fix/10813-strategy-decision-persistence

fix-pr-11145-status-check

pr-11053

pr-fix-10597-subprocess-cleanup

bugfix/mcp-infer-resource-slots-null-properties

pr-11166

pr-9675-fix

feat/structural-component-output-validation

pr-fix-9313

fix/pr-11042-rename-render

fix/action-scope-inmerge

fix/wf12-oom-sigkill

fix/wf18-container-clone-e2e

bugfix/m6-actor-overlay-render-shadow

bugfix/m7-plan-strategy-decisions-json

fix/10911-tui-suggestions-query-extraction

fix/lsp-transport-subprocess-cleanup

pr-fix-8177-validation

bugfix/m3-plan-status-json-envelope

fix/invariant-persistence-8573

pr-fix-11037

pr-11015-fix

pr_fix_11015

fix/m1-security-fix-startswith-bypass

fix/automation-profile-gates-lifecycle

fix-status-check-brittle-pipeline-11212

feat/pr-10590-dual-capability-strategies

feat/structural-output-validation

bugfix/m2-ci-status-check-resilience

feature/m3-plan-correction-data-model

pr-fix-10356-unsubscribe

pr-fix-11011

pr_fix/lsp-transport-header-injection-ascii

fix-pr-11002-startswith-bypass-7478

bugfix/acms-project-budget-override

fix/ci-status-check-resilience

bugfix/pr-fix-10597-cleanup-subprocess-on-init-failure

bugfix/sandbox-reexecute-cleanup

pr-fix-8701-invariant-model

fix/test-dotdot-traversal-assertion

fix/cleanup-stale-preserve-commits

fix/security-file-tools-path-traversal-7478

pr-11180-fix

fix-combined-format

fix-9131-invariant-propagation

fix/tui-actor-selection-overlay

pr-11201

merge/pr-11196-invariant-fix

pr/11165

temp-pr-11174

pr-fix-10356-unsubscribe-eventbus

pr-fix-11156-python313-deprecation

feature/pr-7801-fix-validate-path-security

fix/11039-render-refresh

fix/tui-actor-selection-render-rename

pr-fix-11089-session-close-validation

pr-fix/11089-session-close-validation

pr-fix-11182

bugfix/m3-rxpy-subject-close

test/restore-e2e-tests

feature/issue-pr-9271-hot-max-tokens

pr-fix-8177

bugfix/issue-8426-stdio-cleanup

feature/eventbus-unsubscribe

bugfix/m3-integrate-mcp-transport

fix/concurrent-stdout-restoration

PR-fix-wf18

feature/sandbox-cache-invalidation

fix/python-313-asyncio-deprecations

pr-11128

pr-11180

pr-11165

pr-practice

structural-output-validation

fix/status-check-native-expressions

feat/merge-conflict-detection

11036-fix-acms-hot-max-tokens

pr/11166

fix/ci-status-check-native-expressions

fix/11176-actor-selection-render

pr-fix-10597

feature/pr-compliance-pool-supervisor

pr-10590

fix/python313-asyncio-get-event-loop-deprecation

pr-fix-#11053-session-id-validation

pr-fix-11042-renamed-render

feat/v360/acp-to-a2a-rename

fix-arg-swap-validation-attachment-8177

fix/asyncio-get-event-loop-deprecation

fix_8395_pr

pr-fix-11153-auto-debug-mutation

pr/11051-thread-safety-invariant

fix-plan-status-json-envelope

bugfix/pr-11015-pool-supervisor-checklist

feature/fix-7478-validate-path

feature/plans-conflict-detection

pr-11141-cleanup-stale-commits-beyond-head

fix/pyyaml-vulnerability-upgrade

pr-fix-9244

bugfix/m3-invariant-propagation

feature/issue-10480-fix-validation-bypass

feature/m3-invariant-enforcement-validation-pipeline

feat/invariant-enforcement-strategize-phase

issue-10438-fix

fix/mcp-timer-race-10516

feat/agents-invariant-add-list-remove-commands

restore-e2e-cleanup

fix/issue-11120-cleanup-stale-preserve-artifacts

feature/fix-issue-11121-cleanup-stale-reinvoke

fix/issue-10480-plan-validation

feature/m5-tdd-quality-gate

bugfix/11121-fix-cleanup_stale-preserve-meaningful-changes

bugfix/acms-dual-strategy-capabilities-incompatible-fields

feature/benchmark-scheduled-workflow

feature/m8-tui-mainscreen

feat/v3.4.0/acms-project-indexer

fix/10932-preserve-strategy-decisions-json

fix/data-integrity-session-rollback-7489

fix/issue-6329-resource-remove-edge-table

fix/issue-7524-invariant-service-thread-safety

pr-10932-fix-plan-strategy-decisions

pr-fix-9244-pyyaml-upgrade

refactor/noxfile-parallel-test-architecture

task/ci-matrix-strategy-python-versions

feat/v3.3.0-plan-rollback

feature/issue-10755-redirect-rich-panels-to-stderr

pr10871

pr-fix-10901

ci/optimize-benchmarks-regression

fix/tui-extract-at-token-suggestions

feature/m5-add-repo-indexing-showcase

PR-10910-a2a-json-rpc-routing

feature/milestone-based-pr-prioritization

auto-time-3-day106-cycle2

timeline/day-106-cycle2-2026-04-16-auto-time-3

pr/fix-10842

pr-10886

fix/session-delete-json-envelope

pr-10851

pr-10876

fix/gemini-fallback-order

pr/fix/mcp-client-start-race-condition

feat/three-way-merge-engine-9608

pr/9673

fix/1469-plan-execute-structured-panels

fix/actor-provider-validation

implement-pr-9442

cleveragents-push-23420b48

fix/validation-repo-silent-swap

fix/startswith-bypass-7478

fix/invariant-thread-safety

fix-thread-safety-invariant-service

docs/milestone-plan-navigation

feature/implementor-notification-11032

pr9452

pr/fix-9601

pr-8667

fix/10954-security-scan-dockerfile

bugfix/9183-bdd-tag-enforcement

fix/7566-engine_cache-toctou-race

fix/plan-tree-json-output-envelope

pr-9313-fix

bugfix/9244-pyyaml-security-upgrade

test/domain-asv-benchmarks

pr-fix-10958-async-cleanup-tests

fix/action-list-table-columns

fix/issue-7478-validate-path-startswith-bypass

pr-fix-ci-11000

fix/agent-skill-multi-scope-discovery

pr-fix-10982

pr-fix-10937-close-reactive-eventbus

pr-fix-7478-path-traversal

feature/benchmark-scheduled-workflow-fix

pr-9183-add-bdd-tags

fix-plan-status-panels

fix-pr-11037

feat/v3.6.0-database-resource-types

pr-10591-checkout

pr-10979

fix/invariant-thread-safety-8209

fix/10597-lsp-proc-cleanup

fix/plan/tree-envelope-9313

fix-6568-push

pr/11044

feature/m6-reduce-redundant-ci-status-reporting

fix/ca-test-infra-improver-health-spam

agents/pr-6628-fix

auto-time-1-day107-cycle

fix/issue-11047-actor-add-rename-from-config

pr-6741

fix/8675-project-switch

pr-fix-1485-updates

pr/6723-fix-session-create-json

improvement/agent-bug-hunt-pool-supervisor-tracking-prefix-complete

fix/pr-6695-session-list-empty-json

pr-9663-fix

docs/add-example-resource-and-skill-management

feature/m39-cli-basics-showcase

fix/gemini-fallback-order-fix-2

fix/validation-list-command-clean

fix-pr7957-complete-tracking-prefix

pr-7922-fix-lint

feature/pr-8304-container-clone-into

fix-pyyaml-11012

pr-fix-9461

pr/8685-correction-data-model-persistence

bugfix/lsp-stdio-transport-cleanup-10597

pr-8660

feat-scope-chain-resolution

chore/pyyaml-upgrade

fix/issue-7478-file-tools-validate-path

pr-fix-9442-tui-ctrltab

spec/update-cycle8-validation-gate-empty-run-guard

fix/tui-sqlite-session-persistence-10648

fix/8661-plan-start-alias

fix-10649

pr-fix-cache-init

pr9407-timeline

feat/tui-prompt-symbol

pr_fix_9407-plan-alternatives-structured

bugfix/8179-remove-session-rollback-calls

pr-9246

pr-fix-10635-fixed

pr-10069

pr/fix-9313

pr-10643

invariant-pr-8684-fix

pr-fix-6676-resource-remove-edge-table

fix/acms-consolidate-strategycapabilities

pr-fix-8661

fix/9250-validate-session-id-before-cleanup

bugfix/m6-file-tools-validate-path-bypass

bugfix/m3-shell-safety-service-tui

pr-8684-persist-invariants

pr-8209-fix

bugfix/8177-remove-silent-argument-swap

fix/plan-apply-rich-output-panels

pr-fix-11012

pr-fix-8667

pr/fix/11012-pyinsec

pr-fix-9407

pr-8853

bugfix/m3-evlv-9824-implementation-pool-compliance-checklist

pr/10069

docs/pr-creator-state-priority-labels

test/core-asv-benchmarks

pr-fix-10995

refactor/v3.6.0-acp-to-a2a-rename-push

pr-9663

pr-fix-work

pr-8304

pr_fix_1514_v2

timeline-update-2026-04-19

pr-fix-9313-plan-tree-envelope

pr/11004-fix-tui-suggestions-query-extraction

pr-fix-9817

feat/9558-plan-conflict-detection

docs/timeline-day-101

fix/v360/plugin-loader-security

feat/acms-context-policy-fix-9671

pr-fix-9460

pr/9671

pr-fix-9671

pr-10592-fix

fix/issue-7478-file-path-validation

feat/pr-10590-context-strategy-fix

bugfix/pr-9183-bdd-tags

feat/acms-context-show-clear-cli

fix/invariant-add-scope-required

pr-fix-10590-context-strategy

pr-fix-10590-local

pr-8662-fix

pr/1485

pr/9460-project-show-invariants-validations

pr-11013

fix-1469-impl

pr-8257

pr-3329

feat/v3.2.0-decision-recording-strategize

fix/strategize-full-context-snapshots

clone-verify-test

AUTO-IMP/PR-9672-context-list-add

AUTO-IMP/PR-9663-storage-tiers

AUTO-IMP/PR-10583-a2a-rename

fix-check-same-thread-migration-runner

d2188407

fix/a2a-handle-session-close-missing-session-id-pr-9250

pr-fix-8179

bugfix/m6-devcontainer-autodiscovery-wiring

bugfix/m5-event-bus-exception-swallow

pr/3458

acms-parallel-indexing-fix

acms-parallel-indexing

pr-fix-10958

fix/lsp-context-enrichment-acms-wiring

fix/cli-remove-positional-name-from-actor-add

fix/acms-context-cli

bugfix/m6-session-create-suppress-exception-logging

fix-10957

fix/6726-tui-persona-cycling-keybinding

feat/plan-rollback-cli-checkpoint-restore

pr-8661-plan-start-alias

pr/1486/resource-handler-return-type

feature/8667-add-validation-list-command

fix/actor-add-positional-name

improvement/agent-pr-review-pool-supervisor-tracking-prefix-complete

pr/fix/actor-loader-list-actors-race-condition

bugfix/m4-lsp-context-enrichment-acms-wiring

bugfix/m-error-suppression-reactive-registry-adapter-v2

fix/7501-plan-repository-success-derivation

pr-10492

pr-8225

docs/fix-automation-profile-default-supervised

pr-9229-path-traversal-fix

pr-10975

pr/1486/fix-resource-handler-return-type

pr-9257-fix

fix/validation-list-command-fixed

fix-executable-resource

pr-8179

spec/auto-arch-24-a2a-boundary-enforcement-adr

pr/10988/head

pr-fix-9407-plan-explain-structured-alternatives

pr_9454

feat/agent-switch-cmd

pr-9329

8661-plan-start-alias

feat/acms-context-analysis-summaries

fix/invariant-add-repeatable-plan-action

tdd/m6-session-create-suppress-exception

test-push-check-only

pr-10889

pr-10889-fix

pr/10879-benchmark-caching-parallelism

fix/bug-hunt-supervisor-tracking-prefix

fix/issue-6491-actor-remove-format-option

auto-discovered-stale-conflicts-review-task

fix/issue-9169

improvement/reduce-redundant-ci-status-reporting

feat/v3.4.0-acms-index-data-model-traversal

bugfix/m3-sqlite-check-same-thread

bugfix/m3-evlv-implementation-pool-compliance-checklist

docs/quickstart-guide

fix/1431-subgraph

bugfix/7529-a2a-terminal-phase-guard

bugfix/m3-bdd-feature-file-tags

ci/v360/isolate-slow-e2e-tests

feature/m3-consolidate-documentation

feature/m7-user-driven-review-agent

feature/m9-a2a-http

fix/1423-refactor

fix/tui-mainscreen-3state-sidebar-adr044

testbed/m9-hello

docs/add-label-verification-to-new-issue-creator

bugfix/m3-database-migration-runner-check-same-thread

feature/m4-plan-correction-revert

improvement/agent-architecture-pool-supervisor-milestone-assignment

feature/m9-changelog-unreleased-cycle7

fix/issue-10512-mcptooladapter-rlock

fix/data-integrity-llm-trace-repository-7505

agents/auto-working-new

fix/resource-removal-guard-linked-children

fix/1468-impl

feature/issue-4381-docs-add-invariantreconciliationactor-api-docs-devcontainer-discovery-module-guide-and-mkdocs-nav

fix/7619-git-tools-base-env-toctou

pr-fix-8661-updates

feature/issue-2798-chore-agents-improve-ca-test-infra-improver-strengthen-duplicate-avoidance

bugfix/m3-migration-runner-check-same-thread

feature/issue-10952-fix-database-migration-runner-check-same-thread

fix/dependency-security-aiohttp-cves

fix/security-b608-sql-fstring-migration-plan-phases

fix/cli-legacy-removal

bugfix/m3-langgraph-execute-state-bypass

feat/issue-6370-actor-context-clear

bugfix/m3-actor-run-response

fix/tui-auto-generate-presets-actor-schema

feature/issue-1917-optimize-robot-actor-context-management-tests

feature/issue-10803-fix-nox-sessions-use-uv-sync-frozen

bugfix/m3-output-plan-results

pr/9912-fix

bugfix/executor-error-details-overwrite-mini-max

fix-10866-permissions-screen

fix-pr-10852

fix/10922-conversation-state-mgmt

pr-check

bugfix/10931-preserve-strategy-decisions-json

fix/10903-nox-showcase-docs

pr/10885-pyyaml-upgrade

pr-fix-10931

bugfix/executor-error-details-overwrite-qwen

fix-pr-1107-asgi-uvicorn

fix-9912-branch

bugfix/10821-fix-tui-keybinding

fix/redaction-pattern-exception-handling

feature/spec-timeline-6003

feature/spec-timeline-6008

feature/issue-4746-update-spec-agents-diagnostics-all-9-providers

feat/v3.6.0/gemini-provider

pr/8194

tdd/prompt-input-textarea

fix/lsp-transport-security

temp-squash

feat/690-jsonrpc-routing

feat/v3.6.0-anthropic-gemini-backends

build/agents-system-rewrite

feature/issue-10826-docs-spec-align-checkpoint-trigger-names-and-config-key-path-with-implementation

feature/issue-10794-feat-a2a-implement-a2a-http-transport-for-server-mode

fix/tui-preset-cycling

pr-10820

feature/696-implement-a2a-http-transport-for-server-mode

feature/issue-10792-feat-server-langgraph-platform-remotegraph-integration

feature/issue-1486-fix-v3-7-0-resourcehandler-return-type-1444

feature/issue-1488-fix-v3-7-0-resolve-issue-1432

bugfix/m1-plan-execute-sandbox-root

feature/issue-10858-devops-run-linter

docs/milestone-v3.6.0-v3.7.0

feature/issue-10835-add-milestone-based-pr-prioritization

pr-8701-head

feature/m7-actor-management-showcase-metadata

feat/context-dynamic-budget-allocation

feat/acms-semantic-chunking-context-strategy

feat/v360/pluggable-scope-chain-api-v2

docs/v360/actor-management-showcase

fix/pr-10755

feat/v3.6.0/pluggable-scope-chain

feature/m3-timeline-day97-update

feature/m4652-module-guides

feature/m5-extend-agents-diagnostics-example

feature/m5832-add-unreleased-changelog-entries

docs/add-repo-indexing-showcase

feature/issue-8225-validation-gate-empty-summary

bugfix/m8179-fix-data-integrity-remove-session-rollback-calls-from-projectrepository

fix/plan-lifecycle-root-decision-type

bugfix/cancel-worktree-cleanup

pr-10586

pr-9215

feat/issue-6357-tui-loading-states

temp-bug2-combined

docs/consolidated-all-documentation

bugfix/m6-sandbox-reexecute-cleanup

fix/issue-9963-memory-service-timestamp-guards

docs/context-management-deep-dive-v2

docs/context-management-deep-dive

docs/agent-development-guide

feature/10008-file-level-correction-diff

docs/a2a-protocol-guide

docs/tui-user-guide-keybindings

fix/plan-generation-validate-logic

bugfix/issue-10408-dollar-prefix-shell-mode

test/issue-10500-persona-state-reset-tdd

docs/getting-started-tutorial

test/tdd-session-create-suppress-exception

docs/error-codes-guide

docs/common-tasks-recipes-guide

test/migration-runner-sqlite-threading

docs/configuration-reference

pr-10678

pr-10681

test/issue-10510-mcptooladapter-rlock-tdd

feature/tui-screens-directory

fix/issue-10511-suppress-runtimeerror

pr-10676

fix/tui-block-cursor-bindings

pr-10680

test/issue-10502-session-export-json-tdd

fix/issue-10507-sqlite-check-same-thread

docs/installation-setup

test/v3.6.0/scope-chain-integration-tests

fix/v370/loading-throbber-restore

feat/v370/tui-complete-squashed

feat/v3.6.0/budget-enforcement

auto-arch-1-spec-module-definitions

auto-time/timeline-update-2026-04-18-c3

auto-docs-2/add-changelog-contributing

auto-time/timeline-update-2026-04-18-c2

auto-docs-1/fix-mkdocs-nav-and-links

pr-5968

improvement/agent-bug-hunt-pool-supervisor-tracking-prefix

auto-time/update-2026-04-17

auto-docs-3-v340-v350

docs/timeline-update-2026-04-15

auto-docs/initial-documentation-assessment

feature/m1-initial-documentation

bugfix/m4-plan-diff-correction-stub

pr-9247

docs/timeline-update-2026-04-17

timeline/day-106-2026-04-17-auto-time-1

timeline/day-106-2026-04-16-auto-time-1-v2

spec/auto-arch-23-minor-clarifications

timeline/day-106-2026-04-16-auto-time-2

docs/auto-docs-2-v380-v390

bugfix/m3-actor-add-v3-schema-validation

timeline/day-106-2026-04-16-auto-time-1

auto-docs/changelog-architecture-readme

chore/timeline-day-105-2026-04-15

docs/timeline-update-2026-04-15-auto-time-1

timeline/day-105-2026-04-15-auto-time-1

benchmark-ci

fix/plan-phase-migration-raw-sql-root-plan-id

auto-arch-12/spec-acms-context-tier-hydrator

timeline/day-106-2026-04-15-auto-time-1

feat/invariant-enforcement-strategize

feat/plan-tree-decision-rendering

docs/auto-docs-4-fix-conflicts

docs/auto-docs-1-milestone-docs-v3.0.0-v3.1.0

feat/v3.4.0-acms-lifecycle-policy

pr-9220

pr-9214

feat/v3.3.0-subplan-status-tracking

uat/checkpoint-rollback-merge-tests

fix/pr-review-pool-supervisor-prefix-mismatch

feat/v3.3.0-spawn-subplan-step

auto-time-1-day103-cycle1-session6

feat/v3.8.0-agent-card-endpoint

docs/auto-docs-cycle-24-showcase-nav

fix/issue-7663-docs-writer-missing

auto-time-1-day103-cycle2

docs/timeline-day-104-auto-time-1

auto-arch-16/spec-xml-prompt-injection-mitigation

bugfix/m4-invariant-persistence

uat-a2a-facade-tests-v350

bugfix/m3-behave-parallel-failed-chunk-logs

bugfix/7664-automation-tracking-label-requirements

docs/auto-time-1-timeline-update-2026-04-14

docs/auto-docs-1-milestone-v3-updates

docs/action-config-schema-api

fix/bug-hunt-supervisor-nonexistent-file-preflight

docs/validation-gate-empty-run-guard

auto-arch-15/spec-retry-policy-canonical-fields

docs/lockservice-advisory-locking

docs/changelog-plan-fix-4197

spec/milestone-plan-section

docs/update-changelog-recent-features

fix/test-infra-remove-redundant-python-variable-robot-files

timeline/day-104-2026-04-14-cycle2

fix/bdd-feature-file-tags

auto-arch-13/spec-default-automation-profile

docs/auto-docs-cycle-1-2026-04-12

docs/cycle-1-git-worktree-sandbox

spec/architecture-critical-gap-fixes

docs/timeline-day-104-auto-time-2

auto-arch-1/add-v380-v390-milestone-plan

docs/developer-setup-guide

fix/auto-profile-spec-prose-description

auto-arch-10/spec-tui-a2a-integration-layer

spec/resource-event-types-clarification

auto-docs-4/changelog-and-observability

auto-arch-4/adr-049-layered-boundary-enforcement

docs/a2a-protocol-autonomy-hardening

auto-arch-9/spec-v3.8.0-milestone-plan

docs/auto-docs-3-reference-index

auto-arch-7/spec-apply-git-worktree

docs/timeline-day104-cycle1-auto-time-4

docs/auto-docs-cycle-1-changelog-updates

auto-arch-6/adr-049-spec-restructuring

docs/auto-docs-1-v340-acms-context-management

docs/auto-docs-1-v320-v330-cli-reference

auto-arch-5/v3.9.0-milestone-plan

test/create-scripts

auto-time-1-day104

timeline/day-104-2026-04-14

docs/auto-time-4-day103-cycle5

auto-time-3-day103-cycle4

auto-docs-5-architecture-overview

spec/three-way-merge-strategy-v3.3.0

spec/checkpoint-system-v3.3.0

auto-docs-4-api-docs-update

auto-docs-1-changelog-expansion

spec/invariant-management-system-v3.2.0

pr-8289

spec/plan-correction-engine-v3.2.0

spec/layered-architecture-boundary-policy

spec/tui-materializer-a2a-integration-v3.7.0

spec/decision-recording-system-v3.2.0

docs/auto-docs-1-milestone-overview

pr-7484

pr-4212

auto-arch-3/v3.8.0-milestone-plan

auto-docs-6/troubleshooting-and-config

auto-time-1-day103-session5

auto-docs-5/contributor-guide-and-readme

docs/plan-tree-ulid-examples

docs/m3-spec-clarify-path-datetime-plugin-contracts

docs/auto-docs-cycle-10-diagnostics-ref

auto-docs-3/user-guide-and-architecture

docs/cycle-7-changelog-update

spec/reconciliation-failure-behavior

auto-docs-2/api-documentation

auto-arch-2/adr-053-repositories-decomposition

auto-docs-1/release-notes-v3.0-v3.1

spec/update-validation-attach-project-delete

spec/architecture-cycle2-impl-clarifications

auto-arch-1/adr-049-052-violations

auto-time-1-day103

docs/auto-docs-cycle-13-updates

docs/timeline-day-102-auto-time

timeline/day-103-2026-04-13

spec/arch-invariant-cli-completeness

spec/update-cycle1-validation-attach-project-delete

docs/add-session-management-showcase

spec/arch-sandbox-path-correction-cycle9

spec/architecture-v380-milestone-plan

docs/auto-docs-cycle-12-updates

docs/cycle-1-validation-gate-fix

docs/auto-docs-cycle-2-2026-04-10

spec/architecture-cycle-25-new-features

docs/timeline-day-102-2026-04-12

docs/cycle-2-git-worktree-acms-hydrator

spec/arch-sandbox-cleanup-discovery

docs/timeline-day96-2026-04-08

docs/auto-docs-cycle-11

spec/fix-sandbox-strategy-protocol-name

spec/arch-acms-tier-hydration

fix/v3.4.0/context-settings-defaults

docs/add-example-repl-and-actor-run

docs/auto-docs-cycle-10-updates

docs/session-4-2026-04-08-updates

docs/showcase-all-examples-consolidated

docs/acms-context-hydrator-cycle2

docs/add-example-output-format-flags

spec/arch-failfast-cancel-semantics

timeline/day-101-2026-04-11

docs/timeline-day99-2026-04-09-v2

docs/auto-docs-cycle-2-worktree-acms

spec/architecture-v3.8.0-milestone-plan

docs/api-lsp-acms-reference

improvement/agent-bug-hunt-pool-supervisor-yaml-syntax-fix

spec/project-delete-deleted-at-field

spec/architecture-provider-registry-tui-materializer

spec/document-reconciliation-blocked-error-5942

fix/issue-7482-git-log-injection

spec/devcontainer-auto-discovery-schema

docs/update-module-guides-2026-04-10

timeline/day-100-2026-04-10-auto-time-cycle1

timeline/day-99-2026-04-09-auto-time-v2

docs/cycle-3-module-guides

timeline/day-99-2026-04-09-auto-time

pr-4226

spec/additional-llm-providers-gemini-groq-cohere-together-ollama-mistral

spec/document-context-tier-hydrator-6175

docs/timeline-day99-2026-04-09

spec/invariant-cli-clarifications

docs/add-example-project-init-and-context-management

spec/reconciliation-blocked-error-documentation

spec/fix-invariant-precedence-reference-5861

spec/fix-plan-correct-accepts-plan-id-5558

spec/fix-validation-attach-synopsis-5328

docs/timeline-day-99-cycle-1

docs/timeline-day-99-cycle-2

fix/actor-context-list-regex-arg

docs/timeline-day-99-cycle-3

spec/arch-security-mode-init

docs/auto-docs-cycle-9-updates

fix-resource-fix-resource-remove-to-check-correct-edge-table

feat/issue-6434-tui-env-var-expansion

fix/issue-6321-plan-prompt-timing-field

feat/issue-6348-sessions-screen

spec/plan-show-command

temp

feat/harden-label-restrictions-1775753628

spec/invariant-reconciliation-failure-behavior

spec/add-reconciliation-failure-behavior-5942

spec/architecture-corrections-cycle3

spec/fix-ai-provider-interface-5801

spec/azure-api-version-default-update

docs/auto-docs-writer-cycle1-labels

spec/fix-resource-type-yaml-format-5622

spec/add-plan-revert-resume-commands-5574

docs/auto-docs-cycle-1-2026-04-09

spec/plan-correct-plan-id-or-decision-id-5558

spec/fix-subgraph-node-actor-ref-field-5427

issue/5284-master-ci-fix

timeline/day-99-2026-04-09-v2

merge-me

docs/session-3377-initial-docs-update

fix/llm-provider-subpackage-exports

spec/arce-acronym-and-tui-keybinding-fixes

spec/architecture-corrections-cycle2

spec/architecture-corrections-cycle1

docs/cycle-1-updates

docs/session-4940-2026-04-08-cycle1

spec/architecture-milestone-plan-v3.2-v3.7

docs/session-4743-2026-04-08-cycle1

docs/timeline-day-98

docs/timeline-day98-2026-04-08-v2

docs/add-example-action-and-plan-management

docs/session-2026-04-06-updates

docs/ca-docs-writer-v3.8.1-2026-04-05

improvement/agent-arch-guard-clone-failure-handling

fix-tdd-invert-non-assertion-exceptions

bugfix/3472-fix-tdd-inversion-logic

bugfix/989-fix-persistence-json-decode-error

improvement/agent-supervisor-tracking-labels-v2

docs/timeline-day95-v2

docs/timeline-day95-final

docs/update-lsp-api-and-changelog

fix/lsp-resource-handler-module-missing

docs/timeline-day95-final-2026-04-05

fix/a2a-plan-correct-rollback-wiring

docs/add-lsp-api-and-changelog-2026-04-05

fix/tool-registry-validation-type-discriminator

docs/v3.7.0-documentation-update

docs/ca-docs-writer-2026-04-05-cycle2

docs/unreleased-feature-docs

fix/concurrency-cost-tracker-record-usage-race-condition

improvement/agent-ca-test-infra-improver-failure-handling

docs/update-changelog-mcp-plan-ci-2026-04-05

improvement/agent-pr-reviewer-milestone-prioritization

docs/timeline-day95-refresh-2026-04-05

improvement/agent-mandatory-labels-tracking-issues

docs/api-domain-providers-changelog-2026-04-05

docs/ca-docs-writer-2026-04-05

docs/timeline-day95-refresh

fix/skill-add-include-validation

docs/timeline-day-95-2026-04-05-update3

docs/timeline-day-95-2026-04-05-update2

docs/ci-incident-runbook-2597

improvement/agent-ca-test-infra-improver-worker-api-mode

docs/shell-safety-api-and-readme-highlights

docs/timeline-day-55-2026-04-04-v2

docs/timeline-day-55-2026-04-04

docs/timeline-day54-update3

improvement/agent-ca-test-infra-improver-fixes

spec/restructure-monolithic-to-split

docs/timeline-day54-update-v2

docs/timeline-day54-update

fix-agents

docs/shell-safety-and-domain-base-model

fix/1452-impl

fix/1425-test

fix/1426-config

fix/1421-perf

fix/1424-impl

test/int-wf16-devcontainer

feature/m8-tui-persona-export

feature/m7-post-resource-equivalence

test/e2e-m4-acceptance

feature/m6-tantivy-backend

feature/m6-estimation

feature/m6-estimation-report-model

feature/observability-prometheus-audit

feat/server-auth-namespace

feature/m8-session-editing

feature/llm-actor-subplan-wiring

feature/m8-tui-first-run-actor-selection

feature/m8-tui-conversation-block-catalog

feature/m8-tui-settings-screen

feature/m7-e2e-porting

feature/m6-estimation-historical-stats

feature/m8-tui-persona-export-import

feature/m8-tui-sessions-screen

feature/m7-graph-backend

feature/m8-tui-block-context-menu

feature/m8-tui-tool-call-expand

feature/m4-missing-builtin-tools

docs/v3.7.0-release-docs

feature/m8-tui-session-export

test/e2e-wf15-disaster-recovery

test/e2e-wf03-refactoring

test/e2e-m3-acceptance

feature/m8-tui-prompt-history

feature/m8-tui-actor-thought-block-rendering

bugfix/m6-build-hierarchy-child-ids

feature/resource-inheritance-wiring

test/e2e-wf09-session

test/e2e-wf06-doc-generation

test/e2e-wf08-cloud-infra

test/e2e-wf02-test-generation

test/e2e-wf13-custom-profile

test/e2e-wf11-graph-actor

test/e2e-wf01-hello-world

test/int-wf17-explicit-container

test/int-wf12-hierarchical

test/int-wf15-disaster-recovery

test/int-wf13-custom-profile

test/int-wf03-refactoring

test/int-wf11-graph-actor

test/int-wf10-batch

test/int-wf09-session

feature/m3-tdd-issue-consistency-gate

feature/m3-invariant-enforcement-strategize

test/int-wf18-container-clone

test/int-wf01-hello-world

feature/m6-diagnostic-dashboard-health-categories

feature/m6-cli-polish

fix/e2e-db-isolation

feature/m7-post-tui

feature/m9-asgi-endpoint

feature/m7-post-server

tdd/m7-audit-session-race

tdd/m3-skill-add-regression

feature/m9-remote-repos

feature/fs-mount-file-types

tdd/container-resolve-crash

test/e2e-m1-acceptance

test/e2e-m2-acceptance

eugen.thaci-patch-3

eugen.thaci-patch-2

eugen.thaci-patch-1

aditya-fix-latest

feature/m4-secret-masking-llm-context

aditya-fix

refactor/m3-replace-mktemp

refactor/m3-remove-unittest-mock-integration

refactor/m3-remove-robot-mock-imports

refactor/m3-remove-mock-llm-integration

docs/improved-menu-adr

feature/m7-post-auth

feature/m3-fix-resource-bootstrap

feature/post-safety-profile-tests

integration/batch-2026-03-02

feat/slipcover

docs/safety-profile-spec-composition

integrate/freemo-batch-1

feature/m4-error-recovery

feature/m4-security-template

feature/m3-validation-pipeline

develop-aditya-2

feature/m3-diff-review

feature/m3-validation-apply

feature/m6-acp-stubs

feature/m4-correction-flows

feature/m1-plan-execute-runtime

feature/m4-security-exceptions

feature/m4-definition-of-done

feature/m4-correction-model

feature/m1-apply-pipeline

feature/m5-automation-profiles

feature/m2-lsp-stubs

feature/m3-invariants

feature/m1-actor-runtime

feature/docs-v2-restore

feature/m6-perf-scale

feature/m6-validation-edge

feature/m3-session-cli

feature/m1-persistence-tests-robot

feature/m3-config-cli

feature/m1-cli-tests-robot

feature/m5-subplan-tests

feature/m6-review-playbook

feature/aditya-m3-actor-loader

feature/m3-skill-protocol

feature/m4-automation-legacy-cleanup

feature/m3-change-model

feature/m3-skill-git

feature/m3-skill-registry

feature/m4-security-eval

fix/robot-tests

feature/m3-actor-registry

feature/m3-tool-cli

feature/m4-automation-profiles-cli

feature/m2-resource-cli-extensions

feature/m3-actor-loader

feature/m3-tool-domain-robot

feature/m3-skill-domain-robot

feature/m3-skill-cli

feature/m1-resource-db-robot-tests

feature/m3-session-domain-robot

feature/m1-persistence-tests

feature/m1-cli-tests

ten-branches-backup

feature/m3-skill-schema

feature/m3-session-persistence

feature/automation-profiles-and-resource-dag

feature/m1-plan-repo

feature/m1-db-plan-phase-rebaseline

feat/B4-sandbox

feat/B2-cli-wiring

feat/B5-project-persistence

feat/B1-project-data-models

feat/b1-data-models

feat-repo-manager-and-sourcegraph-support

feat/actor-schema

fix/component-isolation-security-fix

feat/ontology-agent

fix/error-handling-security-fix

fix/concurrency-security-fix

fix/serialization-security-fix

fix/server-side-request-forgery-security-fix

fix/file-system-security

fix/template-injection-fix

fix/data-injection-fix

tests/unit-tests

latest/poetry-generator

poetry-generator

config/contract-metadata-extractor

docs/readme-yaml-syntax

config/memory-yaml

fix/double-response

brent-additions

intel_2_demo

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: cleveragents/cleveragents-core#9783

[AUTO-INF-1] Reduce CI execution time for cleveragents-core #9783

Summary

Current CI Behavior

Pipeline Overview

Job/Step Duration Data

Run-Level Statistics (Most Recent 20 Runs)

Key Bottlenecks

1. 🔴 Secret-Dependent Jobs Fail Outright on Every Fork/External PR (~40–50% of failures)

2. 🔴 Coverage Job Re-Runs Entire Behave Suite Sequentially (~12 min duplicate work)

3. 🔴 Per-Job Bootstrap: apt-get + pip install on Every Job (~9–18 min total overhead)

4. 🟡 Helm/Kubeconform/Docker Tooling Downloaded on Every Run (No Path-Based Gating)

5. 🟡 E2E Tests Have 45-Minute Timeout But Run on Every PR

6. 🟡 No Job-Level Timeouts on Most Jobs (Runaway Runs Up to 29+ Hours)

7. 🟡 Unlayered Test Execution: Full Suite on Every PR

Optimization Proposals

P0 — Immediate Reliability Fixes (Target: +20–30 pp reliability gain)

P0.1: Make Secret-Dependent Jobs Dual-Path (Not Hard-Fail)

P0.2: Add timeout-minutes to All Jobs

P0.3: Add Concurrency Cancellation

P1 — Bootstrap Elimination (Target: -9 to -18 min per run, +5–10 pp reliability)

P1.1: Publish and Use a Pre-Baked CI Runner Image

P1.2: Cache .nox Virtualenvs Across Jobs

P2 — Test Suite Restructuring (Target: -8 to -15 min per run, +5–10 pp reliability)

P2.1: Parallelize Slipcover Coverage (Eliminate 12-Minute Duplicate Pass)

P2.2: Gate Docker/Helm Jobs by Path Changes

P2.3: Introduce Smoke Tag for Unit/Integration Tests

P3 — Tier the Job Graph (Target: +3–5 pp reliability)

P3.1: Re-Tier the Workflow Graph

Combined Expected Impact

Duplicate Check

References

P0.2: Add `timeout-minutes` to All Jobs

P1.2: Cache `.nox` Virtualenvs Across Jobs