test(e2e): workflow example 16 — devcontainer-driven development (supervised profile) #818

Merged
hurui200320 merged 1 commit from test/e2e-wf16-devcontainer into master 2026-03-30 11:13:30 +00:00
Owner

Summary

This PR delivers WF16 E2E coverage for devcontainer-driven development (supervised profile). The test exercises the full devcontainer-specific plan lifecycle: auto-detection during resource registration, lazy container build during execution, tool invocation routing to the container workspace, and apply writing changes back to the host filesystem via bind mount.

Closes #762

ISSUES CLOSED: #762

Approach

  • Test structure: Single Robot Framework test case in robot/e2e/wf16_devcontainer.robot following established E2E patterns (WF05, M1, M6).
  • Devcontainer-specific ACs: Four acceptance criteria (AC-3 through AC-6) are checked via pattern-matching on CLI output. Missing indicators are collected and reported as explicit test failures.
  • Expected failure handling: The test is tagged tdd_expected_fail tdd_issue tdd_issue_1208 because devcontainer features are not yet fully wired (#1208 tracks the integration wiring work). The tdd_expected_fail_listener inverts failures in CI until all AC indicators are present — this is the project's standard mechanism per CONTRIBUTING.md.
  • Diagnostic teardown: WF16 Test Teardown keyword captures plan status JSON on failure (mirroring the WF05 pattern).
  • Stderr guards: All Traceback/INTERNAL assertions check both stdout and stderr.
  • Dynamic actor selection: Test selects OpenAI first (to reduce Anthropic credit-quota flakiness), matching WF05/M6 pattern.
  • Parallel safety: UUID-suffixed names prevent UNIQUE constraint collisions.
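
The output checks described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the keywords used in `wf16_devcontainer.robot`: apart from `detected (not built)`, which is mentioned under Known limitations, the indicator patterns and both function names are hypothetical.

```python
import re

# Hypothetical indicator patterns. Only "detected (not built)" appears in this
# PR's text; the other three are placeholders for illustration.
AC_PATTERNS = {
    "AC-3 auto-detection": r"detected \(not built\)",
    "AC-4 lazy build": r"building devcontainer",
    "AC-5 tool routing": r"container workspace",
    "AC-6 apply write-back": r"applied .* to host",
}


def missing_indicators(output: str, patterns: dict[str, str]) -> list[str]:
    """Return the names of criteria whose pattern is absent from the output."""
    return [
        name
        for name, pattern in patterns.items()
        if not re.search(pattern, output, flags=re.IGNORECASE)
    ]


def has_internal_error(stdout: str, stderr: str) -> bool:
    """Return True if either stream contains a Traceback or INTERNAL marker."""
    return any(
        marker in stream
        for stream in (stdout, stderr)
        for marker in ("Traceback", "INTERNAL")
    )
```

Collecting every missing indicator before failing means one run reports all unmet criteria at once instead of stopping at the first.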

Key files

  • robot/e2e/wf16_devcontainer.robot — WF16 E2E test
  • robot/resource_dag.robot — Minor fix: explicit shared_session.close() calls with finally block
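
The `finally`-based cleanup mentioned for `robot/resource_dag.robot` follows the usual try/finally shape. A minimal sketch, assuming a session object with a `close()` method; `FakeSession` and `run_with_session` are hypothetical names, not code from this branch:

```python
class FakeSession:
    """Stand-in for the shared session; the real class is not shown in this PR."""

    def __init__(self) -> None:
        self.closed = False

    def close(self) -> None:
        self.closed = True


def run_with_session(session, work):
    """Run work(session) and close the session whether or not work raises."""
    try:
        return work(session)
    finally:
        # Executes on normal return and on exception alike.
        session.close()
```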

Scope note on robot/resource_dag.robot

A prior robot/resource_dag.robot adjustment (session factory pattern, timeout additions) exists on this branch. Attempting to split or revert it caused an integration regression (Robot.Resource Dag → Cycle Detection Rejects A To B To A failing with ResourceTypeNotFoundError), so it is retained to preserve green quality gates. This change is out of scope for WF16, and a follow-up issue should be created after merge to properly attribute this work.

Changes from review feedback

  1. Added --yes flag to plan apply — All 24 spec examples use --yes; without it the CLI may prompt for confirmation in CI.
  2. Added action name output verification — Output Should Contain ${r_action} ${ACTION_NAME} after action creation, matching m1_acceptance.robot pattern.
  3. Added reusable and read_only fields to action YAML — Consistency with m1_acceptance.robot and m6_acceptance.robot.
  4. Updated TDD tags from tdd_bug/tdd_bug_762 to tdd_issue/tdd_issue_1208 — Aligned with master's tag rename (commit 1878998b) which renamed all tdd_bug/tdd_bug_N tags to tdd_issue/tdd_issue_N. The tdd_issue_<N> tag now correctly references #1208 (the devcontainer wiring ticket) rather than #762 (this test-writing ticket), since the test failure is caused by missing devcontainer integration, not by any defect in the test itself.

Quality gates (latest run — rebased onto abf7b47d)

  • nox -e lint ✅
  • nox -e typecheck ✅ (0 errors)
  • nox -e unit_tests ✅ (498 features, 12822 scenarios, 0 failed)
  • nox -e integration_tests ✅ (1825 tests, 1825 passed, 0 failed)
  • nox -e e2e_tests ✅ (63 tests, 62 passed, 0 failed, 1 skipped — WF16 inverted via tdd_expected_fail listener)
  • nox -e coverage_report ✅ (97% coverage, threshold 97%)

Known limitations

  • Devcontainer features are not yet fully wired (#1208). ACs 3–6 currently fail and are handled by the tdd_expected_fail tag system. When #1208 is completed, the test will pass naturally and the tdd_expected_fail tag should be removed (leaving tdd_issue and tdd_issue_1208 as permanent regression markers).
  • AC-3 evidence remains runtime-output dependent and may warn when environment output omits the expected explicit detected (not built) signal.
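
The tag-driven inversion can be pictured with a small status-mapping function. This is a sketch under assumptions, not the project's `tdd_expected_fail_listener`: the FAIL to SKIP mapping mirrors the "1 skipped" e2e result reported above, while the PASS to FAIL mapping is an assumption about how an unexpected pass might be flagged. The function name is hypothetical.

```python
EXPECTED_FAIL_TAG = "tdd_expected_fail"


def reported_status(status: str, tags: set[str]) -> str:
    """Map a raw test status to the status reported in CI."""
    if EXPECTED_FAIL_TAG not in tags:
        return status  # untagged tests are reported unchanged
    if status == "FAIL":
        # Expected failure: reported as skipped, matching the e2e summary.
        return "SKIP"
    if status == "PASS":
        # Assumption: an unexpected pass is flagged so the tag gets removed.
        return "FAIL"
    return status
```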
freemo added this to the v3.7.0 milestone 2026-03-13 17:13:04 +00:00
freemo force-pushed test/e2e-wf16-devcontainer from 1022517065
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 13s
CI / build (pull_request) Successful in 15s
CI / quality (pull_request) Successful in 17s
CI / e2e_tests (pull_request) Failing after 28s
CI / typecheck (pull_request) Successful in 36s
CI / security (pull_request) Successful in 36s
CI / unit_tests (pull_request) Successful in 2m12s
CI / integration_tests (pull_request) Successful in 2m39s
CI / docker (pull_request) Successful in 35s
CI / coverage (pull_request) Successful in 4m38s
CI / benchmark-regression (pull_request) Has been cancelled
to de87a5c972
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 15s
CI / quality (pull_request) Successful in 19s
CI / build (pull_request) Successful in 15s
CI / security (pull_request) Successful in 31s
CI / e2e_tests (pull_request) Failing after 33s
CI / typecheck (pull_request) Successful in 51s
CI / unit_tests (pull_request) Successful in 2m54s
CI / docker (pull_request) Successful in 47s
CI / integration_tests (pull_request) Successful in 4m7s
CI / coverage (pull_request) Successful in 4m35s
CI / benchmark-regression (pull_request) Has been cancelled
2026-03-13 17:28:49 +00:00
Compare
freemo force-pushed test/e2e-wf16-devcontainer from de87a5c972
to a8dd7c551b
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 20s
CI / quality (pull_request) Successful in 18s
CI / build (pull_request) Successful in 16s
CI / typecheck (pull_request) Successful in 32s
CI / security (pull_request) Successful in 33s
CI / e2e_tests (pull_request) Failing after 32s
CI / unit_tests (pull_request) Successful in 2m48s
CI / integration_tests (pull_request) Successful in 2m52s
CI / docker (pull_request) Successful in 49s
CI / coverage (pull_request) Successful in 5m12s
CI / benchmark-regression (pull_request) Failing after 18m28s
2026-03-13 17:46:58 +00:00
Compare
freemo force-pushed test/e2e-wf16-devcontainer from a8dd7c551b
to 340ac3d53f
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 18s
CI / quality (pull_request) Successful in 15s
CI / build (pull_request) Successful in 16s
CI / typecheck (pull_request) Successful in 31s
CI / e2e_tests (pull_request) Failing after 26s
CI / benchmark-regression (pull_request) Has started running
CI / security (pull_request) Successful in 1m7s
CI / unit_tests (pull_request) Successful in 3m4s
CI / docker (pull_request) Successful in 9s
CI / integration_tests (pull_request) Successful in 3m50s
CI / coverage (pull_request) Successful in 5m4s
2026-03-13 18:13:12 +00:00
Compare
freemo force-pushed test/e2e-wf16-devcontainer from 340ac3d53f
to ad63e7d69f
Some checks failed
CI / lint (pull_request) Successful in 14s
CI / benchmark-publish (pull_request) Has been skipped
CI / quality (pull_request) Successful in 16s
CI / build (pull_request) Successful in 13s
CI / typecheck (pull_request) Successful in 32s
CI / security (pull_request) Successful in 36s
CI / e2e_tests (pull_request) Successful in 42s
CI / unit_tests (pull_request) Successful in 2m26s
CI / integration_tests (pull_request) Successful in 2m39s
CI / docker (pull_request) Successful in 14s
CI / coverage (pull_request) Successful in 5m9s
CI / benchmark-regression (pull_request) Failing after 20m57s
2026-03-13 18:25:24 +00:00
Compare
freemo force-pushed test/e2e-wf16-devcontainer from ad63e7d69f
to 17e8fd2a47
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 15s
CI / build (pull_request) Successful in 17s
CI / quality (pull_request) Successful in 17s
CI / e2e_tests (pull_request) Failing after 24s
CI / security (pull_request) Successful in 32s
CI / typecheck (pull_request) Successful in 34s
CI / unit_tests (pull_request) Successful in 2m15s
CI / integration_tests (pull_request) Successful in 2m37s
CI / docker (pull_request) Successful in 36s
CI / coverage (pull_request) Successful in 5m2s
CI / benchmark-regression (pull_request) Successful in 35m7s
2026-03-13 20:14:05 +00:00
Compare
freemo force-pushed test/e2e-wf16-devcontainer from 17e8fd2a47
to 1c7609bff5
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 13s
CI / build (pull_request) Successful in 16s
CI / quality (pull_request) Successful in 17s
CI / security (pull_request) Successful in 30s
CI / typecheck (pull_request) Successful in 32s
CI / e2e_tests (pull_request) Successful in 41s
CI / unit_tests (pull_request) Successful in 2m6s
CI / docker (pull_request) Successful in 40s
CI / integration_tests (pull_request) Successful in 4m32s
CI / coverage (pull_request) Successful in 5m0s
CI / benchmark-regression (pull_request) Successful in 35m25s
2026-03-13 21:02:55 +00:00
Compare
freemo force-pushed test/e2e-wf16-devcontainer from 1c7609bff5
to 72cccd0358
Some checks failed
CI / lint (pull_request) Successful in 16s
CI / benchmark-publish (pull_request) Has been skipped
CI / quality (pull_request) Successful in 14s
CI / security (pull_request) Successful in 29s
CI / build (pull_request) Successful in 23s
CI / typecheck (pull_request) Successful in 42s
CI / e2e_tests (pull_request) Failing after 34s
CI / unit_tests (pull_request) Successful in 3m29s
CI / integration_tests (pull_request) Successful in 4m1s
CI / docker (pull_request) Successful in 8s
CI / coverage (pull_request) Successful in 5m16s
CI / benchmark-regression (pull_request) Failing after 35m57s
2026-03-13 23:19:51 +00:00
Compare
Author
Owner

PM Review — Day 34

Status: Mergeable, 0 reviews, M8 (v3.7.0)
Author: @freemo

E2E test for WF16 (devcontainer-driven development, supervised profile). M8 — low urgency.

Action Items

Who Action Deadline
Peer review deferred to M8 sprint
Author
Owner

PM Status — Day 36 (2026-03-16)

Day 34 review assignment deadline check. This PR has 0 reviewer activity after 2 days.

Priority note: M3 PRs take precedence. Reviewers should complete M3 reviews first, then address M4+ PRs in milestone order.

Assigned reviewer: Please acknowledge and provide an ETA for your review, or flag if reassignment is needed.

Author
Owner

@hurui200320 I am going to have you take over this PR. It is mostly completed but is waiting on #628 and #966. One is yours and one is Brent's. Please be sure to get this PR and the two blocking PRs I listed in ASAP, thanks.

Author
Owner

PM Status — Day 37

Reviewers assigned. This PR needs at least 2 approving reviews per CONTRIBUTING.md before merge.

Author: Please ensure this PR is rebased on latest master and all quality gates pass before requesting merge.


PM status — Day 37

hurui200320 force-pushed test/e2e-wf16-devcontainer from 72cccd0358
to dcfa7ec8b8
All checks were successful
CI / lint (pull_request) Successful in 15s
CI / typecheck (pull_request) Successful in 46s
CI / security (pull_request) Successful in 41s
CI / quality (pull_request) Successful in 27s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 15s
CI / unit_tests (pull_request) Successful in 5m4s
CI / e2e_tests (pull_request) Successful in 5m25s
CI / integration_tests (pull_request) Successful in 5m47s
CI / docker (pull_request) Successful in 56s
CI / coverage (pull_request) Successful in 6m49s
CI / benchmark-regression (pull_request) Successful in 41m30s
2026-03-18 08:35:56 +00:00
Compare
Author
Owner

Code Review — PR #818

(Cannot submit formal approval — self-authored PR.)

E2E test for WF16. Well-structured with proper labels, milestone, and issue linkage. No issues found.

hurui200320 force-pushed test/e2e-wf16-devcontainer from dcfa7ec8b8
to aab047eb7e
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 18s
CI / build (pull_request) Successful in 18s
CI / quality (pull_request) Successful in 29s
CI / typecheck (pull_request) Successful in 46s
CI / security (pull_request) Successful in 53s
CI / unit_tests (pull_request) Successful in 3m12s
CI / integration_tests (pull_request) Successful in 3m36s
CI / docker (pull_request) Successful in 54s
CI / e2e_tests (pull_request) Successful in 5m53s
CI / coverage (pull_request) Successful in 7m1s
CI / benchmark-regression (pull_request) Successful in 38m44s
2026-03-19 08:23:02 +00:00
Compare
hurui200320 force-pushed test/e2e-wf16-devcontainer from aab047eb7e
to ffe90c6b90
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 18s
CI / build (pull_request) Successful in 24s
CI / quality (pull_request) Successful in 30s
CI / typecheck (pull_request) Successful in 42s
CI / security (pull_request) Successful in 53s
CI / unit_tests (pull_request) Successful in 3m16s
CI / integration_tests (pull_request) Successful in 3m36s
CI / docker (pull_request) Successful in 55s
CI / e2e_tests (pull_request) Successful in 5m58s
CI / coverage (pull_request) Successful in 7m5s
CI / benchmark-regression (pull_request) Has been cancelled
2026-03-19 09:36:11 +00:00
Compare
hurui200320 force-pushed test/e2e-wf16-devcontainer from ffe90c6b90
to 9bb530a1c4
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 16s
CI / build (pull_request) Successful in 16s
CI / quality (pull_request) Successful in 29s
CI / security (pull_request) Successful in 42s
CI / typecheck (pull_request) Successful in 52s
CI / unit_tests (pull_request) Successful in 3m27s
CI / integration_tests (pull_request) Successful in 3m48s
CI / docker (pull_request) Successful in 55s
CI / e2e_tests (pull_request) Failing after 7m1s
CI / coverage (pull_request) Successful in 7m15s
CI / benchmark-regression (pull_request) Successful in 38m18s
2026-03-19 10:02:51 +00:00
Compare
hurui200320 force-pushed test/e2e-wf16-devcontainer from 9bb530a1c4
to 34685e83f8
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 16s
CI / build (pull_request) Successful in 18s
CI / quality (pull_request) Successful in 30s
CI / security (pull_request) Successful in 50s
CI / typecheck (pull_request) Successful in 54s
CI / integration_tests (pull_request) Successful in 3m35s
CI / unit_tests (pull_request) Successful in 3m43s
CI / docker (pull_request) Successful in 9s
CI / e2e_tests (pull_request) Failing after 6m45s
CI / coverage (pull_request) Successful in 7m9s
CI / benchmark-regression (pull_request) Successful in 40m27s
2026-03-19 10:42:15 +00:00
Compare
hurui200320 force-pushed test/e2e-wf16-devcontainer from 34685e83f8
to 395d8bbd97
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 16s
CI / build (pull_request) Successful in 17s
CI / quality (pull_request) Successful in 42s
CI / security (pull_request) Successful in 47s
CI / typecheck (pull_request) Successful in 1m2s
CI / unit_tests (pull_request) Successful in 3m27s
CI / integration_tests (pull_request) Successful in 3m41s
CI / docker (pull_request) Successful in 55s
CI / e2e_tests (pull_request) Successful in 6m0s
CI / coverage (pull_request) Successful in 7m16s
CI / benchmark-regression (pull_request) Successful in 37m39s
2026-03-19 11:35:45 +00:00
Compare
hurui200320 force-pushed test/e2e-wf16-devcontainer from 395d8bbd97
to 97cabf504e
All checks were successful
CI / lint (pull_request) Successful in 16s
CI / typecheck (pull_request) Successful in 38s
CI / security (pull_request) Successful in 50s
CI / quality (pull_request) Successful in 27s
CI / unit_tests (pull_request) Successful in 3m31s
CI / integration_tests (pull_request) Successful in 3m41s
CI / e2e_tests (pull_request) Successful in 6m1s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 15s
CI / coverage (pull_request) Successful in 7m24s
CI / docker (pull_request) Successful in 1m7s
CI / benchmark-regression (pull_request) Successful in 38m32s
2026-03-20 05:37:56 +00:00
Compare
hurui200320 force-pushed test/e2e-wf16-devcontainer from 97cabf504e
to 434790c5c2
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 14s
CI / lint (pull_request) Successful in 4m2s
CI / typecheck (pull_request) Successful in 4m34s
CI / integration_tests (pull_request) Successful in 3m45s
CI / quality (pull_request) Successful in 4m45s
CI / security (pull_request) Successful in 5m15s
CI / e2e_tests (pull_request) Successful in 9m52s
CI / unit_tests (pull_request) Successful in 11m17s
CI / docker (pull_request) Successful in 1m9s
CI / coverage (pull_request) Successful in 12m14s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 1h17m44s
2026-03-23 04:11:16 +00:00
Compare
hurui200320 force-pushed test/e2e-wf16-devcontainer from 434790c5c2
to 9b9a414090
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 36s
CI / lint (pull_request) Successful in 5m14s
CI / quality (pull_request) Successful in 5m54s
CI / typecheck (pull_request) Successful in 5m59s
CI / security (pull_request) Successful in 6m3s
CI / unit_tests (pull_request) Successful in 8m53s
CI / integration_tests (pull_request) Successful in 9m1s
CI / docker (pull_request) Successful in 1m11s
CI / e2e_tests (pull_request) Successful in 11m55s
CI / coverage (pull_request) Successful in 11m8s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 55m51s
2026-03-24 05:40:47 +00:00
CoreRasurae left a comment

Code Review Report — PR #818 (test/e2e-wf16-devcontainer)

Reviewer: Automated deep review (test coverage, test flaws, performance, bug detection, security, spec compliance)
Scope: All changes in branch test/e2e-wf16-devcontainer vs master (commit 9b9a4140 by Rui Hu), cross-referenced against docs/specification.md Example 16 and issue #762 acceptance criteria.
Method: Multiple global analysis cycles across all categories until convergence (no new findings).


Summary

  • 9 findings across 5 categories
  • 1 Critical, 3 High, 3 Medium, 2 Low
  • The WF16 Robot test itself is well-structured and follows established E2E patterns, but a spec-compliance bug in plan apply and several out-of-scope deletions require attention before merge.

1. Specification Compliance / Bug

1.1 [CRITICAL] Missing --yes flag on plan apply

File: robot/e2e/wf16_devcontainer.robot:282

The test invokes:

plan    apply    ${plan_id}    --format    json

But the specification Example 16 Step 5 (spec line 42536) explicitly uses:

$ agents plan apply --yes 01J0B2C3D4E5F6G7H8J9K0L1M2

All 24 plan apply examples in the specification include --yes. The existing m1_acceptance.robot:98 also uses --yes. Without this flag, the command may prompt for user confirmation in non-interactive mode, causing the test to hang until the 180s timeout and then fail.

Fix: Add --yes before ${plan_id}:

...    plan    apply    --yes    ${plan_id}
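
The hang described in this finding is a general hazard when a test harness drives a CLI. As a minimal Python sketch (not the project's `Run CleverAgents Command` keyword, whose implementation is not shown in this review), a runner can close stdin and enforce a timeout so that an unexpected prompt fails fast instead of blocking. The helper name `run_noninteractive` and the stand-in child command are hypothetical.

```python
import subprocess
import sys


def run_noninteractive(argv, timeout_s=180):
    """Run a command with stdin closed and a hard timeout.

    Hypothetical helper for illustration; it is not part of the project.
    """
    return subprocess.run(
        argv,
        stdin=subprocess.DEVNULL,  # a confirmation prompt sees EOF immediately
        capture_output=True,
        text=True,
        timeout=timeout_s,  # raises subprocess.TimeoutExpired on a hang
    )


# Stand-in for a CLI that asks for confirmation: input() hits EOF and the
# child exits non-zero instead of waiting for a keypress.
prompting_child = [sys.executable, "-c", "input('Apply changes? [y/N] ')"]
result = run_noninteractive(prompting_child, timeout_s=30)
print("child failed fast:", result.returncode != 0)
```

Passing `--yes` remains the actual fix; a guard like this only turns a silent hang into an immediate, diagnosable failure.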

2. Scope / Regression Risk

2.1 [HIGH] Out-of-scope deletion of bug #647 regression tests

Files: features/container_resolve_crash.feature, features/steps/container_resolve_crash_steps.py, robot/container_resolve_crash.robot, robot/helper_container_resolve_crash.py (~649 lines deleted)

These regression tests were merged via PR #1053 to guard against re-introducing the Container.resolve() crash in plan tree, plan explain, and plan correct commands. Their deletion is unrelated to WF16 devcontainer E2E testing and is not referenced in the commit message or issue #762 acceptance criteria. Removing regression guards without justification risks re-introducing the original bug.

Recommendation: Revert these deletions or move them to a separate PR with explicit justification.

2.2 [HIGH] Out-of-scope deletion of Settings.reset() and associated tests

Files: src/cleveragents/config/settings.py (13 lines), features/settings_configuration.feature (10 lines), features/steps/settings_steps.py (~45 lines)

The Settings.reset() classmethod (a documented test-only utility) is removed from production code, along with the "Settings reset clears singleton cache" test scenario and 4 step definitions (step_load_singleton_settings_via_get_settings, step_reset_singleton_settings, step_singleton_instances_should_be_different, step_singleton_environment_should_be). The @when decorator on step_set_env_var is also removed. These changes are unrelated to WF16.

Recommendation: Revert these changes or move them to a separate PR.

2.3 [HIGH] CHANGELOG removes unrelated bug #647 entry

File: CHANGELOG.md (around line 202)

The WF16 changelog addition is appropriate, but the removal of the bug #647 changelog entry ("Added TDD regression tests for bug #647...") is unrelated scope creep and inconsistent with preserving project history.

Recommendation: Keep the existing bug #647 CHANGELOG entry.


3. Data Integrity

3.1 [MEDIUM] Duplicate contributor entry

File: CONTRIBUTORS.md:5,8

"Rui Hu <rui.hu@cleverthis.com>" appears on both line 5 and line 8.

Fix: Remove the duplicate entry on line 8.

3.2 [MEDIUM] Removal of contributor without justification

File: CONTRIBUTORS.md

"Aditya Chhabra <aditya.chhabra@cleverthis.com>" was removed. Aditya is referenced extensively in docs/timeline.md as a contributor to multiple milestones (actor YAML, MCP adapter, agent skills, ACMS context pipeline, etc.). This removal is unrelated to WF16 and lacks justification.

Recommendation: Restore the entry or explain the removal in a separate PR.


4. Test Quality / Coverage

4.1 [MEDIUM] All devcontainer acceptance criteria are soft assertions

File: robot/e2e/wf16_devcontainer.robot (lines 130-319)

All four devcontainer-specific acceptance criteria use conditional IF/ELSE blocks that only log WARN when expected indicators are absent:

  • AC-3 (auto-detection, line 138): logs warning if devcontainer not in output
  • AC-4 (lazy build, line 236): logs warning if building+devcontainer not in output
  • AC-5 (container routing, lines 246-250): logs warning if no routing indicators
  • AC-6 (host write, lines 306-314): logs warning if HEAD unchanged and no bind-mount indicators

The test can pass with zero devcontainer-specific behavior verified, making it functionally a generic plan lifecycle test. While the TODO comments (TODO(#762)) explain this is intentional pending feature wiring, the issue's acceptance criteria list these as checkable items. Consider adding a summary log or counter at the end indicating how many AC checks actually passed vs. warned.
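
The suggested summary counter could look roughly like the following Python sketch. It only illustrates the counting logic, assuming each AC check reduces to a boolean. The indicator strings and the sample output are hypothetical, and the real test is written in Robot Framework and inspects actual CLI output.

```python
def summarize_ac_checks(results):
    """Return (passed, missing, summary line) for a mapping of AC id -> bool."""
    passed = [ac for ac, ok in results.items() if ok]
    missing = [ac for ac, ok in results.items() if not ok]
    summary = (
        f"devcontainer AC checks: {len(passed)}/{len(results)} passed; "
        f"missing: {', '.join(missing) if missing else 'none'}"
    )
    return passed, missing, summary


# Hypothetical CLI output and indicator strings, for illustration only.
output = "resource registered (devcontainer detected)\nplan applied".lower()
results = {
    "AC-3": "devcontainer" in output,
    "AC-4": "building" in output and "devcontainer" in output,
    "AC-5": "container workspace" in output,
    "AC-6": "bind mount" in output,
}
passed, missing, summary = summarize_ac_checks(results)
print(summary)
```

Logging one such line at the end of the test makes it obvious from the report how much devcontainer-specific behavior a passing run actually verified.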

4.2 [LOW] No output verification for action creation

File: robot/e2e/wf16_devcontainer.robot:170-174

After action create --config, the test checks for Traceback and INTERNAL absence but does not verify the action name appears in the output. Compare with m1_acceptance.robot:43 which uses Output Should Contain. If action creation behaves unexpectedly (rc=0 but wrong action name), downstream failures would be confusing.

Suggestion: Add Output Should Contain ${r_action} ${ACTION_NAME} after line 173.

4.3 [LOW] Action YAML missing reusable and read_only fields

File: robot/e2e/wf16_devcontainer.robot:162-167

The WF16 action YAML omits reusable and read_only fields that m1_acceptance.robot and m6_acceptance.robot include. While these are likely optional with sensible defaults, adding them would improve consistency with established E2E patterns.


5. No Issues Found

The following categories were analyzed across multiple cycles with no findings:

  • Security: No credential exposure, no shell=True usage, API keys handled via environment variables.
  • Performance: Timeout values (20min test, 180-300s commands) are reasonable for E2E LLM tests. The defensive second execute call (line 219) with 300s timeout is a minor inefficiency but justified by the comment.
  • Robot Framework syntax: All Evaluate expressions, variable references, and keyword calls are syntactically correct.
  • Resource cleanup: SUITE_HOME files (action YAML, temp repos) are cleaned up by E2E Suite Teardown.
  • Parallel safety: UUID-suffixed names prevent UNIQUE constraint collisions in parallel CI.
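
The parallel-safety point can be illustrated with a short Python sketch of UUID-suffixed naming. The helper name `unique_name`, the `wf16-action` prefix, and the 12-character suffix length are assumptions for illustration; the review does not show how the test builds its names.

```python
import uuid


def unique_name(prefix):
    """Append a random hex suffix so concurrent runs are very unlikely to collide."""
    return f"{prefix}-{uuid.uuid4().hex[:12]}"


# Simulate many workers creating an action with the same base name.
names = {unique_name("wf16-action") for _ in range(100)}
print(len(names), "distinct names generated")
```

Because each run registers its own distinct name, two workers inserting at the same time do not trip a UNIQUE constraint on that column.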

Verdict

REQUEST_CHANGES — The missing --yes flag (finding 1.1) is a likely test failure in non-interactive CI. The out-of-scope deletions (findings 2.1-2.3, 3.2) should be reverted or split into separate PRs. The duplicate contributor entry (finding 3.1) needs a quick fix.

CONTRIBUTORS.md Outdated
Member

[MEDIUM] "Aditya Chhabra" was removed from the contributor list without justification. This contributor is referenced extensively in docs/timeline.md across multiple milestones.

CONTRIBUTORS.md Outdated
Member

[MEDIUM] Duplicate entry: "Rui Hu" appears on both line 5 and line 8. Remove one of them.

@@ -0,0 +171,4 @@
    ...    action    create    --config    ${action_path}
    Should Not Contain    ${r_action.stdout}${r_action.stderr}    Traceback
    Should Not Contain    ${r_action.stdout}${r_action.stderr}    INTERNAL
    Log    Action created: ${ACTION_NAME} with actor ${SELECTED_ACTOR}
Member

[LOW] Consider adding Output Should Contain ${r_action} ${ACTION_NAME} after this block for parity with m1_acceptance.robot:43, which verifies the action name in the output.

@@ -0,0 +279,4 @@
    # with a plan ID calls ``_lifecycle_apply_with_id`` which completes
    # the full transition to the terminal Apply/applied state.
    ${r_apply}=    Run CleverAgents Command
    ...    plan    apply    ${plan_id}
Member

[CRITICAL] Missing --yes flag. The specification Example 16 Step 5 explicitly uses agents plan apply --yes <PLAN_ID> (spec line 42536). All 24 plan apply examples in the spec use --yes. Without it, the command may prompt for confirmation, hanging the test until the 180s timeout.

Fix: Change to:

...    plan    apply    --yes    ${plan_id}
hurui200320 force-pushed test/e2e-wf16-devcontainer from 9b9a414090
to f5d8e17c32
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 17s
CI / lint (pull_request) Successful in 3m17s
CI / typecheck (pull_request) Successful in 3m47s
CI / security (pull_request) Successful in 4m1s
CI / integration_tests (pull_request) Successful in 6m39s
CI / unit_tests (pull_request) Successful in 6m48s
CI / docker (pull_request) Successful in 10s
CI / e2e_tests (pull_request) Successful in 8m18s
CI / coverage (pull_request) Successful in 10m1s
CI / quality (pull_request) Failing after 16m43s
CI / benchmark-regression (pull_request) Successful in 55m24s
CI / status-check (pull_request) Successful in 1s
2026-03-24 12:15:57 +00:00
Member

Review Fix Summary — Cycle 5

Addressed review comments from @CoreRasurae (Review #2694). All fixes applied, rebased onto latest master (a854de7e), and force pushed.

Fixes Applied

  1. [CRITICAL] Finding 1.1 — Added --yes flag to plan apply
    Changed plan apply ${plan_id} → plan apply --yes ${plan_id} in robot/e2e/wf16_devcontainer.robot. All 24 plan apply examples in the spec use --yes, and m1_acceptance.robot also uses it. Without this flag, the CLI could prompt for confirmation in non-interactive CI.

  2. [LOW] Finding 4.2 — Added action name output verification
    Added Output Should Contain ${r_action} ${ACTION_NAME} after action creation, matching the m1_acceptance.robot pattern for parity.

  3. [LOW] Finding 4.3 — Added reusable and read_only fields
    Added reusable: true and read_only: false to the action YAML for consistency with m1_acceptance.robot and m6_acceptance.robot.

Findings Not Applicable (False Positives)

Findings 2.1–2.3 and 3.1–3.2 reference files NOT changed in this PR:

  • 2.1 (bug #647 regression test deletions): Not in PR diff
  • 2.2 (Settings.reset() deletion): Not in PR diff
  • 2.3 (CHANGELOG bug #647 entry removal): CHANGELOG only has additions (7 added, 0 deleted)
  • 3.1 (duplicate CONTRIBUTORS.md entry): CONTRIBUTORS.md unchanged in this PR
  • 3.2 (contributor removal): CONTRIBUTORS.md unchanged in this PR

Verified via git diff --name-status origin/master...HEAD — only 2 files: M CHANGELOG.md, A robot/e2e/wf16_devcontainer.robot.
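
A check of this kind can be scripted. The Python sketch below parses `git diff --name-status` output (tab-separated status and path) and tests whether paths named in a finding appear in the diff. The sample string reproduces only the two entries reported above, and `parse_name_status` is a hypothetical helper, not project code.

```python
def parse_name_status(text):
    """Map each path in `git diff --name-status` output to its status letter."""
    changed = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        status, *paths = line.split("\t")
        # Renames and copies (R100, C75, ...) list old and new paths; key by the new one.
        changed[paths[-1]] = status[0]
    return changed


# The two entries reported in the comment, in git's tab-separated format.
sample = "M\tCHANGELOG.md\nA\trobot/e2e/wf16_devcontainer.robot\n"
changed = parse_name_status(sample)

# Paths named in findings 3.1 and 2.2, checked against the diff.
for path in ("CONTRIBUTORS.md", "src/cleveragents/config/settings.py"):
    print(path, "in diff:", path in changed)
```

In practice the input would come from running the git command against the branch rather than from a hard-coded string.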

Informational

  • Finding 4.1 (soft assertions): By design — all devcontainer-specific AC checks are conditional with # TODO(#762) annotations, as documented in the PR description. These will be upgraded to hard assertions when devcontainer features are wired.

Quality Gates

  • nox -e lint ✅
  • nox -e typecheck ✅ (0 errors)
  • nox -e unit_tests ✅ (464 features, 12295 scenarios, 0 failed)
  • nox -e integration_tests ✅ (all passed)
  • nox -e e2e_tests ✅ (38 tests, 38 passed — including WF16)
  • nox -e coverage_report ✅ (98% coverage)
hurui200320 force-pushed test/e2e-wf16-devcontainer from f5d8e17c32
to 50fb9eac96
All checks were successful
CI / lint (pull_request) Successful in 3m20s
CI / build (pull_request) Successful in 21s
CI / quality (pull_request) Successful in 4m14s
CI / typecheck (pull_request) Successful in 4m26s
CI / security (pull_request) Successful in 4m36s
CI / integration_tests (pull_request) Successful in 7m29s
CI / unit_tests (pull_request) Successful in 7m35s
CI / docker (pull_request) Successful in 56s
CI / e2e_tests (pull_request) Successful in 10m27s
CI / benchmark-publish (pull_request) Has been skipped
CI / coverage (pull_request) Successful in 11m33s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 51m1s
2026-03-26 08:47:04 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from 50fb9eac96
to fc3b38e370
Some checks are pending
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 21s
CI / lint (pull_request) Successful in 3m17s
CI / typecheck (pull_request) Successful in 3m47s
CI / benchmark-regression (pull_request) Has started running
CI / quality (pull_request) Successful in 3m55s
CI / security (pull_request) Successful in 4m2s
CI / integration_tests (pull_request) Successful in 8m47s
CI / unit_tests (pull_request) Successful in 9m2s
CI / e2e_tests (pull_request) Successful in 9m34s
CI / docker (pull_request) Successful in 1m5s
CI / coverage (pull_request) Successful in 11m20s
CI / status-check (pull_request) Successful in 1s
2026-03-26 10:14:06 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from fc3b38e370
to 7d9c071567
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 15s
CI / lint (pull_request) Successful in 3m19s
CI / quality (pull_request) Successful in 3m44s
CI / security (pull_request) Successful in 4m10s
CI / typecheck (pull_request) Successful in 4m25s
CI / unit_tests (pull_request) Successful in 7m21s
CI / docker (pull_request) Successful in 1m13s
CI / integration_tests (pull_request) Successful in 7m16s
CI / e2e_tests (pull_request) Failing after 8m14s
CI / coverage (pull_request) Successful in 11m47s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 52m11s
2026-03-26 11:05:04 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from 7d9c071567
to ece8b83359
Some checks failed
CI / docker (pull_request) Blocked by required conditions
CI / coverage (pull_request) Blocked by required conditions
CI / benchmark-regression (pull_request) Blocked by required conditions
CI / status-check (pull_request) Blocked by required conditions
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 19s
CI / lint (pull_request) Successful in 3m19s
CI / integration_tests (pull_request) Failing after 10m34s
CI / unit_tests (pull_request) Failing after 10m34s
CI / security (pull_request) Failing after 10m34s
CI / e2e_tests (pull_request) Failing after 15m34s
CI / quality (pull_request) Failing after 15m34s
CI / typecheck (pull_request) Failing after 15m34s
2026-03-26 12:43:20 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from ece8b83359
to 1a542b7fd6
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 25s
CI / lint (pull_request) Successful in 3m45s
CI / typecheck (pull_request) Successful in 4m6s
CI / security (pull_request) Successful in 4m29s
CI / quality (pull_request) Successful in 3m43s
CI / unit_tests (pull_request) Successful in 4m1s
CI / docker (pull_request) Successful in 1m20s
CI / integration_tests (pull_request) Successful in 6m23s
CI / e2e_tests (pull_request) Successful in 10m9s
CI / coverage (pull_request) Successful in 13m24s
CI / status-check (pull_request) Successful in 3s
CI / benchmark-regression (pull_request) Has been cancelled
2026-03-26 15:53:50 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from 1a542b7fd6
to a9929f2ef8
Some checks failed
CI / build (pull_request) Successful in 17s
CI / lint (pull_request) Successful in 3m20s
CI / quality (pull_request) Successful in 3m42s
CI / typecheck (pull_request) Successful in 3m56s
CI / security (pull_request) Successful in 4m1s
CI / integration_tests (pull_request) Successful in 5m59s
CI / unit_tests (pull_request) Successful in 6m15s
CI / docker (pull_request) Successful in 1m11s
CI / e2e_tests (pull_request) Successful in 10m23s
CI / coverage (pull_request) Successful in 10m13s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-publish (pull_request) Has been skipped
CI / benchmark-regression (pull_request) Failing after 23m28s
2026-03-26 16:26:07 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from a9929f2ef8
to 8a0086ab3a
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 33s
CI / lint (pull_request) Successful in 6m14s
CI / quality (pull_request) Successful in 6m40s
CI / security (pull_request) Successful in 6m58s
CI / typecheck (pull_request) Successful in 7m5s
CI / integration_tests (pull_request) Successful in 9m58s
CI / unit_tests (pull_request) Successful in 12m48s
CI / e2e_tests (pull_request) Failing after 12m57s
CI / docker (pull_request) Successful in 1m8s
CI / coverage (pull_request) Successful in 11m25s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 58m26s
2026-03-27 09:59:50 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from 8a0086ab3a
to 616c1fc68b
Some checks failed
CI / lint (pull_request) Successful in 18s
CI / typecheck (pull_request) Successful in 48s
CI / quality (pull_request) Successful in 34s
CI / security (pull_request) Successful in 57s
CI / build (pull_request) Successful in 20s
CI / helm (pull_request) Successful in 22s
CI / integration_tests (pull_request) Successful in 3m57s
CI / unit_tests (pull_request) Successful in 4m36s
CI / docker (pull_request) Successful in 21s
CI / coverage (pull_request) Successful in 8m32s
CI / e2e_tests (pull_request) Has been cancelled
CI / status-check (pull_request) Has been cancelled
CI / benchmark-publish (pull_request) Has been cancelled
CI / benchmark-regression (pull_request) Has been cancelled
2026-03-30 09:37:39 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from 616c1fc68b
to 55661dd924
Some checks failed
CI / lint (pull_request) Successful in 3m29s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 24s
CI / helm (pull_request) Successful in 30s
CI / typecheck (pull_request) Successful in 1m12s
CI / quality (pull_request) Successful in 4m9s
CI / security (pull_request) Successful in 4m28s
CI / coverage (pull_request) Successful in 8m37s
CI / integration_tests (pull_request) Successful in 9m43s
CI / unit_tests (pull_request) Successful in 9m59s
CI / docker (pull_request) Successful in 1m32s
CI / e2e_tests (pull_request) Successful in 18m53s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Has been cancelled
2026-03-30 09:56:45 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from 55661dd924
to dacfb43100
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / typecheck (pull_request) Has been cancelled
CI / integration_tests (pull_request) Has been cancelled
CI / e2e_tests (pull_request) Has been cancelled
CI / quality (pull_request) Has been cancelled
CI / lint (pull_request) Has been cancelled
CI / security (pull_request) Has been cancelled
CI / unit_tests (pull_request) Has been cancelled
CI / coverage (pull_request) Has been cancelled
CI / build (pull_request) Has been cancelled
CI / docker (pull_request) Has been cancelled
CI / status-check (pull_request) Has been cancelled
CI / benchmark-regression (pull_request) Has been cancelled
CI / helm (pull_request) Has been cancelled
2026-03-30 10:30:16 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from dacfb43100
to 54e0c65fd4
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 20s
CI / quality (pull_request) Successful in 53s
CI / typecheck (pull_request) Successful in 1m0s
CI / security (pull_request) Successful in 59s
CI / build (pull_request) Successful in 20s
CI / helm (pull_request) Successful in 23s
CI / unit_tests (pull_request) Successful in 4m9s
CI / integration_tests (pull_request) Successful in 3m57s
CI / docker (pull_request) Successful in 2m15s
CI / benchmark-regression (pull_request) Has been cancelled
CI / coverage (pull_request) Has been cancelled
CI / e2e_tests (pull_request) Has been cancelled
CI / status-check (pull_request) Has been cancelled
2026-03-30 10:32:28 +00:00
hurui200320 scheduled this pull request to auto merge when all checks succeed 2026-03-30 10:34:46 +00:00
hurui200320 force-pushed test/e2e-wf16-devcontainer from 54e0c65fd4
to b6c3169634
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 28s
CI / quality (pull_request) Successful in 32s
CI / security (pull_request) Successful in 58s
CI / build (pull_request) Successful in 14s
CI / helm (pull_request) Successful in 22s
CI / typecheck (pull_request) Successful in 3m54s
CI / unit_tests (pull_request) Successful in 4m9s
CI / integration_tests (pull_request) Successful in 7m12s
CI / docker (pull_request) Successful in 22s
CI / e2e_tests (pull_request) Successful in 14m37s
CI / coverage (pull_request) Successful in 10m11s
CI / status-check (pull_request) Successful in 1s
CI / lint (push) Successful in 16s
CI / build (push) Successful in 15s
CI / helm (push) Successful in 22s
CI / typecheck (push) Successful in 3m53s
CI / benchmark-regression (push) Has been skipped
CI / security (push) Successful in 4m5s
CI / quality (push) Successful in 3m41s
CI / unit_tests (push) Successful in 7m15s
CI / integration_tests (push) Successful in 7m34s
CI / docker (push) Successful in 1m41s
CI / e2e_tests (push) Successful in 16m45s
CI / coverage (push) Successful in 11m54s
CI / status-check (push) Successful in 1s
CI / benchmark-publish (push) Has been cancelled
CI / benchmark-regression (pull_request) Successful in 1h5m26s
2026-03-30 10:45:21 +00:00
hurui200320 deleted branch test/e2e-wf16-devcontainer 2026-03-30 11:13:31 +00:00
No reviewers
No milestone
No project
No assignees
3 participants
Reference
cleveragents/cleveragents-core!818