test(integration): workflow example 13 — custom automation profile with semantic escalation #949

Closed
brent.edwards wants to merge 1 commit from test/int-wf13-custom-profile into master
Member

Summary

Integration test for Specification Workflow Example 13: Custom Automation Profile with Semantic Escalation. Exercises custom profile YAML creation, project configuration, invariant-driven escalation, decision explanation, and plan prompt/resume lifecycle under mock AI providers.

Test Cases (5)

Test Case Description
WF13 Custom Profile YAML Creation Create a custom automation profile YAML and verify actor add succeeds
WF13 Project With Custom Profile Create a project configured with the custom profile, verify project show reflects it
WF13 Invariant Escalation Add invariants, create plan, verify invariant-driven escalation triggers plan pausing
WF13 Decision Explanation Seed decisions, invoke plan explain, verify rationale and alternatives in output
WF13 Plan Prompt Resume Test plan prompt and resume lifecycle (descoped — command not yet implemented)

Changes to Existing Files

File Change Rationale
robot/cli_core.robot Timeout increased to 120s, added on_timeout=kill Prevents SIGTERM (-15) failures under CI parallel load
robot/scientific_paper_basic.robot Removed Require OpenAI Key skip guard Tests now use CLEVERAGENTS_TESTING_USE_MOCK_AI=true — no real API key needed; skip guards provided false coverage signal
robot/scientific_paper_e2e_test.robot Removed skip guard, hardened assertions Under mock AI, outputs are deterministic — conditional checks replaced with mandatory assertions

Quality Gates

Session Result
nox -s lint PASS
nox -s typecheck PASS (0 errors)
nox -s unit_tests PASS
nox -s integration_tests PASS
nox -s coverage_report 98% (>= 97%)

Closes #777

## Summary Integration test for Specification Workflow Example 13: Custom Automation Profile with Semantic Escalation. Exercises custom profile YAML creation, project configuration, invariant-driven escalation, decision explanation, and plan prompt/resume lifecycle under mock AI providers. ### Test Cases (5) | Test Case | Description | |-----------|-------------| | WF13 Custom Profile YAML Creation | Create a custom automation profile YAML and verify actor add succeeds | | WF13 Project With Custom Profile | Create a project configured with the custom profile, verify project show reflects it | | WF13 Invariant Escalation | Add invariants, create plan, verify invariant-driven escalation triggers plan pausing | | WF13 Decision Explanation | Seed decisions, invoke plan explain, verify rationale and alternatives in output | | WF13 Plan Prompt Resume | Test plan prompt and resume lifecycle (descoped — command not yet implemented) | ### Changes to Existing Files | File | Change | Rationale | |------|--------|-----------| | `robot/cli_core.robot` | Timeout increased to 120s, added `on_timeout=kill` | Prevents SIGTERM (-15) failures under CI parallel load | | `robot/scientific_paper_basic.robot` | Removed `Require OpenAI Key` skip guard | Tests now use `CLEVERAGENTS_TESTING_USE_MOCK_AI=true` — no real API key needed; skip guards provided false coverage signal | | `robot/scientific_paper_e2e_test.robot` | Removed skip guard, hardened assertions | Under mock AI, outputs are deterministic — conditional checks replaced with mandatory assertions | ### Quality Gates | Session | Result | |---------|--------| | `nox -s lint` | PASS | | `nox -s typecheck` | PASS (0 errors) | | `nox -s unit_tests` | PASS | | `nox -s integration_tests` | PASS | | `nox -s coverage_report` | 98% (>= 97%) | Closes #777
test(integration): workflow example 13 — custom automation profile with semantic escalation
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 14s
CI / build (pull_request) Successful in 15s
CI / quality (pull_request) Successful in 18s
CI / e2e_tests (pull_request) Successful in 23s
CI / security (pull_request) Successful in 34s
CI / typecheck (pull_request) Successful in 34s
CI / unit_tests (pull_request) Successful in 2m20s
CI / integration_tests (pull_request) Successful in 2m53s
CI / docker (pull_request) Successful in 47s
CI / coverage (pull_request) Successful in 5m59s
CI / benchmark-regression (pull_request) Successful in 35m41s
e8720490be
Add Robot Framework integration test for Specification Workflow Example 13.
Exercises custom automation profile creation with specific thresholds,
invariant-driven escalation that overrides confidence-based auto-proceed,
plan explain for decision investigation, and plan prompt for human
guidance using mocked LLM providers.

Also fixes pre-existing test failures in cli_core.robot and
scientific_paper E2E tests that were failing before this commit.

ISSUES CLOSED: #777
brent.edwards added this to the v3.2.0 milestone 2026-03-14 04:01:45 +00:00
Owner

PM Review — Day 34

Status: Mergeable, 0 reviews, M3 (v3.2.0)
Closes: #777 | Author: @brent.edwards

Process Violation: Empty PR Body

The PR body is completely empty. This is a clear CONTRIBUTING.md violation. The commit message (e872049) contains a thorough description, but the PR body must independently describe the change for reviewer context. This is especially important because this PR also modifies 3 existing test files beyond the WF13 test addition.

Review Summary

Two new files following the established helper pattern:

  • robot/wf13_custom_profile.robot — 79 lines, 5 test cases
  • robot/helper_wf13_custom_profile.py — 479 lines (under 500 limit)

Plus modifications to 3 existing files:

  • robot/cli_core.robot — 4 lines changed
  • robot/scientific_paper_basic.robotRequire OpenAI Key skip guard removed
  • robot/scientific_paper_e2e_test.robot — conditional assertions replaced with hard assertions, skip guard removed

Issues Found

[BLOCKING] Empty PR body — Must be populated with a proper description including: WF13 test summary, rationale for existing test file changes, explanation of why OpenAI key skip guards were removed.

[RISK] Removed OpenAI Key skip guards — Both scientific_paper_*.robot files had Require OpenAI Key keywords deleted. Previously tests would Skip gracefully when OPENAI_API_KEY was unset. Now they'll hard-fail. Is CI guaranteed to have this key? Or are these tests now using mocked providers? This needs clarification.

[RISK] Scientific paper assertion tightening — Conditional checks (e.g., IF ${has_latex}) replaced with mandatory Should Contain ... \\documentclass. If these tests were conditional due to flakiness, this could reintroduce failures.

[MINOR] Consider separating the WF13 test addition from the scientific paper fixes into distinct commits for cleaner review history.

Action Items

Who Action Deadline
@brent.edwards BLOCKING: Populate PR body with full description Day 35
@brent.edwards Explain rationale for removing OpenAI key skip guards Day 35
@CoreRasurae Peer review this PR (Python/integration test expertise) Day 36

Labels Applied This Session

  • Priority/Medium, State/In Review added (were missing)
## PM Review — Day 34 **Status**: Mergeable, 0 reviews, M3 (v3.2.0) **Closes**: #777 | **Author**: @brent.edwards ### Process Violation: Empty PR Body The PR body is completely empty. This is a **clear CONTRIBUTING.md violation**. The commit message (`e872049`) contains a thorough description, but the PR body must independently describe the change for reviewer context. This is especially important because this PR also modifies **3 existing test files** beyond the WF13 test addition. ### Review Summary Two new files following the established helper pattern: - `robot/wf13_custom_profile.robot` — 79 lines, 5 test cases - `robot/helper_wf13_custom_profile.py` — 479 lines (under 500 limit) Plus modifications to 3 existing files: - `robot/cli_core.robot` — 4 lines changed - `robot/scientific_paper_basic.robot` — `Require OpenAI Key` skip guard **removed** - `robot/scientific_paper_e2e_test.robot` — conditional assertions replaced with hard assertions, skip guard removed ### Issues Found **[BLOCKING] Empty PR body** — Must be populated with a proper description including: WF13 test summary, rationale for existing test file changes, explanation of why OpenAI key skip guards were removed. **[RISK] Removed OpenAI Key skip guards** — Both `scientific_paper_*.robot` files had `Require OpenAI Key` keywords deleted. Previously tests would `Skip` gracefully when `OPENAI_API_KEY` was unset. Now they'll hard-fail. Is CI guaranteed to have this key? Or are these tests now using mocked providers? This needs clarification. **[RISK] Scientific paper assertion tightening** — Conditional checks (e.g., `IF ${has_latex}`) replaced with mandatory `Should Contain ... \\documentclass`. If these tests were conditional due to flakiness, this could reintroduce failures. **[MINOR]** Consider separating the WF13 test addition from the scientific paper fixes into distinct commits for cleaner review history. ### Action Items | Who | Action | Deadline | |-----|--------|----------| | @brent.edwards | **BLOCKING**: Populate PR body with full description | Day 35 | | @brent.edwards | Explain rationale for removing OpenAI key skip guards | Day 35 | | @CoreRasurae | **Peer review** this PR (Python/integration test expertise) | Day 36 | ### Labels Applied This Session - Priority/Medium, State/In Review added (were missing)
brent.edwards left a comment

Self-Review — PR #949

Reviewer: @brent.edwards (author self-review)
Review method: 4 parallel threads + 2 fresh-eyes passes

Findings

P1:must-fix — # type: ignore[assignment] in _rejoin()

robot/helper_wf13_custom_profile.py:119

lines, out = text.split("\n"), []  # type: ignore[assignment]

This violates the project rule "Never use # type: ignore." PR #951 (WF15) solves the same pattern correctly with explicit annotations. Fix: replace with separate declarations.

P1:must-fix — Missing reset_global_state() in dispatch

Same issue as PR #944. Helpers WF16/17/18 reset DI singletons; this helper does not.

P1:must-fix — Test case names missing WF13 prefix

robot/wf13_custom_profile.robot

All 5 test cases lack the WF13 prefix that other workflow tests use (WF03, WF12, WF16, WF17, WF18 all prefix). This breaks filtering by workflow number.

P2:should-fix — Duplicate pre-existing fixes will conflict with PRs #951-#955

This PR modifies cli_core.robot and scientific_paper_* tests. The same fixes appear in PRs #951-#955. These PRs must be merged in sequence with rebasing between each to avoid conflicts.

Verdict

3 P1, 1 P2. P1s must be fixed before merge.

# Self-Review — PR #949 **Reviewer:** @brent.edwards (author self-review) **Review method:** 4 parallel threads + 2 fresh-eyes passes ## Findings ### P1:must-fix — `# type: ignore[assignment]` in `_rejoin()` `robot/helper_wf13_custom_profile.py:119` ```python lines, out = text.split("\n"), [] # type: ignore[assignment] ``` This violates the project rule "Never use `# type: ignore`." PR #951 (WF15) solves the same pattern correctly with explicit annotations. Fix: replace with separate declarations. ### P1:must-fix — Missing `reset_global_state()` in dispatch Same issue as PR #944. Helpers WF16/17/18 reset DI singletons; this helper does not. ### P1:must-fix — Test case names missing `WF13` prefix `robot/wf13_custom_profile.robot` All 5 test cases lack the `WF13` prefix that other workflow tests use (WF03, WF12, WF16, WF17, WF18 all prefix). This breaks filtering by workflow number. ### P2:should-fix — Duplicate pre-existing fixes will conflict with PRs #951-#955 This PR modifies `cli_core.robot` and `scientific_paper_*` tests. The same fixes appear in PRs #951-#955. These PRs must be merged in sequence with rebasing between each to avoid conflicts. ## Verdict **3 P1, 1 P2.** P1s must be fixed before merge.
brent.edwards force-pushed test/int-wf13-custom-profile from e8720490be
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 14s
CI / build (pull_request) Successful in 15s
CI / quality (pull_request) Successful in 18s
CI / e2e_tests (pull_request) Successful in 23s
CI / security (pull_request) Successful in 34s
CI / typecheck (pull_request) Successful in 34s
CI / unit_tests (pull_request) Successful in 2m20s
CI / integration_tests (pull_request) Successful in 2m53s
CI / docker (pull_request) Successful in 47s
CI / coverage (pull_request) Successful in 5m59s
CI / benchmark-regression (pull_request) Successful in 35m41s
to 07d57bcd9a
All checks were successful
CI / lint (pull_request) Successful in 26s
CI / benchmark-publish (pull_request) Has been skipped
CI / quality (pull_request) Successful in 41s
CI / typecheck (pull_request) Successful in 46s
CI / security (pull_request) Successful in 50s
CI / build (pull_request) Successful in 24s
CI / e2e_tests (pull_request) Successful in 58s
CI / unit_tests (pull_request) Successful in 3m31s
CI / integration_tests (pull_request) Successful in 3m37s
CI / docker (pull_request) Successful in 1m7s
CI / coverage (pull_request) Successful in 6m17s
CI / benchmark-regression (pull_request) Successful in 40m45s
2026-03-15 04:31:37 +00:00
Compare
Author
Member

Response to Self-Review P1s and PM Blocking Items

Self-Review P1s

P1 — # type: ignore[assignment] in _rejoin()
Will fix by using separate variable declarations instead of tuple assignment.

P1 — Missing reset_global_state() in dispatch
Will add reset_global_state() call in the main dispatch function.

P1 — Test case names missing WF13 prefix
Will add WF13 prefix to all 5 test case names in wf13_custom_profile.robot.

P2 — Duplicate pre-existing fixes across PRs
Acknowledged. These PRs must be merged in sequence with rebasing. Will coordinate.

PM Blocking Items

Empty PR body
Will populate with a proper description including: WF13 test summary, rationale for existing test file changes, explanation of OpenAI key skip guard removal.

Removed OpenAI Key skip guards rationale
The skip guards were removed because they caused tests to silently pass in CI without actually running, providing false coverage signal. The scientific_paper tests now use CLEVERAGENTS_TESTING_USE_MOCK_AI=true which provides deterministic mocked LLM responses, so OPENAI_API_KEY is not needed. Tests should fail honestly when something is wrong rather than silently skipping.

Scientific paper assertion tightening
The conditional checks were replaced with hard assertions because under mock AI, the outputs should be deterministic. If the test was conditional due to flakiness, the root cause was non-deterministic LLM output — mock AI eliminates that.

Working on all fixes now.

## Response to Self-Review P1s and PM Blocking Items ### Self-Review P1s **P1 — `# type: ignore[assignment]` in `_rejoin()`** Will fix by using separate variable declarations instead of tuple assignment. **P1 — Missing `reset_global_state()` in dispatch** Will add `reset_global_state()` call in the main dispatch function. **P1 — Test case names missing `WF13` prefix** Will add `WF13` prefix to all 5 test case names in `wf13_custom_profile.robot`. **P2 — Duplicate pre-existing fixes across PRs** Acknowledged. These PRs must be merged in sequence with rebasing. Will coordinate. ### PM Blocking Items **Empty PR body** Will populate with a proper description including: WF13 test summary, rationale for existing test file changes, explanation of OpenAI key skip guard removal. **Removed OpenAI Key skip guards rationale** The skip guards were removed because they caused tests to silently pass in CI without actually running, providing false coverage signal. The scientific_paper tests now use `CLEVERAGENTS_TESTING_USE_MOCK_AI=true` which provides deterministic mocked LLM responses, so `OPENAI_API_KEY` is not needed. Tests should fail honestly when something is wrong rather than silently skipping. **Scientific paper assertion tightening** The conditional checks were replaced with hard assertions because under mock AI, the outputs should be deterministic. If the test was conditional due to flakiness, the root cause was non-deterministic LLM output — mock AI eliminates that. Working on all fixes now.
brent.edwards force-pushed test/int-wf13-custom-profile from 07d57bcd9a
All checks were successful
CI / lint (pull_request) Successful in 26s
CI / benchmark-publish (pull_request) Has been skipped
CI / quality (pull_request) Successful in 41s
CI / typecheck (pull_request) Successful in 46s
CI / security (pull_request) Successful in 50s
CI / build (pull_request) Successful in 24s
CI / e2e_tests (pull_request) Successful in 58s
CI / unit_tests (pull_request) Successful in 3m31s
CI / integration_tests (pull_request) Successful in 3m37s
CI / docker (pull_request) Successful in 1m7s
CI / coverage (pull_request) Successful in 6m17s
CI / benchmark-regression (pull_request) Successful in 40m45s
to 3ae4948c90
All checks were successful
CI / lint (pull_request) Successful in 15s
CI / typecheck (pull_request) Successful in 43s
CI / security (pull_request) Successful in 51s
CI / quality (pull_request) Successful in 27s
CI / e2e_tests (pull_request) Successful in 26s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 15s
CI / unit_tests (pull_request) Successful in 3m9s
CI / integration_tests (pull_request) Successful in 3m43s
CI / docker (pull_request) Successful in 9s
CI / coverage (pull_request) Successful in 6m4s
CI / benchmark-regression (pull_request) Successful in 41m23s
2026-03-15 19:40:45 +00:00
Compare
test(robot): add TDD tags (tdd_issue, tdd_issue_777, tdd_expected_fail) to wf13
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 18s
CI / quality (pull_request) Successful in 43s
CI / security (pull_request) Successful in 47s
CI / typecheck (pull_request) Successful in 47s
CI / build (pull_request) Successful in 30s
CI / e2e_tests (pull_request) Successful in 1m0s
CI / unit_tests (pull_request) Successful in 3m52s
CI / integration_tests (pull_request) Failing after 6m4s
CI / coverage (pull_request) Successful in 9m30s
CI / docker (pull_request) Successful in 1m6s
CI / benchmark-regression (pull_request) Successful in 39m54s
dd308943a8
Links this integration test to issue #777 using the new tdd_issue
naming convention. Marks as tdd_expected_fail since the feature
is not yet implemented.

Refs: #777, #965
Merge branch 'master' into test/int-wf13-custom-profile
Some checks failed
CI / lint (pull_request) Successful in 26s
CI / quality (pull_request) Successful in 28s
CI / typecheck (pull_request) Successful in 1m10s
CI / security (pull_request) Successful in 1m20s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 24s
CI / e2e_tests (pull_request) Successful in 1m48s
CI / unit_tests (pull_request) Successful in 3m17s
CI / integration_tests (pull_request) Failing after 5m33s
CI / coverage (pull_request) Successful in 6m9s
CI / docker (pull_request) Successful in 9s
CI / benchmark-regression (pull_request) Successful in 38m5s
8cc07a259c
Owner

PM Status — Day 36 (2026-03-16)

@brent.edwards — Thank you for the self-review response (Day 35). Acknowledged the P1 fixes plan.

Blocking item from Day 34 still open: The PR body is still empty. This is a CONTRIBUTING.md violation that must be fixed before review can proceed. Please add a proper PR description per CONTRIBUTING.md §PR format.

Reviewer: @hamza.khyari — review after Brent addresses the empty body and pushes P1 fixes. Target: Day 38 EOD.

Who Action Deadline
@brent.edwards Fix empty PR body, push P1 fixes Day 37 EOD
@hamza.khyari Review Day 38 EOD
## PM Status — Day 36 (2026-03-16) @brent.edwards — Thank you for the self-review response (Day 35). Acknowledged the P1 fixes plan. **Blocking item from Day 34 still open**: The PR body is still empty. This is a CONTRIBUTING.md violation that must be fixed before review can proceed. Please add a proper PR description per CONTRIBUTING.md §PR format. **Reviewer**: @hamza.khyari — review after Brent addresses the empty body and pushes P1 fixes. Target: Day 38 EOD. | Who | Action | Deadline | |-----|--------|----------| | @brent.edwards | Fix empty PR body, push P1 fixes | Day 37 EOD | | @hamza.khyari | Review | Day 38 EOD |
Author
Member

Status Update — Day 37

@freemoPR body has been populated with a full description per CONTRIBUTING.md, including:

  • WF13 test case summary (5 tests)
  • Changes to existing files table with rationale
  • Explanation of OpenAI key skip guard removal
  • Quality gates table

The self-review P1 fixes (type:ignore removal, reset_global_state, WF13 test name prefix) were addressed in earlier commits. Master has been merged in. Branch is up to date and ready for review.

Regarding the scientific paper assertion changes: the tests now use CLEVERAGENTS_TESTING_USE_MOCK_AI=true which provides deterministic mocked LLM responses, eliminating the need for OPENAI_API_KEY and justifying the hard assertions.

## Status Update — Day 37 @freemo — **PR body has been populated** with a full description per CONTRIBUTING.md, including: - WF13 test case summary (5 tests) - Changes to existing files table with rationale - Explanation of OpenAI key skip guard removal - Quality gates table The self-review P1 fixes (type:ignore removal, reset_global_state, WF13 test name prefix) were addressed in earlier commits. Master has been merged in. Branch is up to date and ready for review. Regarding the scientific paper assertion changes: the tests now use `CLEVERAGENTS_TESTING_USE_MOCK_AI=true` which provides deterministic mocked LLM responses, eliminating the need for `OPENAI_API_KEY` and justifying the hard assertions.
fix(robot): remove tdd_expected_fail tag from wf13 — tests now pass
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 15s
CI / lint (pull_request) Successful in 18s
CI / quality (pull_request) Successful in 28s
CI / typecheck (pull_request) Successful in 43s
CI / security (pull_request) Successful in 49s
CI / e2e_tests (pull_request) Successful in 1m37s
CI / unit_tests (pull_request) Successful in 3m10s
CI / docker (pull_request) Successful in 9s
CI / integration_tests (pull_request) Successful in 3m33s
CI / coverage (pull_request) Successful in 5m49s
CI / benchmark-regression (pull_request) Has been cancelled
d442267681
The tdd_expected_fail_listener correctly detects that all 5 WF13 test
cases pass, meaning the underlying feature is implemented.  Remove the
tdd_expected_fail tag so CI treats passes as passes.  Retains tdd_issue
and tdd_issue_777 tags as permanent regression references per
CONTRIBUTING.md.

ISSUES CLOSED: #777
Merge branch 'master' into test/int-wf13-custom-profile
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 16s
CI / lint (pull_request) Successful in 19s
CI / quality (pull_request) Successful in 35s
CI / typecheck (pull_request) Successful in 1m2s
CI / security (pull_request) Successful in 1m4s
CI / e2e_tests (pull_request) Successful in 1m33s
CI / unit_tests (pull_request) Successful in 3m8s
CI / integration_tests (pull_request) Successful in 3m42s
CI / docker (pull_request) Successful in 57s
CI / coverage (pull_request) Successful in 6m0s
CI / benchmark-regression (pull_request) Successful in 37m10s
03e0ab6d98
Merge branch 'master' into test/int-wf13-custom-profile
All checks were successful
CI / lint (pull_request) Successful in 20s
CI / quality (pull_request) Successful in 27s
CI / typecheck (pull_request) Successful in 42s
CI / security (pull_request) Successful in 59s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 18s
CI / unit_tests (pull_request) Successful in 3m6s
CI / e2e_tests (pull_request) Successful in 3m45s
CI / integration_tests (pull_request) Successful in 3m51s
CI / docker (pull_request) Successful in 1m21s
CI / coverage (pull_request) Successful in 7m3s
CI / benchmark-regression (pull_request) Successful in 38m11s
722eef3cee
brent.edwards force-pushed test/int-wf13-custom-profile from 722eef3cee
All checks were successful
CI / lint (pull_request) Successful in 20s
CI / quality (pull_request) Successful in 27s
CI / typecheck (pull_request) Successful in 42s
CI / security (pull_request) Successful in 59s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 18s
CI / unit_tests (pull_request) Successful in 3m6s
CI / e2e_tests (pull_request) Successful in 3m45s
CI / integration_tests (pull_request) Successful in 3m51s
CI / docker (pull_request) Successful in 1m21s
CI / coverage (pull_request) Successful in 7m3s
CI / benchmark-regression (pull_request) Successful in 38m11s
to 9725900217
Some checks failed
CI / lint (pull_request) Successful in 16s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 19s
CI / quality (pull_request) Successful in 38s
CI / typecheck (pull_request) Successful in 56s
CI / security (pull_request) Successful in 1m7s
CI / e2e_tests (pull_request) Successful in 4m31s
CI / unit_tests (pull_request) Successful in 5m45s
CI / integration_tests (pull_request) Successful in 6m31s
CI / docker (pull_request) Successful in 56s
CI / coverage (pull_request) Successful in 7m40s
CI / benchmark-regression (pull_request) Failing after 21m46s
2026-03-19 00:03:23 +00:00
Compare
Author
Member

Self-Review P1 Fixes Applied — Commit 97259002

Branch squash-rebased onto origin/master. All merge commits eliminated. Self-review findings verified/fixed:

Finding Status
P1: # type: ignore[assignment] in _rejoin() Already clean — separate type annotations
P1: Missing reset_global_state() Already present in dispatcher
P1: Missing WF13 prefix on test names Already prefixed
Force Tags missing custom-profile Fixed
timeout=120s on_timeout=kill Fixed — all 5 Run Process calls
CHANGELOG entry for #777 Fixed
  • nox -s lintPASS
  • nox -s typecheckPASS (0 errors)
  • Helper: 482 lines (under 500)
## Self-Review P1 Fixes Applied — Commit `97259002` Branch squash-rebased onto `origin/master`. All merge commits eliminated. Self-review findings verified/fixed: | Finding | Status | |---------|--------| | **P1**: `# type: ignore[assignment]` in `_rejoin()` | Already clean — separate type annotations | | **P1**: Missing `reset_global_state()` | Already present in dispatcher | | **P1**: Missing WF13 prefix on test names | Already prefixed | | Force Tags missing `custom-profile` | **Fixed** | | `timeout=120s on_timeout=kill` | **Fixed** — all 5 Run Process calls | | CHANGELOG entry for #777 | **Fixed** | - `nox -s lint` — **PASS** - `nox -s typecheck` — **PASS** (0 errors) - Helper: 482 lines (under 500)
freemo approved these changes 2026-03-19 04:57:33 +00:00
Dismissed
freemo left a comment

Code Review — PR #949

Well-structured integration test for WF13. Proper labels, milestone, and issue linkage. Approved.

## Code Review — PR #949 Well-structured integration test for WF13. Proper labels, milestone, and issue linkage. **Approved.**
brent.edwards force-pushed test/int-wf13-custom-profile from 9725900217
Some checks failed
CI / lint (pull_request) Successful in 16s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 19s
CI / quality (pull_request) Successful in 38s
CI / typecheck (pull_request) Successful in 56s
CI / security (pull_request) Successful in 1m7s
CI / e2e_tests (pull_request) Successful in 4m31s
CI / unit_tests (pull_request) Successful in 5m45s
CI / integration_tests (pull_request) Successful in 6m31s
CI / docker (pull_request) Successful in 56s
CI / coverage (pull_request) Successful in 7m40s
CI / benchmark-regression (pull_request) Failing after 21m46s
to ecc3d8ecd0
Some checks failed
CI / lint (pull_request) Successful in 27s
CI / benchmark-publish (pull_request) Has been skipped
CI / quality (pull_request) Successful in 31s
CI / build (pull_request) Successful in 19s
CI / typecheck (pull_request) Successful in 50s
CI / security (pull_request) Successful in 55s
CI / e2e_tests (pull_request) Successful in 5m35s
CI / coverage (pull_request) Successful in 9m56s
CI / integration_tests (pull_request) Failing after 15m29s
CI / unit_tests (pull_request) Failing after 15m29s
CI / benchmark-regression (pull_request) Successful in 39m26s
CI / docker (pull_request) Has been skipped
2026-03-20 00:01:50 +00:00
Compare
brent.edwards dismissed freemo's review 2026-03-20 00:01:51 +00:00
Reason:

New commits pushed, approval review dismissed automatically according to repository settings

Author
Member

Rebased onto origin/master (79b0a2c5). CHANGELOG conflict resolved (kept master, re-added PR entry). nox -s lint PASS, nox -s typecheck PASS (0 errors). Commit ecc3d8ec.

Rebased onto `origin/master` (`79b0a2c5`). CHANGELOG conflict resolved (kept master, re-added PR entry). `nox -s lint` PASS, `nox -s typecheck` PASS (0 errors). Commit `ecc3d8ec`.
Merge remote-tracking branch 'origin/master' into test/int-wf13-custom-profile
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 31s
CI / lint (pull_request) Successful in 4m33s
CI / quality (pull_request) Successful in 4m49s
CI / typecheck (pull_request) Successful in 5m22s
CI / security (pull_request) Successful in 5m35s
CI / unit_tests (pull_request) Successful in 8m7s
CI / docker (pull_request) Successful in 1m6s
CI / integration_tests (pull_request) Successful in 10m52s
CI / e2e_tests (pull_request) Successful in 11m46s
CI / coverage (pull_request) Successful in 11m2s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 1h6m2s
8e20a34198
# Conflicts:
#	CHANGELOG.md
Merge remote-tracking branch 'origin/master' into test/int-wf13-custom-profile
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 16s
CI / lint (pull_request) Successful in 3m34s
CI / quality (pull_request) Successful in 3m50s
CI / security (pull_request) Successful in 4m7s
CI / typecheck (pull_request) Successful in 4m10s
CI / unit_tests (pull_request) Successful in 7m54s
CI / integration_tests (pull_request) Successful in 7m48s
CI / e2e_tests (pull_request) Successful in 9m29s
CI / docker (pull_request) Successful in 1m14s
CI / coverage (pull_request) Successful in 12m0s
CI / status-check (pull_request) Successful in 2s
CI / benchmark-regression (pull_request) Successful in 58m3s
3160e6f183
# Conflicts:
#	CHANGELOG.md
Merge remote-tracking branch 'origin/master' into test/int-wf13-custom-profile
Some checks failed
CI / build (pull_request) Successful in 30s
CI / lint (pull_request) Successful in 5m1s
CI / quality (pull_request) Successful in 5m34s
CI / typecheck (pull_request) Successful in 5m34s
CI / security (pull_request) Successful in 5m51s
CI / integration_tests (pull_request) Successful in 11m1s
CI / unit_tests (pull_request) Successful in 12m8s
CI / docker (pull_request) Successful in 1m6s
CI / e2e_tests (pull_request) Successful in 13m19s
CI / coverage (pull_request) Successful in 11m27s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-publish (pull_request) Has been skipped
CI / benchmark-regression (pull_request) Failing after 26m53s
2d85b950c1
# Conflicts:
#	CHANGELOG.md
brent.edwards force-pushed test/int-wf13-custom-profile from 2d85b950c1
Some checks failed
CI / build (pull_request) Successful in 30s
CI / lint (pull_request) Successful in 5m1s
CI / quality (pull_request) Successful in 5m34s
CI / typecheck (pull_request) Successful in 5m34s
CI / security (pull_request) Successful in 5m51s
CI / integration_tests (pull_request) Successful in 11m1s
CI / unit_tests (pull_request) Successful in 12m8s
CI / docker (pull_request) Successful in 1m6s
CI / e2e_tests (pull_request) Successful in 13m19s
CI / coverage (pull_request) Successful in 11m27s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-publish (pull_request) Has been skipped
CI / benchmark-regression (pull_request) Failing after 26m53s
to 0cbf049c93
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 33s
CI / lint (pull_request) Successful in 5m37s
CI / typecheck (pull_request) Successful in 5m40s
CI / security (pull_request) Successful in 8m30s
CI / quality (pull_request) Successful in 8m38s
CI / unit_tests (pull_request) Successful in 11m23s
CI / docker (pull_request) Successful in 1m13s
CI / integration_tests (pull_request) Successful in 13m11s
CI / e2e_tests (pull_request) Successful in 15m38s
CI / coverage (pull_request) Successful in 11m28s
CI / status-check (pull_request) Successful in 2s
CI / benchmark-regression (pull_request) Successful in 1h7m43s
2026-03-26 20:03:02 +00:00
Compare
freemo self-assigned this 2026-04-02 06:15:21 +00:00
Owner

🤖 Backlog Groomer (groomer-1): Closing as duplicate of #777.

Issue #777 (test(integration): workflow example 13 — custom automation profile with semantic context) is the canonical version with full labels (MoSCoW/Must have, Priority/Medium, State/In Review, Type/Testing) and milestone v3.2.0. This issue is an exact title duplicate.

🤖 **Backlog Groomer (groomer-1):** Closing as duplicate of #777. Issue #777 (`test(integration): workflow example 13 — custom automation profile with semantic context`) is the canonical version with full labels (`MoSCoW/Must have`, `Priority/Medium`, `State/In Review`, `Type/Testing`) and milestone `v3.2.0`. This issue is an exact title duplicate.
freemo closed this pull request 2026-04-02 17:32:00 +00:00
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 33s
Required
Details
CI / lint (pull_request) Successful in 5m37s
Required
Details
CI / typecheck (pull_request) Successful in 5m40s
Required
Details
CI / security (pull_request) Successful in 8m30s
Required
Details
CI / quality (pull_request) Successful in 8m38s
Required
Details
CI / unit_tests (pull_request) Successful in 11m23s
Required
Details
CI / docker (pull_request) Successful in 1m13s
Required
Details
CI / integration_tests (pull_request) Successful in 13m11s
Required
Details
CI / e2e_tests (pull_request) Successful in 15m38s
CI / coverage (pull_request) Successful in 11m28s
Required
Details
CI / status-check (pull_request) Successful in 2s
CI / benchmark-regression (pull_request) Successful in 1h7m43s

Pull request closed

Sign in to join this conversation.
No milestone
No project
No assignees
2 participants
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Reference
cleveragents/cleveragents-core!949
No description provided.