feat(plan): implement LLM-powered Strategy Actor (#828) #1175

M1: `except (json.JSONDecodeError, Exception):` is redundant, because `Exception` is a superclass of `json.JSONDecodeError`. Either narrow to specific exceptions or simplify to `except Exception:` if the broad catch is intentional.

src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +620,4 @@

				                    time.sleep(delay)

				        # All retries exhausted — re-raise last exception

				        raise last_exc  # type: ignore[misc]

freemo commented

B1 [BLOCKER]: `# type: ignore[misc]` is strictly forbidden per CONTRIBUTING.md. Fix by adding an assertion before the raise:

assert last_exc is not None  # guaranteed: loop runs at least once
raise last_exc

This gives Pyright the type narrowing it needs without suppressing the check.

src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +635,4 @@

				            raw_content = getattr(response, "text", None)

				        if raw_content is None:

				            raw_content = str(response)

				        if isinstance(raw_content, list):

freemo commented

M3: This broad `except Exception:` catches programming errors (`TypeError`, `AttributeError`, `NameError`) alongside legitimate ACMS failures. Consider narrowing to `(RuntimeError, ConnectionError, TimeoutError, ValueError, OSError)`.

src/cleveragents/cli/commands/plan.py Outdated

						
				@@ -1297,0 +1307,4 @@

				        config_service = container.config_service()

				        resolved = config_service.resolve("actor.default.strategy")

				        config_value = resolved.value

				    except Exception:

freemo commented

2026-04-02 18:18:43 +00:00

H2: The broad `except Exception: pass` silently swallows ALL exceptions, including programming errors. Per CONTRIBUTING.md fail-fast principles, narrow to the specific exceptions `config_service.resolve()` can raise (e.g., `(KeyError, ValueError, RuntimeError)`).

freemo referenced this pull request

feat(plan): implement LLM-powered Strategy Actor #828

freemo commented

2026-04-02 19:01:58 +00:00

Review claimed by reviewer pool instance pr-reviewer-pool-2988182-1775156309. Dispatching independent code review.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer

freemo requested changes 2026-04-02 19:05:50 +00:00

Dismissed

freemo left a comment

Independent Code Review — PR #1175: feat(plan): implement LLM-powered Strategy Actor (#828)

Reviewer: ca-pr-self-reviewer (independent perspective)
Commit: ad554e3b
Branch: feature/strategy-actor-llm → master
Spec Reference: §Strategize Phase, §Decision Record Structure, §Prompt Injection Mitigation

Overall Assessment

This is a substantial, well-architected feature that introduces LLM-powered strategy generation with hierarchical action trees, dependency graph validation (Kahn's algorithm), prompt injection hardening, and graceful fallback. The code is cleanly decomposed across four modules (strategy_actor.py, strategy_models.py, strategy_parsing.py, strategy_prompt.py), and the test coverage is extensive (107 BDD scenarios + 7 Robot tests).

Many issues from the earlier review rounds (H1 response extraction, H2 invariants in prompt, H3/H4 narrowed exceptions, L2 decoupled stub) have been addressed. However, several blocking issues remain that prevent merge.

Blocking Issues (Must Fix Before Merge)

B1. `# type: ignore[misc]` suppression — FORBIDDEN

File: src/cleveragents/application/services/strategy_actor.py:623

raise last_exc  # type: ignore[misc]

CONTRIBUTING.md §Static Typing: "The use of # type: ignore or any other mechanism to suppress or disable type checking is strictly forbidden." This is a hard rule with no exceptions.

Fix: Refactor _invoke_llm_with_retry to avoid the need. For example:

last_exc: Exception = RuntimeError("LLM invocation failed")
# ... in the loop, assign last_exc = exc ...
raise last_exc

Or restructure to raise inside the loop's final iteration.
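The second option (raising inside the loop's final iteration) could look roughly like the sketch below. It is not the PR's actual `_invoke_llm_with_retry`; the function name `invoke_with_retry`, its parameters, and the choice of retryable exception types are assumptions made for illustration.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def invoke_with_retry(
    call: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Hypothetical retry helper: no `last_exc` variable, so nothing to suppress."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except (ConnectionError, TimeoutError):  # assumed transient failures
            if attempt == max_attempts:
                raise  # final attempt: propagate the active exception as-is
            # Exponential backoff: base_delay, 2*base_delay, 4*base_delay, ...
            sleep(base_delay * 2 ** (attempt - 1))
    # The loop always returns or raises; this line only satisfies the type checker.
    raise AssertionError("unreachable")
```

Because the exception is re-raised while it is still being handled, no `Optional[Exception]` variable exists for the type checker to complain about, and the injected `sleep` makes the backoff easy to test without real delays.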

B2. Merge Conflicts

The PR shows mergeable: false. The branch must be rebased onto current master before merge is possible.

B3. File Size Violations — 3 files exceed 500-line limit

CONTRIBUTING.md requires files to be under 500 lines:

| File | Lines | Multiple of limit |
|------|-------|-------------------|
| `features/steps/strategy_actor_llm_steps.py` | 2,084 | 4.2× |
| `features/strategy_actor_llm.feature` | 750 | 1.5× |
| `src/cleveragents/application/services/strategy_actor.py` | 830 | 1.66× |

Fix for strategy_actor.py: The module already delegates to strategy_models.py, strategy_parsing.py, and strategy_prompt.py. Consider extracting validate_no_cycles, _parse_actor_name, and resolve_strategy_actor into a separate strategy_resolution.py module, and/or moving build_decisions into its own module.

Fix for steps/feature files: Split the feature file into logical groups (e.g., strategy_actor_init.feature, strategy_actor_llm.feature, strategy_actor_parsing.feature, strategy_actor_decisions.feature) with corresponding step files.

B4. Empty PR Body — Missing Description and Closing Keyword

The PR description is empty. CONTRIBUTING.md requires:

A detailed summary of changes
A closing keyword linking to the issue: Closes #828
The PR must be marked as blocking issue #828

B5. Redundant Exception Catch in `_build_decisions`

File: src/cleveragents/application/services/plan_executor.py (new code in _build_decisions)

except (json.JSONDecodeError, Exception):

Exception is a superclass of json.JSONDecodeError, making the latter redundant. More importantly, catching bare Exception here violates the fail-fast principle. If a TypeError or AttributeError occurs during deserialization, it indicates a programming error that should propagate.

Fix: Narrow to except (json.JSONDecodeError, ValidationError, KeyError): or similar specific exceptions.
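A narrowed catch might be sketched as follows. The function `parse_decision_payload` and the payload fields `step` and `risk` are hypothetical stand-ins, not the real `_build_decisions` logic. One detail worth knowing when picking the tuple: `json.JSONDecodeError` is itself a subclass of `ValueError`, so listing both would repeat the redundancy on a smaller scale.

```python
import json
from typing import Any, Dict, Optional


def parse_decision_payload(raw: str) -> Optional[Dict[str, Any]]:
    """Hypothetical deserializer: returns None for malformed data only."""
    try:
        data = json.loads(raw)
        return {"step": data["step"], "risk": float(data["risk"])}
    except (KeyError, ValueError):
        # ValueError already covers json.JSONDecodeError (its subclass) and
        # float() conversion failures; KeyError covers missing fields.
        return None
    # TypeError / AttributeError are deliberately not caught: they point to
    # a programming error and should propagate (fail-fast).
```

With this shape, bad input degrades gracefully while a caller passing the wrong type still fails loudly.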

Significant Issues (Should Fix)

S1. Tests Call Private Method `_execute_with_llm` (7 occurrences)

File: features/steps/strategy_actor_llm_steps.py lines 621, 627, 889, 979, 1095, 1299, 1766

This was flagged as H5 in the original review and acknowledged by @freemo as requiring action. The test creates coupling to implementation details and produces inconsistent state (two different StrategyTree instances with different ULIDs from the same logical execution).

Fix: Expose tree inspection through the public result object, or capture via mock interception on _build_tree.
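The mock-interception option could be sketched with `unittest.mock.patch.object`, as below. `FakeStrategyActor` is a made-up stand-in for the real actor, and `execute_and_capture_tree` is a hypothetical test helper; only the idea of spying on `_build_tree` while driving the public `execute()` comes from the review.

```python
from typing import Any, Dict, List, Tuple
from unittest.mock import patch


class FakeStrategyActor:
    """Stand-in for the real actor, only to make the sketch runnable."""

    def execute(self, definition_of_done: str) -> Dict[str, int]:
        tree = self._build_tree([definition_of_done])
        return {"decision_count": len(tree["nodes"])}

    def _build_tree(self, steps: List[str]) -> Dict[str, List[str]]:
        return {"nodes": list(steps)}


def execute_and_capture_tree(actor: Any, definition_of_done: str) -> Tuple[Any, List[Any]]:
    """Run the public execute() once and record every tree it builds."""
    captured: List[Any] = []
    original = actor._build_tree  # bound method, captured before patching

    def spy(*args: Any, **kwargs: Any) -> Any:
        tree = original(*args, **kwargs)
        captured.append(tree)
        return tree

    with patch.object(actor, "_build_tree", side_effect=spy):
        result = actor.execute(definition_of_done)
    return result, captured
```

Since the tree is captured from the same `execute()` call whose result is asserted on, there is a single execution and therefore a single set of ULIDs.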

S2. Broad `except Exception: pass` in Config Resolution

File: src/cleveragents/cli/commands/plan.py (new code around line 1310)

try:
    config_service = container.config_service()
    resolved = config_service.resolve("actor.default.strategy")
    config_value = resolved.value
except Exception:
    pass  # Config unavailable — proceed with default resolution

This silently swallows all exceptions including programming errors. Should be narrowed to expected exceptions (e.g., KeyError, AttributeError, RuntimeError).

S3. Broad `except Exception:` in ACMS Context Retrieval

File: src/cleveragents/application/services/strategy_actor.py (~line 640)
The comment justifies this as "ACMS failures are explicitly non-fatal" with recovery logic (proceed without context). While the recovery is meaningful, the catch is still overly broad. Consider narrowing to (RuntimeError, ConnectionError, TimeoutError, ValueError, OSError).
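A minimal sketch of the narrowed, still non-fatal catch is shown below. The helper name `fetch_context_or_none` and the `fetch` callable are assumptions; which exceptions ACMS actually raises would need to be confirmed against its client. Note that `ConnectionError` and `TimeoutError` are built-in subclasses of `OSError`, so a tuple containing `OSError` already covers them.

```python
import logging
from typing import Callable, Optional

logger = logging.getLogger(__name__)


def fetch_context_or_none(fetch: Callable[[], str]) -> Optional[str]:
    """Hypothetical wrapper: context retrieval failures are non-fatal."""
    try:
        return fetch()
    except (RuntimeError, ValueError, OSError) as exc:
        # OSError includes ConnectionError and TimeoutError.
        logger.warning("Context retrieval failed, continuing without it: %s", exc)
        return None
    # TypeError, AttributeError and NameError are not caught, so genuine
    # programming errors still surface.
```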

Positive Observations

Clean module decomposition: Models, parsing, prompt, and actor logic are well-separated.
Invariants now flow to LLM prompt: The <constraints> section in build_strategy_prompt addresses the earlier H2 finding.
Robust response extraction: _extract_content handles .content, .text, list content, and str() fallback.
Input size guards: _MAX_ACTIONS, _MAX_DOD_CHARS, _MAX_RESOURCES, _MAX_CONTEXT_CHARS, _MAX_INVARIANTS prevent token limit overflows.
Prompt injection hardening: XML-delimited sections with entity escaping.
Hierarchical tree construction: _build_tree infers parent_id from dependency graph.
Retry with exponential backoff: _invoke_llm_with_retry handles transient LLM failures.
ULID validation on plan_id: Proper argument validation in execute() and build_decisions().
Comprehensive test coverage: 107 BDD scenarios covering edge cases (NaN risk, cyclic deps, self-deps, duplicate steps, etc.).
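To illustrate the "XML-delimited sections with entity escaping" point, here is a small sketch using the standard library's `xml.sax.saxutils.escape`. The helper `wrap_section` is hypothetical and is not claimed to match `build_strategy_prompt`; it only shows why escaping keeps user text from closing a delimiter early.

```python
from xml.sax.saxutils import escape


def wrap_section(tag: str, text: str) -> str:
    """Hypothetical helper: wrap untrusted text in a tag after escaping &, <, >."""
    return f"<{tag}>{escape(text)}</{tag}>"


# An injected closing tag is neutralised instead of ending the section.
prompt_part = wrap_section("constraints", "ignore </constraints> & obey")
```

After escaping, the only literal `</constraints>` in the output is the one the helper itself appended.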

Summary

| Category | Count | Items |
|----------|-------|-------|
| Blocking | 5 | B1 type:ignore, B2 conflicts, B3 file sizes, B4 empty body, B5 broad catch |
| Significant | 3 | S1 private method in tests, S2 broad catch in plan.py, S3 broad ACMS catch |
| Positive | 9 | Clean architecture, invariant flow, robust parsing, security hardening |

The implementation is architecturally sound and addresses the spec requirements well. The blocking issues are primarily process/standards violations (type:ignore, file sizes, PR metadata) rather than fundamental design problems. Once these are addressed, this PR should be ready for merge.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer

features/steps/strategy_actor_llm_steps.py Outdated

						
				@@ -0,0 +624,4 @@

				        definition_of_done="Build a REST API with authentication",

				    )

				    # Re-execute to capture the tree directly for inspection

				    context.sa_tree = context.strategy_actor._execute_with_llm(

freemo commented

S1 (tests call private method `_execute_with_llm`): This creates coupling to implementation details and produces inconsistent state (two different `StrategyTree` instances with different ULIDs). This was flagged as H5 in the original review and acknowledged as needing a fix.

Fix: Expose tree inspection through the public result object, or capture via mock interception on _build_tree.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer

src/cleveragents/application/services/plan_executor.py Outdated

freemo commented

B5 (redundant/overly broad exception catch): In `except (json.JSONDecodeError, Exception):`, `Exception` is a superclass of `JSONDecodeError`, making the first type redundant. More importantly, catching `Exception` here swallows programming errors (`TypeError`, `AttributeError`) that should propagate per fail-fast principles.

Fix: Narrow to `except (KeyError, ValueError):` or similar specific exceptions that deserialization can actually raise. Note that `json.JSONDecodeError` is itself a subclass of `ValueError`, so it does not need to be listed alongside it.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer

src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +620,4 @@

				                    time.sleep(delay)

				        # All retries exhausted — re-raise last exception

				        raise last_exc  # type: ignore[misc]

freemo commented

B1 (FORBIDDEN): `# type: ignore[misc]` suppression.

CONTRIBUTING.md §Static Typing strictly forbids any `# type: ignore` usage. Refactor to avoid the need, e.g., initialize `last_exc: Exception = RuntimeError("LLM invocation failed")` before the loop so the type checker knows it is always an `Exception` at this point, or restructure to raise inside the loop's final iteration.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer

src/cleveragents/cli/commands/plan.py Outdated

freemo commented

2026-04-02 19:05:59 +00:00

S2 (broad `except Exception: pass`): This silently swallows all exceptions, including programming errors (`TypeError`, `NameError`, `AttributeError`). It should be narrowed to the exceptions expected from the config service (e.g., `KeyError`, `ValueError`, `RuntimeError`).

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer

freemo referenced this pull request

feat(plan): implement LLM-powered Strategy Actor #828

freemo added the

labels 2026-04-02 19:30:26 +00:00

freemo referenced this pull request

2026-04-02 19:32:08 +00:00

[Automated] Backlog Groomer Session State — v3.7.0 #1463

freemo commented

2026-04-02 19:33:43 +00:00

Review claimed by reviewer pool instance pr-reviewer-pool-3151342-1775157992. Dispatching independent code review.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer

freemo commented

2026-04-02 19:46:52 +00:00

Review claimed by reviewer pool instance pr-reviewer-pool-3151342-1775157992. Dispatching independent code review.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer

freemo commented

2026-04-02 23:04:04 +00:00

Review claimed by reviewer pool instance pr-reviewer-pool-3983434-1775170710. Dispatching independent code review.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer

freemo requested changes 2026-04-03 01:22:24 +00:00

Dismissed

freemo left a comment

Independent Code Review — PR #1175: feat(plan): implement LLM-powered Strategy Actor (#828)

Reviewer: ca-pr-self-reviewer (independent perspective)
Branch: feature/strategy-actor-llm
Head commit: ad554e3b
Spec references: §Strategize Phase, §Decision Record Structure, §Prompt Injection Mitigation

Summary

This PR implements the LLM-powered Strategy Actor for the plan strategize phase, replacing the StrategizeStubActor with a full LLM-backed implementation that produces hierarchical action trees with dependencies, resource requirements, complexity estimates, and risk scores. The implementation is well-structured with good modular decomposition (separate files for models, parsing, prompt construction) and includes 37 Behave BDD scenarios + 7 Robot Framework integration tests.

I reviewed the full diff (14 files, +5167/-12 lines), the linked issue #828, the specification, CONTRIBUTING.md, and the previous review findings. Several of the original HIGH findings (H1, H2, H3) have been addressed. However, three blocking issues prevent merge.

BLOCKING Issues

B1: Merge Conflicts (`mergeable: false`)

The PR currently has merge conflicts with master. Forgejo reports mergeable: false. The branch must be rebased onto current master before merge is possible.

B2: File Size Violations (CONTRIBUTING.md)

CONTRIBUTING.md mandates files must be under 500 lines. Three files exceed this limit:

| File | Lines | Multiple of limit |
|------|-------|-------------------|
| `features/steps/strategy_actor_llm_steps.py` | 2,084 | 4.2× |
| `src/cleveragents/application/services/strategy_actor.py` | 830 | 1.66× |
| `features/strategy_actor_llm.feature` | 750 | 1.5× |

Required action: Split these files. For example:

strategy_actor.py → extract _build_tree, _tree_to_decisions, _build_invariant_records, and validate_no_cycles into a separate strategy_tree_builder.py
strategy_actor_llm_steps.py → split into multiple step files by concern (e.g., strategy_actor_stub_steps.py, strategy_actor_llm_steps.py, strategy_parsing_steps.py, strategy_prompt_steps.py)
strategy_actor_llm.feature → split into multiple feature files by concern

B3: Empty PR Body

The PR description/body is empty. CONTRIBUTING.md requires: "Pull Requests must have a detailed description that explains the purpose and context of the change." The PR must include a description explaining the change, linking to issue #828, and summarizing the implementation approach.

HIGH Issues

H1: Tests Still Call Private `_execute_with_llm` Directly (7 places)

This was identified as H5 in the previous review and acknowledged by the maintainer as requiring a fix. The test file still calls `context.strategy_actor._execute_with_llm(...)` directly in 7 places (lines 621, 627, 889, 979, 1095, 1299, 1766). This:

Creates coupling to implementation details (fragile tests)
Produces inconsistent state (double execution with different ULIDs)
Tests a private API that may change without notice

Required action: Test through the public execute() interface. If tree inspection is needed, either expose it through the result object or capture it via mock interception.

H2: Redundant Exception Catch in `plan_executor.py`

In _build_decisions():

except (json.JSONDecodeError, Exception):

Exception is a superclass of json.JSONDecodeError, making the tuple redundant. This should either be except Exception: (if truly broad catch is intended) or narrowed to specific exceptions like except (json.JSONDecodeError, PydanticValidationError):.

MEDIUM Issues

M1: ACMS `except Exception:` Still Broad

Line 557 in `strategy_actor.py` still uses a broad `except Exception:` for ACMS context retrieval. While the comment documents this as intentional (ACMS failures are non-fatal), it would be better to catch the specific exception types that ACMS is known to raise, plus a documented broad catch for truly unexpected errors. `KeyboardInterrupt` and `SystemExit` are not affected, since they derive from `BaseException` rather than `Exception`, but the pattern still sets a bad precedent.

M2: Cross-Class Private Method Access

_build_decisions() in plan_executor.py calls StrategizeStubActor._parse_steps() — a private method of another class. This creates tight coupling. Consider extracting _parse_steps to a shared utility function or making it a public method.

M3: `resolve_strategy_actor` Silent Degradation Warning Could Be Stronger

When config_value="llm" but no provider registry is available, a warning is logged but the actor is still created in stub mode. Consider raising an error or at minimum using logger.error() instead of logger.warning() since the user explicitly requested LLM mode.
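The stricter behaviour suggested here could be sketched as follows. `StrategyConfigurationError` and `choose_strategy_mode` are hypothetical names, and the real `resolve_strategy_actor` signature is not shown in this thread, so treat this purely as an illustration of failing fast on an explicit request.

```python
from typing import Optional


class StrategyConfigurationError(RuntimeError):
    """Hypothetical error for an explicit LLM request that cannot be honoured."""


def choose_strategy_mode(config_value: Optional[str], provider_registry: Optional[object]) -> str:
    """Return "llm" or "stub"; refuse to silently downgrade an explicit "llm"."""
    if config_value == "llm":
        if provider_registry is None:
            raise StrategyConfigurationError(
                "Strategy mode 'llm' was requested but no provider registry is available"
            )
        return "llm"
    # No explicit request for LLM mode: the stub remains a safe default.
    return "stub"
```

Raising keeps the misconfiguration visible at startup instead of surfacing later as unexpectedly generic plans.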

Positive Observations

Good modular decomposition: Splitting into strategy_models.py, strategy_parsing.py, strategy_prompt.py is clean architecture.
H1 fix is solid: _extract_content properly handles .content, .text, list responses, and str() fallback.
H2 fix is thorough: Invariants are now included in the LLM prompt under <constraints> tags with proper XML sanitization.
H3 fix is correct: Lifecycle resolution now catches specific exception types.
Dependency graph validation: Kahn's algorithm for cycle detection is correct and well-documented.
Prompt injection mitigation: XML-style tags with entity escaping is a reasonable approach, documented as an intentional deviation from spec.
Retry logic: Exponential backoff for LLM invocations is well-implemented.
CHANGELOG entry: Present and detailed.
Robot integration tests: 7 tests covering key scenarios.
Forward-looking build_decisions API: Well-documented as not yet wired into the pipeline, with clear notes about future integration.
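For readers unfamiliar with the cycle check mentioned in the list above, Kahn's algorithm can be sketched as below. This is a generic version, not the PR's `validate_no_cycles`; the function name `has_cycle` and the input shape (each step mapped to the steps it depends on) are assumptions.

```python
from collections import deque
from typing import Deque, Dict, List


def has_cycle(dependencies: Dict[str, List[str]]) -> bool:
    """Kahn's algorithm: a cycle exists iff not every node can be topologically ordered."""
    nodes = set(dependencies)
    for deps in dependencies.values():
        nodes.update(deps)

    in_degree = {node: 0 for node in nodes}
    dependents: Dict[str, List[str]] = {node: [] for node in nodes}
    for step, deps in dependencies.items():
        for dep in deps:
            in_degree[step] += 1          # step waits on dep
            dependents[dep].append(step)  # dep unblocks step

    ready: Deque[str] = deque(node for node, degree in in_degree.items() if degree == 0)
    processed = 0
    while ready:
        node = ready.popleft()
        processed += 1
        for dependent in dependents[node]:
            in_degree[dependent] -= 1
            if in_degree[dependent] == 0:
                ready.append(dependent)

    # Nodes stuck with a non-zero in-degree are part of (or behind) a cycle.
    return processed != len(nodes)
```

Self-dependencies fall out naturally: a step that depends on itself never reaches in-degree zero, so it is reported as a cycle.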

Verdict: REQUEST_CHANGES

The implementation quality is good and most previous HIGH findings have been addressed. However, the three blocking issues (merge conflicts, file size violations, empty PR body) must be resolved before this can be approved. The remaining HIGH issues (private method testing, redundant exception catch) should also be addressed.

Priority order for fixes:

Rebase onto master to resolve merge conflicts
Split oversized files to comply with 500-line limit
Add PR description
Refactor tests to use public API instead of _execute_with_llm
Fix redundant exception catch in _build_decisions

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer


features/steps/strategy_actor_llm_steps.py Outdated

						
				@@ -0,0 +1,2084 @@

				"""Step definitions for strategy_actor_llm.feature.

freemo commented

[B2 — File Size Violation] This file is 2,084 lines — over 4× the 500-line limit mandated by CONTRIBUTING.md. Split into multiple step files by concern (e.g., strategy_actor_stub_steps.py, strategy_actor_llm_steps.py, strategy_parsing_steps.py, strategy_prompt_steps.py, strategy_resolve_steps.py).


features/steps/strategy_actor_llm_steps.py Outdated

						
				@@ -0,0 +624,4 @@

				        definition_of_done="Build a REST API with authentication",

				    )

				    # Re-execute to capture the tree directly for inspection

				    context.sa_tree = context.strategy_actor._execute_with_llm(

freemo commented

[H1 — Private Method Access in Tests] This calls _execute_with_llm directly — a private method — creating coupling to implementation details and producing inconsistent state (double execution with different ULIDs). The assertions on context.sa_tree verify a different tree than what context.strategy_result contains.

Fix: Test through the public execute() interface. If tree inspection is needed, capture the tree via mock interception on _build_tree or expose it through the result object.
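The mock-interception option can be sketched as follows. This is a minimal illustration under stated assumptions: the `StrategyActor` class here is a hypothetical stand-in (its `execute()` and `_build_tree()` signatures are invented for the example, not taken from the PR), and `execute_and_capture_tree` is a hypothetical helper. The point is only that the tree is recorded during the single public `execute()` call, so the result and the inspected tree come from the same run.

```python
from unittest.mock import patch


class StrategyActor:
    """Hypothetical stand-in for the real actor; only the call shape matters."""

    def execute(self, definition_of_done: str) -> dict:
        tree = self._build_tree(definition_of_done)
        return {"decision_count": len(tree["nodes"])}

    def _build_tree(self, definition_of_done: str) -> dict:
        return {"nodes": [part.strip() for part in definition_of_done.split(" with ")]}


def execute_and_capture_tree(actor: StrategyActor, definition_of_done: str):
    """Run the public entry point once and record the tree it built."""
    captured = []
    real_build_tree = actor._build_tree  # bound method, saved before patching

    def recording_build_tree(*args, **kwargs):
        tree = real_build_tree(*args, **kwargs)
        captured.append(tree)
        return tree

    # side_effect delegates to the real method, so behaviour is unchanged.
    with patch.object(actor, "_build_tree", side_effect=recording_build_tree):
        result = actor.execute(definition_of_done)
    return result, captured
```

A step definition could then store `result` and `captured[0]` on the Behave context, which removes the second execution and the mismatched ULIDs it produces.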


features/strategy_actor_llm.feature Outdated

						
				@@ -0,0 +1,750 @@

				@mock_only

freemo commented

[B2 — File Size Violation] This feature file is 750 lines — exceeds the 500-line limit per CONTRIBUTING.md. Split into multiple feature files by concern (e.g., strategy_actor_stub.feature, strategy_actor_llm.feature, strategy_parsing.feature, strategy_prompt.feature, strategy_resolve.feature).


src/cleveragents/application/services/plan_executor.py Outdated

						
				@@ -591,0 +631,4 @@

				                raw_list: list[dict[str, Any]] = json.loads(stored_json)

				                return [StrategyDecision.model_validate(d) for d in raw_list]

				            except (json.JSONDecodeError, Exception):

				                self._logger.warning(

freemo commented

[H2 — Redundant Exception Catch] except (json.JSONDecodeError, Exception): is redundant — Exception is a superclass of json.JSONDecodeError, so the tuple is equivalent to except Exception:. Either narrow to specific exceptions (e.g., except (json.JSONDecodeError, PydanticValidationError):) or use except Exception: if a broad catch is truly intended.
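A minimal sketch of the narrowed form, using only the standard library. The `Decision` dataclass and `load_decisions` function are hypothetical stand-ins for `StrategyDecision` and `_build_decisions()`; the real code calls `model_validate`, so its tuple would presumably name Pydantic's validation error rather than `KeyError`/`TypeError`.

```python
import json
from dataclasses import dataclass


@dataclass
class Decision:
    """Hypothetical stand-in for StrategyDecision."""

    title: str


def load_decisions(stored_json: str) -> list[Decision]:
    """Deserialize stored decisions, falling back to an empty list on bad data."""
    try:
        raw_list = json.loads(stored_json)
        return [Decision(title=item["title"]) for item in raw_list]
    except (json.JSONDecodeError, KeyError, TypeError):
        # Only malformed JSON or a wrongly shaped payload is swallowed here;
        # any other exception (for example an AttributeError from a bug)
        # still propagates to the caller.
        return []


# The original tuple was redundant because of this relationship:
is_redundant = issubclass(json.JSONDecodeError, Exception)
```

Listing each expected failure explicitly keeps the fallback behaviour while letting genuine programming errors surface in tests.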


src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +1,830 @@

				"""LLM-powered Strategy Actor for the plan ``strategize`` phase.

freemo commented

[B2 — File Size Violation] This file is 830 lines — exceeds the 500-line limit per CONTRIBUTING.md. Consider extracting _build_tree, _tree_to_decisions, _build_invariant_records, validate_no_cycles, and the Protocol types into a separate module (e.g., strategy_tree_builder.py).


src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +554,4 @@

				            try:

				                acms_result = self._acms_pipeline.get_context_summary()

				                acms_context = str(acms_result) if acms_result else None

				            except Exception:

freemo commented

2026-04-03 01:23:16 +00:00

[M1 — Broad Exception Catch] This except Exception: for ACMS context retrieval is documented as intentional but still catches programming errors (TypeError, NameError, etc.). Consider narrowing to (RuntimeError, ConnectionError, TimeoutError, ValueError, OSError) — the exception types that ACMS/network operations are known to raise.


freemo referenced this pull request

feat(plan): implement LLM-powered Strategy Actor #828

freemo commented

2026-04-03 03:54:31 +00:00

Merge conflict detected. This PR has mergeable: false — the branch has conflicts with master. The implementing agent needs to rebase this branch onto latest master before this PR can be reviewed and merged.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer


freemo commented

2026-04-03 18:45:46 +00:00

🔒 Review claimed by reviewer-pool-1 [claim-token: reviewer-pool-1-1175-1775242500]

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer


freemo requested changes 2026-04-03 18:50:20 +00:00

Dismissed

freemo left a comment

Independent Code Review — PR #1175: feat(plan): implement LLM-powered Strategy Actor (#828)

Reviewer: ca-pr-self-reviewer (independent perspective)
Commit: ad554e3
Branch: feature/strategy-actor-llm → master
Spec Reference: docs/specification.md §Strategize Phase, §Decision Record Structure, §Prompt Injection Mitigation

Review Summary

This is a substantial, well-architected implementation of the LLM-powered Strategy Actor. The code demonstrates strong design with good separation of concerns (models, parsing, prompt construction, actor logic), thorough test coverage (105 Behave scenarios + 7 Robot tests), and extensive hardening across 7 review cycles. The commit message is exemplary in its detail.

However, four blocking issues prevent merge:

BLOCKING Issues (must fix)

B1: `# type: ignore[misc]` on line 623 of `strategy_actor.py`

CONTRIBUTING.md is absolute: "Using # type: ignore or any other mechanism to suppress or disable type checking is strictly prohibited."

raise last_exc  # type: ignore[misc]

Fix: Assert the variable is not None before raising, or restructure the retry loop to avoid the None possibility:

if last_exc is not None:
    raise last_exc
raise PlanError("LLM invocation failed after all retries with no captured exception")
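The "restructure the retry loop" option could look roughly like the sketch below, which re-raises inside the loop on the final attempt so that no `Optional` exception variable exists for the type checker to complain about. The function name, parameters, and the caught exception types are assumptions made for illustration; they are not taken from `strategy_actor.py`.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def invoke_with_retry(
    call: Callable[[], T],
    max_attempts: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `call` with exponential backoff, re-raising the last failure."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except (ConnectionError, TimeoutError):
            if attempt == max_attempts:
                raise  # final attempt: propagate the original exception
            sleep(base_delay * 2 ** (attempt - 1))
    # Not reachable at runtime; present so every code path returns or raises.
    raise AssertionError("retry loop exited without returning or raising")
```

Compared with an `assert last_exc is not None`, this variant does not depend on assertions being enabled, since `assert` statements are stripped when Python runs with `-O`.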

B2: Merge conflicts — `mergeable: false`

The PR currently has merge conflicts with master. The branch must be rebased onto latest master before it can be merged. Per CONTRIBUTING.md: "Merge commits are not allowed. Branches must be rebased onto the target branch before merging."

B3: Three files exceed the 500-line limit

CONTRIBUTING.md requires files to be under 500 lines:

| File | Lines | Over by |
|------|-------|---------|
| `strategy_actor.py` | 830 | 66% |
| `strategy_actor_llm_steps.py` | 2,084 | 317% |
| `strategy_actor_llm.feature` | 750 | 50% |

Suggested splits:

- strategy_actor.py (830 lines): The module already has good internal structure. Extract validate_no_cycles(), _parse_actor_name(), and resolve_strategy_actor() into a separate strategy_resolution.py or similar utility module. The actor class itself would then fit within 500 lines.
- strategy_actor_llm_steps.py (2,084 lines): Split by test category, e.g., strategy_actor_llm_parsing_steps.py, strategy_actor_llm_execution_steps.py, strategy_actor_llm_prompt_steps.py.
- strategy_actor_llm.feature (750 lines): Split into multiple feature files by section (e.g., strategy_actor_parsing.feature, strategy_actor_execution.feature, strategy_actor_prompt.feature).

B4: Empty PR body

The PR description is empty. CONTRIBUTING.md requires PRs to have a detailed description including closing keywords (Closes #828), a summary of changes, and formal dependency linking. The commit message is excellent — much of it can be adapted for the PR body.

MEDIUM Issues (recommended to fix)

M1: Redundant exception tuple in `plan_executor.py` line 633

except (json.JSONDecodeError, Exception):

json.JSONDecodeError is a subclass of Exception, making this tuple redundant — it's equivalent to except Exception:. Either narrow to specific exceptions (json.JSONDecodeError, ValidationError) or use just except Exception: with a comment explaining the broad catch.

M2: ACMS context retrieval uses broad `except Exception:` (strategy_actor.py ~line 638)

While the code has a comment explaining this is intentional ("ACMS failures are explicitly non-fatal"), CONTRIBUTING.md's fail-fast principles prefer narrowed exception types. Consider narrowing to (RuntimeError, ConnectionError, TimeoutError, ValueError, AttributeError) — the set of exceptions ACMS pipelines are known to raise.

Positive Observations

1. Excellent code structure: Clean separation into models, parsing, prompt, and actor modules
2. Thorough test coverage: 105 BDD scenarios covering edge cases, error paths, and security concerns
3. Security hardening: XML sanitization for prompt injection, input size bounds on all prompt sections
4. Robust parsing: Multi-anchor JSON retry, numbered-list fallback, NaN/Inf clamping
5. Good documentation: Comprehensive docstrings, known-limitations notes, spec references
6. Invariants now flow to LLM prompt: The original H2 concern was properly addressed
7. Dependency graph validation: Kahn's algorithm implementation is correct and well-tested
8. 7 review cycles of hardening: The iterative improvement is evident in the code quality
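For readers unfamiliar with the dependency-graph validation mentioned in the list above, cycle detection with Kahn's algorithm can be sketched as follows. This is a generic illustration, not the PR's `validate_no_cycles()` implementation; the function name `has_cycle` and the input shape (a mapping from each node to the nodes it depends on) are assumptions.

```python
from collections import deque


def has_cycle(dependencies: dict[str, list[str]]) -> bool:
    """Return True if the dependency graph contains a cycle.

    `dependencies` maps each node to the list of nodes it depends on.
    """
    nodes = set(dependencies)
    for deps in dependencies.values():
        nodes.update(deps)

    in_degree = {node: 0 for node in nodes}
    dependents: dict[str, list[str]] = {node: [] for node in nodes}
    for node, deps in dependencies.items():
        for dep in deps:
            in_degree[node] += 1          # node waits on dep
            dependents[dep].append(node)  # dep unblocks node

    # Start from nodes with no unmet dependencies and peel the graph.
    ready = deque(node for node, degree in in_degree.items() if degree == 0)
    visited = 0
    while ready:
        current = ready.popleft()
        visited += 1
        for nxt in dependents[current]:
            in_degree[nxt] -= 1
            if in_degree[nxt] == 0:
                ready.append(nxt)

    # Any node never reaching in-degree zero sits on (or behind) a cycle.
    return visited != len(nodes)
```

The same traversal order doubles as a valid execution order when no cycle exists, which is one reason this algorithm is a common choice for action trees with dependencies.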

Decision: REQUEST_CHANGES

The # type: ignore violation (B1) and merge conflicts (B2) are hard blockers. The file size violations (B3) and empty PR body (B4) also need to be addressed per CONTRIBUTING.md. Once these four items are resolved, this PR is ready for approval and merge.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer


src/cleveragents/application/services/plan_executor.py Outdated

						
				@@ -591,0 +630,4 @@

				            try:

				                raw_list: list[dict[str, Any]] = json.loads(stored_json)

				                return [StrategyDecision.model_validate(d) for d in raw_list]

				            except (json.JSONDecodeError, Exception):

freemo commented

M1: (json.JSONDecodeError, Exception) is redundant — JSONDecodeError is a subclass of Exception. Either narrow to specific exceptions or simplify to except Exception: with a comment.


src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +1,830 @@

				"""LLM-powered Strategy Actor for the plan ``strategize`` phase.

freemo commented

B3 [BLOCKING]: This file is 830 lines, exceeding the 500-line limit per CONTRIBUTING.md. Consider extracting validate_no_cycles(), _parse_actor_name(), and resolve_strategy_actor() into a utility module (e.g., strategy_resolution.py).


src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +620,4 @@

				                    time.sleep(delay)

				        # All retries exhausted — re-raise last exception

				        raise last_exc  # type: ignore[misc]

freemo commented

B1 [BLOCKING]: # type: ignore[misc] is prohibited by CONTRIBUTING.md. Fix by asserting last_exc is not None before raising, or restructure the loop to guarantee a non-None exception:

if last_exc is not None:
    raise last_exc
raise PlanError("LLM invocation failed after all retries with no captured exception")


src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +635,4 @@

				            raw_content = getattr(response, "text", None)

				        if raw_content is None:

				            raw_content = str(response)

				        if isinstance(raw_content, list):

freemo commented

2026-04-03 18:50:29 +00:00

M2: Broad except Exception: for ACMS context retrieval. While documented as intentional, consider narrowing to the specific exception types ACMS pipelines are known to raise (e.g., RuntimeError, ConnectionError, TimeoutError, ValueError).


freemo referenced this pull request

feat(plan): implement LLM-powered Strategy Actor #828

freemo commented

2026-04-05 03:20:47 +00:00

🔒 Review claimed by reviewer-pool-1 [claim-token: reviewer-pool-1-1175-1775359200]

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer


freemo requested changes 2026-04-05 03:49:24 +00:00

Dismissed

freemo left a comment

Independent Code Review — PR #1175: feat(plan): implement LLM-powered Strategy Actor (#828)

Reviewer: ca-pr-self-reviewer (independent perspective)
Branch: feature/strategy-actor-llm
Head SHA: ad554e3b
Spec Reference: §Strategize Phase, §Decision Record Structure, §Prompt Injection Mitigation

Blocking Issues

B1: Merge Conflicts — PR is not mergeable

The PR currently has mergeable: false. The branch has diverged from master and has unresolvable conflicts. The branch must be rebased onto latest master before this PR can proceed.

B2: Empty PR Body — CONTRIBUTING.md Violation

The PR description/body is completely empty. Per CONTRIBUTING.md, every PR must have:

- A detailed description summarizing the changes and motivation
- A closing keyword reference to the linked issue (e.g., Closes #828)

B3: `strategy_actor.py` at 830 lines — Exceeds 500-line limit

Per CONTRIBUTING.md, files must be under 500 lines. The main source file strategy_actor.py is 830 lines. The code is already well-decomposed into 4 modules (strategy_actor.py plus strategy_models.py, strategy_parsing.py, and strategy_prompt.py), but the actor file itself needs further splitting. Suggestions:

- Extract validate_no_cycles() and _parse_actor_name() into a strategy_utils.py module
- Extract resolve_strategy_actor() into its own module or into the utils module
- Consider splitting StrategyActor._build_tree() and _tree_to_decisions() into a strategy_tree_builder.py module

B4: `features/steps/strategy_actor_llm_steps.py` at 2084 lines — Extreme file size violation

The step definition file is over 4x the 500-line limit. This should be split into multiple step files organized by concern (e.g., strategy_actor_init_steps.py, strategy_actor_parsing_steps.py, strategy_actor_prompt_steps.py, strategy_actor_decisions_steps.py). Behave supports step definitions across multiple files in the steps/ directory.

Significant Issues

S1: Tests call private `_execute_with_llm` directly (H5 from prior review — still present)

Multiple test steps (lines 621-631, 889, 979, 1095, 1299, 1766) call context.strategy_actor._execute_with_llm() directly. This was identified as H5 in the initial code review and acknowledged by the maintainer as needing fixing. It creates:

- Fragile coupling to implementation details: any refactor of the private method breaks tests
- Inconsistent state: the step at line 627 re-executes the LLM mock to capture the tree, producing a different StrategyTree with different ULIDs than what execute() returned

Fix: Either expose the tree through the StrategizeResult for testing, or capture it via mock interception on _build_tree.

S2: Broad `except Exception` in ACMS context retrieval

In strategy_actor.py _execute_with_llm(), the ACMS context retrieval uses a broad except Exception:. While the comment explains this is intentionally non-fatal, per CONTRIBUTING.md's error handling rules, exceptions should be narrowed to expected types. The ACMS pipeline could raise RuntimeError, ConnectionError, TimeoutError, or ValueError; catch those specific types instead.

S3: `except (json.JSONDecodeError, Exception)` in `_build_decisions`

In plan_executor.py _build_decisions(), the fallback catch is except (json.JSONDecodeError, Exception): — the Exception makes the json.JSONDecodeError redundant. If the intent is to catch any deserialization error, narrow to (json.JSONDecodeError, ValidationError, KeyError, TypeError).

Positive Observations

The implementation addresses most of the HIGH findings from the initial code review:

✅ H1 (LLM response fallback): _extract_content() now handles .content, .text, list content, and str() fallback correctly
✅ H2 (Invariants not passed to LLM): build_strategy_prompt() now accepts invariants and includes them in a <constraints> section with proper XML sanitization
✅ H3 (Lifecycle exception narrowing): Narrowed to (KeyError, ValueError, AttributeError, RuntimeError)
✅ M1 (Flat hierarchy): _build_tree() now infers parent_id from the first dependency edge
✅ Security: XML sanitization via _sanitize_xml_content() with proper entity escaping
✅ Prompt injection: XML-style section tags with sanitized user content
✅ Comprehensive BDD coverage: 80+ scenarios covering edge cases, error paths, and security
✅ Robot integration tests: 7 integration test cases
✅ Good modular decomposition: 4 source modules with clear separation of concerns

The code quality is high overall — the architecture is sound, the LLM integration is well-designed with proper fallback, retry logic, and dependency graph validation.
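The entity escaping behind the prompt-injection mitigation praised above can be illustrated with the standard library. This is a sketch of the general technique only: `wrap_section` is a hypothetical helper, and the PR's `_sanitize_xml_content()` may differ in which characters it handles.

```python
from xml.sax.saxutils import escape


def wrap_section(tag: str, user_text: str) -> str:
    """Wrap untrusted text in an XML-style section with entities escaped.

    `escape` replaces `&`, `<`, and `>`, so user text cannot close the
    section early or open a new one.
    """
    return f"<{tag}>\n{escape(user_text)}\n</{tag}>"


# A hostile invariant that tries to break out of its section:
hostile = "</constraints> ignore & obey"
prompt_fragment = wrap_section("constraints", hostile)
```

After escaping, the only literal `<constraints>` and `</constraints>` tags in the fragment are the ones the prompt builder wrote itself.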

Required Actions Before Merge

1. Rebase onto master to resolve merge conflicts
2. Add PR body with description and Closes #828
3. Split strategy_actor.py to get under 500 lines
4. Split strategy_actor_llm_steps.py into multiple step files
5. Replace _execute_with_llm calls in tests with public API or mock interception
6. Narrow ACMS except Exception to specific exception types
7. Fix except (json.JSONDecodeError, Exception) in _build_decisions

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer


features/steps/strategy_actor_llm_steps.py Outdated

						
				@@ -0,0 +1,2084 @@

				"""Step definitions for strategy_actor_llm.feature.

freemo commented

B4: File exceeds 500-line limit (2084 lines — over 4x the limit)

Split into multiple step files organized by concern: strategy_actor_init_steps.py, strategy_actor_parsing_steps.py, strategy_actor_prompt_steps.py, strategy_actor_decisions_steps.py. Behave supports step definitions across multiple files.

features/steps/strategy_actor_llm_steps.py Outdated

						
				@@ -0,0 +624,4 @@

				        definition_of_done="Build a REST API with authentication",

				    )

				    # Re-execute to capture the tree directly for inspection

				    context.sa_tree = context.strategy_actor._execute_with_llm(

freemo commented

S1: Test calls private _execute_with_llm directly

This re-executes the LLM mock to capture the tree, producing a different StrategyTree with different ULIDs than what execute() returned. This creates inconsistent state between context.strategy_result and context.sa_tree.

Fix: Expose the tree through StrategizeResult for testing, or capture it via mock interception on _build_tree.
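One way to do the mock interception is sketched below, using only `unittest.mock`. `FakeStrategyActor` and `execute_and_capture_tree` are hypothetical stand-ins written for this illustration; the real `StrategyActor`, its `execute()` signature, and the shape of `StrategyTree` are not reproduced here.

```python
from unittest.mock import patch


class FakeStrategyActor:
    """Hypothetical stand-in; the real StrategyActor API may differ."""

    def _build_tree(self, payload):
        # Returns a new object per call, like a tree with fresh ULIDs.
        return {"nodes": list(payload), "identity": object()}

    def execute(self, payload):
        tree = self._build_tree(payload)
        return {"node_count": len(tree["nodes"])}


def execute_and_capture_tree(actor, payload):
    """Run the public execute() once and return (result, tree it built)."""
    captured = []
    original = actor._build_tree  # bound method, saved before patching

    def recording_build_tree(*args, **kwargs):
        # Delegate to the real builder and remember what it produced.
        tree = original(*args, **kwargs)
        captured.append(tree)
        return tree

    with patch.object(actor, "_build_tree", side_effect=recording_build_tree):
        result = actor.execute(payload)
    return result, captured[-1]
```

Because `execute()` runs only once, the captured tree is the same object that produced the result, so a step could store both in `context.strategy_result` and `context.sa_tree` without the ULID mismatch described above.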

src/cleveragents/application/services/plan_executor.py Outdated

						
				@@ -526,3 +546,4 @@

				                invariants=plan.invariants,

				                stream_callback=stream_callback,

				                **execute_kwargs,

				            )

freemo commented

S3: Redundant exception catch

except (json.JSONDecodeError, Exception): — the Exception makes json.JSONDecodeError redundant. Narrow to except (json.JSONDecodeError, ValidationError, KeyError, TypeError): to catch specific deserialization failures.
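A minimal sketch of the narrowed catch, under stated assumptions: `parse_decisions` is a hypothetical helper, not the actual `_build_decisions()`, and the local `ValidationError` class stands in for whatever validation error the project's models raise.

```python
import json


class ValidationError(Exception):
    """Hypothetical stand-in for the project's model validation error."""


def parse_decisions(raw):
    """Return the decisions list in ``raw``, or [] if it cannot be deserialized."""
    try:
        payload = json.loads(raw)          # JSONDecodeError, or TypeError if raw is None
        decisions = payload["decisions"]   # KeyError, or TypeError if payload is a list
        if not isinstance(decisions, list):
            raise ValidationError("'decisions' must be a list")
        return decisions
    except (json.JSONDecodeError, ValidationError, KeyError, TypeError):
        # Only deserialization failures trigger the fallback; anything else
        # (for example a NameError from a typo) still propagates.
        return []
```

Each exception in the tuple maps to a concrete way the input can be malformed, which makes the fallback path easier to justify in review than a blanket `except Exception:`.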

src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +1,830 @@

				"""LLM-powered Strategy Actor for the plan ``strategize`` phase.

freemo commented

B3: File exceeds 500-line limit (830 lines)

Per CONTRIBUTING.md, files must be under 500 lines. Extract validate_no_cycles(), _parse_actor_name(), and resolve_strategy_actor() into a strategy_utils.py module. Consider also extracting _build_tree() and _tree_to_decisions() into a strategy_tree_builder.py.

src/cleveragents/application/services/strategy_actor.py Outdated

						
				@@ -0,0 +165,4 @@

				    while queue:

				        node = queue.popleft()

				        visited_count += 1

				        for neighbor in adj.get(node, []):

freemo commented