fix(cli): add missing --format flag to action create + wire real LLM actors into plan executor #963

2026-03-15T22:46:04Z

freemo commented

2026-03-15 22:46:04 +00:00

Summary

Two critical bug fixes required for the M2 E2E acceptance test (robot/e2e/m2_acceptance.robot) to pass:

Bug #1: `action create` missing `--format` flag (#959)

The action create CLI command was the only action subcommand missing the --format/-f parameter. All other action subcommands (list, show, archive) already accepted --format and routed through _print_action(). Running action create --config action.yaml --format plain failed with a Typer unrecognized-option error.

Fix: Added the fmt parameter to the create() function signature and wired it to _print_action().
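As a rough illustration of this fix, the sketch below routes a `create()` function through a shared formatter. The body and signature of `_print_action()` are not shown in this PR, so the version here is hypothetical, as are the action fields. The Typer declaration of `--format`/`-f` is omitted to keep the sketch dependency-free.

```python
import json
from typing import Any


def _print_action(action: dict[str, Any], fmt: str = "plain") -> str:
    """Render an action as plain text or JSON, print it, and return the text.

    Hypothetical stand-in for the project's real helper.
    """
    if fmt == "json":
        rendered = json.dumps(action, indent=2, sort_keys=True)
    elif fmt == "plain":
        rendered = "\n".join(f"{key}: {value}" for key, value in sorted(action.items()))
    else:
        raise ValueError(f"unsupported format: {fmt!r}")
    print(rendered)
    return rendered


def create(config: dict[str, Any], fmt: str = "plain") -> str:
    """Create an action from a config mapping and print it in the requested format.

    In the real CLI, `fmt` would be declared as the --format/-f option;
    here it is an ordinary keyword argument.
    """
    # Assumed action shape, for illustration only.
    action = {"name": config["name"], "status": "created"}
    return _print_action(action, fmt)
```

With the parameter in place, `create` behaves like the other subcommands: the caller chooses the output format and the shared helper does the rendering.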

Bug #2: `plan execute` always used stub actors (#960)

The plan execute CLI command only performed phase transitions (Strategize → Execute) and never invoked PlanExecutor to drive the strategize or execute actors. In addition, PlanExecutor.__init__ unconditionally created StrategizeStubActor() and ExecuteStubActor(), which parse text locally and return empty changesets, so no real LLM call was ever made.

Fix:

- Added _get_plan_executor() helper that resolves ProviderRegistry from the DI container and constructs LLMStrategizeActor / LLMExecuteActor for real LLM invocations
- Updated execute_plan CLI to detect plan phase/state and run the appropriate actor:
  - Strategize/queued → run strategize actor → transition to Execute
  - Strategize/complete → phase transition only (backward compat)
  - Execute/queued → run execute actor → mark complete
- New llm_actors.py module with LLMStrategizeActor and LLMExecuteActor that resolve provider/model actor names to live LangChain LLM instances
- PlanExecutor.__init__ now accepts optional strategize_actor and execute_actor parameters
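The phase/state dispatch described above can be sketched as a small function. This is a minimal illustration, not the project's code: the `Plan` dataclass, the callable parameters, and the return strings are hypothetical, and the state a plan lands in after moving to Execute is assumed to be `queued`.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Plan:
    phase: str  # "Strategize" or "Execute"
    state: str  # "queued" or "complete"


def execute_plan(
    plan: Plan,
    run_strategize: Callable[[Plan], None],
    run_execute: Callable[[Plan], None],
) -> str:
    """Run the actor that matches the plan's current phase/state."""
    key = (plan.phase, plan.state)
    if key == ("Strategize", "queued"):
        run_strategize(plan)
        # Assumption: a freshly transitioned plan waits in Execute/queued.
        plan.phase, plan.state = "Execute", "queued"
        return "strategized"
    if key == ("Strategize", "complete"):
        # Backward-compatible path: phase transition only, no actor call.
        plan.phase, plan.state = "Execute", "queued"
        return "transitioned"
    if key == ("Execute", "queued"):
        run_execute(plan)
        plan.state = "complete"
        return "executed"
    raise ValueError(f"nothing to do for {plan.phase}/{plan.state}")
```

Calling `execute_plan` twice on a plan that starts in Strategize/queued would first run the strategize callable and then the execute callable, which mirrors the two `plan execute` invocations in the lifecycle.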

Existing mock-based BDD tests remain backward-compatible via duck-typing fallback.
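One way the optional actor parameters and duck typing could fit together is sketched below. The class names come from this PR, but the `run()` method, the return shapes, and the `strategize()`/`execute()` methods are assumptions made for illustration; the real interfaces are not shown here.

```python
from typing import Any, Optional


class StrategizeStubActor:
    """Stub that works locally and never calls an LLM."""

    def run(self, text: str) -> dict[str, Any]:
        return {"steps": [text], "source": "stub"}


class ExecuteStubActor:
    """Stub that returns an empty changeset."""

    def run(self, text: str) -> dict[str, Any]:
        return {"changeset": [], "source": "stub"}


class PlanExecutor:
    def __init__(
        self,
        strategize_actor: Optional[Any] = None,
        execute_actor: Optional[Any] = None,
    ) -> None:
        # Fall back to the stubs when no actor is injected, so callers
        # that construct PlanExecutor() with no arguments keep working.
        self.strategize_actor = (
            strategize_actor if strategize_actor is not None else StrategizeStubActor()
        )
        self.execute_actor = (
            execute_actor if execute_actor is not None else ExecuteStubActor()
        )

    def strategize(self, text: str) -> dict[str, Any]:
        # Duck typing: any object exposing run(text) is accepted.
        return self.strategize_actor.run(text)

    def execute(self, text: str) -> dict[str, Any]:
        return self.execute_actor.run(text)
```

Under these assumptions a test can pass a mock or fake with a matching `run()` method, while the CLI passes LLM-backed actors, and neither side needs a shared base class.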

Testing

- Added Behave BDD scenarios for --format plain and --format json on action create
- Added Behave BDD scenarios testing custom actor injection into PlanExecutor
- All changes validated against M2 E2E test expectations

Issues Closed

- Closes #959
- Closes #960

freemo added 1 commit 2026-03-15 22:46:05 +00:00

test(e2e): E2E acceptance criteria for M2 (v3.1.0) — actor compiler and LLM integration

CI / benchmark-publish (pull_request) Has been skipped

CI / lint (pull_request) Successful in 15s

CI / quality (pull_request) Successful in 20s

CI / build (pull_request) Successful in 19s

CI / security (pull_request) Successful in 35s

CI / typecheck (pull_request) Successful in 36s

CI / e2e_tests (pull_request) Failing after 52s

CI / integration_tests (pull_request) Successful in 2m58s

CI / unit_tests (pull_request) Successful in 3m36s

CI / docker (pull_request) Successful in 35s

CI / coverage (pull_request) Successful in 4m41s

CI / benchmark-regression (pull_request) Failing after 40m8s

d879ba1f96

Add Robot Framework E2E test suite robot/e2e/m2_acceptance.robot exercising
M2 acceptance criteria with zero mocking. Test creates a temp git repo with
sample project files, registers a custom actor via CLI, sets up resource and
project, creates an action referencing the actor, and runs the full plan
lifecycle (use → execute strategize → execute → diff → apply). Validates
actor YAML compilation, skill registry, tool lifecycle, and LLM integration
through real CLI invocations with real provider API keys. Uses flexible
structural assertions and expected_rc=None for LLM-dependent commands.

ISSUES CLOSED: #742

freemo added labels 2026-03-15 22:46:10 +00:00

freemo force-pushed bugfix/m2-action-format-plan-executor from d808c010d9 to 0a0e2796a5

2026-03-15 23:18:11 +00:00


freemo force-pushed bugfix/m2-action-format-plan-executor from 0a0e2796a5 to dfa05a6909

2026-03-16 00:00:15 +00:00
