test: add TDD bug-capture test for #969 — plan correct plan_id handling #1051

2026-03-18T08:58:12Z

hurui200320 commented

2026-03-18 08:58:12 +00:00

Summary

Add TDD bug-capture tests for bug #969 (plan correct expects decision_id but M3 acceptance test passes plan_id). These tests prove the bug exists and will serve as regression guards once the fix in #969 is merged.

Changes

Behave test (features/tdd_plan_correct_plan_id.feature): Two scenarios tagged @tdd_expected_fail @tdd_bug @tdd_bug_969 — one for --mode revert and one for --mode append — that invoke plan correct <plan_id> (without --plan flag) and assert the command resolves the plan_id to its root decision as target_decision_id. Both modes are tested because the bug affects target_decision_id resolution before mode-specific branching.
Step definitions (features/steps/tdd_plan_correct_plan_id_steps.py): Mock setup for DI container (DecisionService), CorrectionService, and _resolve_active_plan_id. Uses tpcpid step prefix per project conventions.
Shared fixtures (features/mocks/tdd_plan_correct_plan_id_fixtures.py): Centralised constants, patch targets, mock builders (make_decision_ns, make_mock_container, make_correction_svc, make_default_decisions, make_default_container), and build_cli_args helper. Both the Behave steps and Robot helper import from this shared module, eliminating code duplication and drift risk.
Robot test (robot/tdd_plan_correct_plan_id.robot): Two integration-level tests (revert + append) with tdd_expected_fail tdd_bug tdd_bug_969 tags, exercising the same code paths via the helper script.
Robot helper (robot/helper_tdd_plan_correct_plan_id.py): Standalone helper that exits 0 with sentinel when the bug is fixed, exits 1 when the bug is present. Imports shared fixtures from features/mocks/.
Changelog (CHANGELOG.md): Added entry under Unreleased for #979.

Bug Description

The correct_decision function in cleveragents.cli.commands.plan declares decision_id as its first positional argument. When the M3 acceptance test calls plan correct <plan_id> --mode revert --guidance "...", the plan_id is captured as decision_id and used directly as target_decision_id in svc.request_correction(). Since the plan_id is not a valid decision_id, the correction service cannot find the targeted decision. The same bug path is exercised by --mode append.

How TDD Expected-Fail Works

The @tdd_expected_fail tag causes the test framework to invert the result: the test passes CI when the underlying assertion fails (proving the bug exists) and fails CI if the assertion passes (bug was fixed without removing the tag).
When bug #969 is fixed, the developer removes the @tdd_expected_fail tag, and the test runs normally as a regression guard.

Quality Gates

All nox sessions pass:

nox -s lint ✅
nox -s typecheck ✅ (0 errors)
nox -s unit_tests ✅ (387 features, 11121 scenarios, 0 failures)
nox -s integration_tests ✅ (1561 tests, 0 failures)
nox -s e2e_tests ✅ (16 tests, 0 failures)
nox -s coverage_report ✅ (≥97%)

Closes #979

## Summary Add TDD bug-capture tests for bug #969 (`plan correct` expects `decision_id` but M3 acceptance test passes `plan_id`). These tests prove the bug exists and will serve as regression guards once the fix in #969 is merged. ### Changes - **Behave test** (`features/tdd_plan_correct_plan_id.feature`): Two scenarios tagged `@tdd_expected_fail @tdd_bug @tdd_bug_969` — one for `--mode revert` and one for `--mode append` — that invoke `plan correct <plan_id>` (without `--plan` flag) and assert the command resolves the plan_id to its root decision as `target_decision_id`. Both modes are tested because the bug affects `target_decision_id` resolution **before** mode-specific branching. - **Step definitions** (`features/steps/tdd_plan_correct_plan_id_steps.py`): Mock setup for DI container (DecisionService), CorrectionService, and `_resolve_active_plan_id`. Uses `tpcpid` step prefix per project conventions. - **Shared fixtures** (`features/mocks/tdd_plan_correct_plan_id_fixtures.py`): Centralised constants, patch targets, mock builders (`make_decision_ns`, `make_mock_container`, `make_correction_svc`, `make_default_decisions`, `make_default_container`), and `build_cli_args` helper. Both the Behave steps and Robot helper import from this shared module, eliminating code duplication and drift risk. - **Robot test** (`robot/tdd_plan_correct_plan_id.robot`): Two integration-level tests (revert + append) with `tdd_expected_fail tdd_bug tdd_bug_969` tags, exercising the same code paths via the helper script. - **Robot helper** (`robot/helper_tdd_plan_correct_plan_id.py`): Standalone helper that exits 0 with sentinel when the bug is fixed, exits 1 when the bug is present. Imports shared fixtures from `features/mocks/`. - **Changelog** (`CHANGELOG.md`): Added entry under Unreleased for #979. ### Bug Description The `correct_decision` function in `cleveragents.cli.commands.plan` declares `decision_id` as its first positional argument. When the M3 acceptance test calls `plan correct <plan_id> --mode revert --guidance "..."`, the plan_id is captured as `decision_id` and used directly as `target_decision_id` in `svc.request_correction()`. Since the plan_id is not a valid decision_id, the correction service cannot find the targeted decision. The same bug path is exercised by `--mode append`. ### How TDD Expected-Fail Works - The `@tdd_expected_fail` tag causes the test framework to invert the result: the test passes CI when the underlying assertion fails (proving the bug exists) and fails CI if the assertion passes (bug was fixed without removing the tag). - When bug #969 is fixed, the developer removes the `@tdd_expected_fail` tag, and the test runs normally as a regression guard. ### Quality Gates All nox sessions pass: - `nox -s lint` ✅ - `nox -s typecheck` ✅ (0 errors) - `nox -s unit_tests` ✅ (387 features, 11121 scenarios, 0 failures) - `nox -s integration_tests` ✅ (1561 tests, 0 failures) - `nox -s e2e_tests` ✅ (16 tests, 0 failures) - `nox -s coverage_report` ✅ (≥97%) Closes #979

hurui200320 added the

Type

Testing

label 2026-03-18 08:58:19 +00:00

hurui200320 added this to the v3.2.0 milestone 2026-03-18 08:58:19 +00:00

hurui200320 added a new dependency 2026-03-18 09:00:15 +00:00

#979 TDD: Write failing test for #969 — plan correct expects decision_id but test passes plan_id

hurui200320 force-pushed tdd/m3-plan-correct-plan-id from 5a08e12ebd to 433aa7db21

2026-03-18 11:56:02 +00:00

Compare

freemo approved these changes 2026-03-19 04:55:12 +00:00

Dismissed

freemo left a comment

Code Review — PR #1051 `test: TDD bug-capture test for #969 — plan correct plan_id handling`

Clean TDD test with correct tag triplet (@tdd_expected_fail @tdd_bug @tdd_bug_969). Excellent use of shared fixtures in features/mocks/tdd_plan_correct_plan_id_fixtures.py — this eliminates duplication between Behave steps and Robot helper and should be the standard pattern for future TDD PRs. Two scenarios (revert + append modes) thoroughly test the root bug.

Approved. No issues found.

## Code Review — PR #1051 `test: TDD bug-capture test for #969 — plan correct plan_id handling` Clean TDD test with correct tag triplet (`@tdd_expected_fail @tdd_bug @tdd_bug_969`). Excellent use of shared fixtures in `features/mocks/tdd_plan_correct_plan_id_fixtures.py` — this eliminates duplication between Behave steps and Robot helper and should be the standard pattern for future TDD PRs. Two scenarios (revert + append modes) thoroughly test the root bug. **Approved.** No issues found.

freemo requested review from brent.edwards 2026-03-19 05:16:15 +00:00

freemo requested review from freemo 2026-03-19 05:16:15 +00:00

hurui200320 force-pushed tdd/m3-plan-correct-plan-id from 433aa7db21 to 7b8b15f75c

2026-03-19 07:23:19 +00:00

Compare

hurui200320 dismissed freemo's review 2026-03-19 07:23:19 +00:00

Reason:

New commits pushed, approval review dismissed automatically according to repository settings

hurui200320 scheduled this pull request to auto merge when all checks succeed 2026-03-19 07:29:26 +00:00

hurui200320 merged commit 0b415a6e5e into master

2026-03-19 07:31:13 +00:00

hurui200320 deleted branch tdd/m3-plan-correct-plan-id