test: add TDD bug-capture test for #1025 — plan correct auto-resolve #1172

Merged
brent.edwards merged 3 commits from tdd/m4-plan-correct-auto-resolve into master 2026-04-01 00:40:47 +00:00
Member

Summary

  • add Behave @tdd_expected_fail bug-capture scenarios for plan correct auto-resolve without --plan
  • add shared mock fixtures and Robot integration coverage for isolated-container divergence reproducing bug #1025
  • add ASV benchmark coverage for active-plan filtering and document the new TDD capture in changelog
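
For readers unfamiliar with ASV, the shape of an active-plan filtering benchmark might look roughly like the sketch below. It is not the contents of `benchmarks/tdd_plan_correct_auto_resolve_bench.py`: the `filter_active` helper, the plan dictionaries, and the parameter values are all hypothetical. Only the ASV conventions themselves (`params`, `param_names`, `setup`, and `time_`-prefixed methods) are standard.

```python
# Hypothetical ASV benchmark sketch; names and data shapes are assumptions,
# not the code added by this PR.


def filter_active(plans):
    """Return only the plans whose status is 'active' (assumed data shape)."""
    return [plan for plan in plans if plan["status"] == "active"]


class ActivePlanFilterSuite:
    # ASV runs each time_* method once per value listed in params.
    params = [100, 1_000, 10_000]
    param_names = ["plan_count"]

    def setup(self, plan_count):
        # Build the fixture outside the timed region: one plan in ten is active.
        self.plans = [
            {"id": f"plan-{i}", "status": "active" if i % 10 == 0 else "archived"}
            for i in range(plan_count)
        ]

    def time_filter_active(self, plan_count):
        # Only this call is timed by ASV.
        filter_active(self.plans)
```

Keeping fixture construction in `setup` means the timing reflects the filtering step alone, which is what a baseline for later comparison would need.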

Testing

  • nox -s unit_tests -- features/tdd_plan_correct_auto_resolve.feature
  • nox -s integration_tests -- --include tdd_bug_1025
  • nox -s lint
  • nox -s typecheck
  • nox -s coverage_report
  • nox

Closes #1035

Owner

Code Review Note

Unable to review — the branch for this PR was not available in the local clone and the Forgejo server experienced downtime during the review session. Additionally, this PR has no labels assigned. Per CONTRIBUTING.md, every PR must carry at least one Type/ label. Please add the appropriate label (likely Type/Testing for a TDD bug-capture test).

freemo requested changes 2026-03-28 23:13:15 +00:00
Dismissed
freemo left a comment

Day 48 Planning Review — TDD PR for Bug #1025

The test quality is good — two Behave scenarios simulate divergent DI containers where _resolve_active_plan_id() gets an empty DB, proving the auto-resolve failure. Robot tests provide integration-level coverage.
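
The divergence described above can be illustrated with a minimal sketch. Everything in it is hypothetical: the `Container` and `InMemoryPlanStore` classes and the standalone `resolve_active_plan_id` function are stand-ins chosen for illustration, not the project's actual DI container or its `_resolve_active_plan_id()` implementation. The point is only that a resolver reading from a second, freshly built container sees an empty store and cannot find the active plan.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class InMemoryPlanStore:
    """Hypothetical stand-in for the plan database."""

    plans: dict[str, str] = field(default_factory=dict)  # plan_id -> status

    def add(self, plan_id: str, status: str) -> None:
        self.plans[plan_id] = status

    def active_ids(self) -> list[str]:
        return [pid for pid, status in self.plans.items() if status == "active"]


@dataclass
class Container:
    """Hypothetical DI container: every instance builds its own store."""

    store: InMemoryPlanStore = field(default_factory=InMemoryPlanStore)


def resolve_active_plan_id(container: Container) -> Optional[str]:
    """Return the single active plan id, or None if it cannot be resolved."""
    active = container.store.active_ids()
    return active[0] if len(active) == 1 else None


# The plan is created through one container...
writer = Container()
writer.store.add("plan-1", "active")

# ...but the resolver runs against a separately constructed container.
reader = Container()

assert resolve_active_plan_id(writer) == "plan-1"
assert resolve_active_plan_id(reader) is None  # empty DB: auto-resolve fails
```

A bug-capture test built on this idea asserts the behaviour the user expects (the reader resolving `plan-1`), so it fails until both code paths share one container.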

Tag verification: PASS — uses @tdd_bug @tdd_bug_1025 @tdd_expected_fail. Correctly uses @tdd_bug (not @tdd_issue).

Blocking issues:

  1. Not mergeable (mergeable: false) — merge conflicts with master. Rebase required.

  2. Scope concern — 7 files and 730 lines for a TDD bug-capture PR is substantial. The ASV benchmark (benchmarks/tdd_plan_correct_auto_resolve_bench.py) is beyond the scope of "capture the bug with a tagged test." TDD PRs should ideally contain only the test that proves the bug exists. Consider removing the benchmark and including it in the bug fix PR instead.

  3. @freemo's comment about branch unavailability needs a response.

Requested changes: Rebase onto master, respond to freemo's comment, consider removing the benchmark from this TDD PR.

freemo approved these changes 2026-03-30 04:22:26 +00:00
Dismissed
freemo left a comment

Review: APPROVED

Exemplary TDD bug-capture PR. Follows all project conventions: shared fixtures in features/mocks/, properly prefixed step definitions, Robot helper with CLI dispatch, ASV benchmark. Multi-level testing with @tdd_expected_fail tag usage. Well-documented with bug references.

Owner

Day 50 Planning — Metadata gaps identified.

This TDD PR for bug #1025 needs:

  1. Milestone assignment — currently has none. Should be assigned to v3.3.0 (matching bug #1025's milestone).
  2. Labels — needs Type/Testing, Priority/Critical, MoSCoW/Must Have per TDD issue policy.

@freemo — The PR is now APPROVED (latest review). @CoreRasurae has been requested as reviewer. Once the milestone and labels are added, this should be merged promptly — bug #1025 fix depends on it.

The 730-line scope concern from the earlier review was addressed — the benchmark content belongs in the TDD PR since it establishes the baseline for the bug-capture test.

brent.edwards force-pushed tdd/m4-plan-correct-auto-resolve from fb5b55fad5 to edc7d3fe56 2026-03-31 22:28:18 +00:00
fb5b55fad5: All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 30s
CI / build (pull_request) Successful in 20s
CI / quality (pull_request) Successful in 3m39s
CI / typecheck (pull_request) Successful in 4m18s
CI / security (pull_request) Successful in 4m50s
CI / unit_tests (pull_request) Successful in 5m13s
CI / docker (pull_request) Successful in 56s
CI / integration_tests (pull_request) Successful in 6m54s
CI / e2e_tests (pull_request) Successful in 10m39s
CI / coverage (pull_request) Successful in 11m21s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 59m27s
edc7d3fe56: Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 19s
CI / helm (pull_request) Successful in 21s
CI / lint (pull_request) Successful in 3m18s
CI / quality (pull_request) Successful in 3m42s
CI / typecheck (pull_request) Successful in 3m55s
CI / integration_tests (pull_request) Failing after 4m2s
CI / security (pull_request) Successful in 4m5s
CI / unit_tests (pull_request) Failing after 4m49s
CI / docker (pull_request) Has been skipped
CI / coverage (pull_request) Successful in 12m27s
CI / e2e_tests (pull_request) Successful in 19m57s
CI / status-check (pull_request) Failing after 1s
CI / benchmark-regression (pull_request) Has been cancelled
brent.edwards dismissed freemo's review 2026-03-31 22:28:18 +00:00
Reason:

New commits pushed, approval review dismissed automatically according to repository settings

test(tdd): align #1025 expected-fail tags with listener validation
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 17s
CI / helm (pull_request) Successful in 22s
CI / lint (pull_request) Successful in 3m20s
CI / quality (pull_request) Successful in 3m56s
CI / integration_tests (pull_request) Successful in 3m59s
CI / typecheck (pull_request) Successful in 4m1s
CI / security (pull_request) Successful in 4m10s
CI / unit_tests (pull_request) Successful in 4m43s
CI / docker (pull_request) Successful in 1m35s
CI / coverage (pull_request) Successful in 13m43s
CI / e2e_tests (pull_request) Successful in 18m50s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Has been cancelled
444f07a37c
Update the #1025 Behave and Robot TDD capture tests to use tdd_issue/tdd_issue_1025 tags required by the tdd_expected_fail listener. This removes hook/tag-validation failures in unit_tests and integration_tests while preserving expected-fail inversion behavior.
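
As a rough illustration of the behaviour this commit message describes, the sketch below combines tag validation with expected-fail inversion. It is an assumption-laden model, not the project's listener: the function names, the regular expression, and the error message are hypothetical, and the real validation rules may differ. The only details taken from the message are that `tdd_expected_fail` requires `tdd_issue` and `tdd_issue_<number>` tags and that the raw result is inverted.

```python
import re

# Assumed tag format for the per-issue tag, e.g. "tdd_issue_1025".
ISSUE_TAG = re.compile(r"^tdd_issue_\d+$")


def validate_tags(tags):
    """Raise ValueError if an expected-fail test lacks the assumed issue tags."""
    if "tdd_expected_fail" not in tags:
        return
    has_issue_number = any(ISSUE_TAG.match(tag) for tag in tags)
    if "tdd_issue" not in tags or not has_issue_number:
        raise ValueError(
            "tdd_expected_fail requires tdd_issue and tdd_issue_<number> tags"
        )


def effective_result(tags, raw_passed):
    """Invert the raw outcome for expected-fail tests; pass others through."""
    validate_tags(tags)
    if "tdd_expected_fail" in tags:
        # The bug still reproduces when the raw test fails, which counts as
        # a pass for CI purposes.
        return not raw_passed
    return raw_passed


ok_tags = {"tdd_issue", "tdd_issue_1025", "tdd_expected_fail"}
assert effective_result(ok_tags, raw_passed=False) is True
assert effective_result(ok_tags, raw_passed=True) is False
```

Under this model, a test tagged only with `tdd_bug` and `tdd_bug_1025` alongside `tdd_expected_fail` would be rejected at validation time, which matches the kind of hook failure the commit says it removes.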
Merge branch 'master' into tdd/m4-plan-correct-auto-resolve
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 16s
CI / helm (pull_request) Successful in 23s
CI / typecheck (pull_request) Successful in 1m10s
CI / security (pull_request) Successful in 1m11s
CI / lint (pull_request) Successful in 3m20s
CI / quality (pull_request) Successful in 3m42s
CI / unit_tests (pull_request) Successful in 9m9s
CI / docker (pull_request) Successful in 1m18s
CI / coverage (pull_request) Successful in 12m27s
CI / e2e_tests (pull_request) Successful in 19m59s
CI / integration_tests (pull_request) Successful in 25m8s
CI / status-check (pull_request) Successful in 2s
CI / benchmark-regression (pull_request) Successful in 55m21s
0f3191ea06
brent.edwards deleted branch tdd/m4-plan-correct-auto-resolve 2026-04-01 00:40:53 +00:00
No reviewers
No milestone
No project
No assignees
2 participants
Reference
cleveragents/cleveragents-core!1172