test: add TDD bug-capture test for #1152 — budget eviction deletes instead of demotes #1218
No reviewers
Labels
No labels
auto/needs-reevaluation
controller-managed
auto/blocked-by-deps
auto/ci-timeout
auto/claimed-implementer
auto/claimed-merge
auto/claimed-reviewer
auto/driver-down
auto/invariant-violation
auto/last-attempt-tier-0
auto/last-attempt-tier-1
auto/last-attempt-tier-2
auto/last-attempt-tier-min
Automation Tracking
auto/needs-conflict-resolution
auto/needs-implementer
auto/postmortem
auto/ready-to-merge
auto/restart-throttled
auto/revert
auto/sentinel
auto/stale-inactivity
auto/unstable
Blocked
Bounty
$100
Bounty
$1000
Bounty
$10000
Bounty
$20
Bounty
$2000
Bounty
$250
Bounty
$50
Bounty
$500
Bounty
$5000
Bounty
$750
MoSCoW
Could have
MoSCoW
Must have
MoSCoW
Should have
Needs Feedback
Points
1
Points
13
Points
2
Points
21
Points
3
Points
34
Points
5
Points
55
Points
8
Points
88
Priority
Backlog
Priority
CI Blocker
Priority
Critical
Priority
High
Priority
Low
Priority
Medium
Signed-off: Owner
Signed-off: Scrum Master
Signed-off: Tech Lead
Spike
State
Completed
State
Duplicate
State
In Progress
State
In Review
State
Paused
State
Unverified
State
Verified
State
Wont Do
Type
Automation
Type
Bug
Type
Discussion
Type
Documentation
Type
Epic
Type
Feature
Type
Legendary
Type
Refactor
Type
Support
Type
Task
Type
Testing
No project
No assignees
2 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
cleveragents/cleveragents-core!1218
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "tdd/m5-budget-eviction-delete"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
Add TDD issue-capture tests for bug #1152:
_enforce_hot_budget()andevict_lru()permanently delete hot-tier fragments instead of demoting them to the warm tier, violating the specification's downward tier lifecycle.Changes
features/tdd_budget_eviction_deletes_not_demotes.feature+ step file): Two scenarios tagged@tdd_expected_fail @tdd_issue @tdd_issue_1152 @mock_onlythat prove the bug exists by asserting evicted fragments should be in the warm tier (they aren't — they're destroyed).robot/tdd_budget_eviction_deletes_not_demotes.robot+ helper): Two integration tests mirroring the Behave scenarios withtdd_expected_fail tdd_issue tdd_issue_1152tags.Motivation
Per CONTRIBUTING.md Bug Fix Workflow, every bug must have a TDD counterpart that captures the buggy behavior before the fix is implemented. This PR provides that counterpart for #1152.
Test Design
ContextTierServicewith a 100-token hot-tier budgetevict_lru) to trigger evictiontier == WARMCurrently,
_enforce_hot_budget()doesdel self._hot[oldest_id]andevict_lru()doesdel store[fid]— both permanently destroy the fragment. The@tdd_expected_failtag inverts these assertion failures to CI passes.Quality Gates
All 11 nox sessions pass:
Closes #1183
28fee8e59d8cbccd17ca🔒 Claimed by pr-reviewer-2. Starting independent code review.
Code Review — PR #1218: TDD bug-capture test for #1152
Overall Assessment
The test design is excellent — it correctly captures the bug described in #1152 with clear, well-structured BDD scenarios and matching Robot Framework integration tests. The feature file documentation, step docstrings, and assertion messages are thorough and will be very helpful when the bug fix is implemented.
However, there is one blocking issue that must be resolved before merge.
🚫 Blocking:
# type: ignore[arg-type]violationsBoth the Behave step file and the Robot helper contain
# type: ignore[arg-type]suppressions, which are explicitly forbidden by CONTRIBUTING.md ("No# type: ignoresuppressions").Affected files:
features/steps/tdd_budget_eviction_deletes_not_demotes_steps.py—_make_eviction_fragment()helperrobot/helper_tdd_budget_eviction_deletes_not_demotes.py—_make_fragment()helperRoot cause: Both helpers build a
dict[str, object]and unpack it with**kwargs, which Pyright cannot narrow to the specific typesTieredFragment.__init__expects.Fix: Construct
TieredFragmentdirectly with keyword arguments instead of building an intermediate dict. Thelast_accessedconditional can be handled with two return paths or by using a default value. See inline comments for the specific fix.✅ What looks good
tdd_budget_eviction_deletes_not_demotes.feature): Clean Gherkin, proper tags (@tdd_expected_fail @tdd_issue @tdd_issue_1152 @mock_only), excellent inline documentation explaining the bug and the expected-failure mechanism.store()and explicitevict_lru()) correctly target the two code paths identified in bug #1152.ISSUES CLOSED: #1183footer.Closes #1183).ℹ️ Minor note (non-blocking)
Issue #1183 body mentions
@tdd_bug/@tdd_bug_1152tags, but the implementation uses@tdd_issue/@tdd_issue_1152. The PR body and commit message consistently document the@tdd_issueconvention. If@tdd_issueis the current project convention, the issue body should be updated for consistency. Not blocking.@ -0,0 +39,4 @@def _make_eviction_fragment(fragment_id: str,tier: ContextTier,token_count: int = 50,Blocking:
# type: ignore[arg-type]is forbidden by CONTRIBUTING.md.The intermediate
dict[str, object]forces the type suppression because Pyright cannot narrowobjectvalues to the specific typesTieredFragmentexpects.Fix: Construct
TieredFragmentdirectly with keyword arguments:This eliminates the
dict[str, object]entirely and passes correctly-typed arguments directly.@ -0,0 +52,4 @@) -> TieredFragment:"""Create a ``TieredFragment`` with the given properties."""kwargs: dict[str, object] = {"fragment_id": fragment_id,Blocking: Same
# type: ignore[arg-type]violation as in the Behave step file. Apply the same fix — constructTieredFragmentdirectly with keyword arguments instead of building an intermediatedict[str, object].Review: APPROVED ✅
Changes Reviewed
features/tdd_budget_eviction_deletes_not_demotes.feature— Behave BDD feature file with 2 scenarios tagged@tdd_expected_failfeatures/steps/tdd_budget_eviction_deletes_not_demotes_steps.py— Step definitions for the Behave scenariosrobot/tdd_budget_eviction_deletes_not_demotes.robot— Robot Framework integration testsrobot/helper_tdd_budget_eviction_deletes_not_demotes.py— Robot helper scriptFix Applied
Removed 2 forbidden
# type: ignore[arg-type]suppressions from the step file and Robot helper. The root cause was constructingTieredFragmentviadict[str, object]unpacking (**kwargs), which Pyright correctly flagged as type-unsafe. Fixed by constructingTieredFragmentdirectly with typed keyword arguments and using post-construction assignment for the optionallast_accessedfield (leveraging Pydantic'svalidate_assignment=True).Specification Alignment
_enforce_hot_budget()andevict_lru()permanently delete fragments instead of demoting to warm tier@tdd_expected_failtag correctly inverts results so CI passes while the bug is openCode Quality
# type: ignoresuppressions remainPR Metadata
needs feedbacklabelReview claimed by reviewer pool instance reviewer-pool-1. Dispatching independent code review.
Independent Code Review — PR #1218 (reviewer-pool-1)
Summary
TDD bug-capture tests for #1152: budget eviction permanently deletes hot-tier fragments instead of demoting them to the warm tier. This PR adds Behave BDD scenarios and Robot Framework integration tests that prove the bug exists, tagged with
@tdd_expected_failso CI passes while the bug remains open.Files Reviewed
features/tdd_budget_eviction_deletes_not_demotes.feature— 2 BDD scenariosfeatures/steps/tdd_budget_eviction_deletes_not_demotes_steps.py— Step definitions (~200 lines)robot/tdd_budget_eviction_deletes_not_demotes.robot— 2 Robot Framework testsrobot/helper_tdd_budget_eviction_deletes_not_demotes.py— Robot helper script (~200 lines)Specification Alignment ✅
_enforce_hot_budget()doingdel self._hot[oldest_id]andevict_lru()doingdel store[fid]Type Safety ✅
# type: ignoresuppressions — the previous review cycle caught and fixed thisTieredFragmentconstructed with direct keyword arguments;last_accessedset via post-construction assignment to avoid dict unpacking type issuesTest Quality ✅
store()and explicitevict_lru())@tdd_expected_failtag correctly inverts results for CI while bug is open@mock_onlytag on Behave feature prevents unnecessary DB setupCode Quality ✅
noqa: E402for necessary sys.path manipulation)PR Process Compliance ✅
test: add TDD bug-capture test for #1152 — budget eviction deletes instead of demotesCloses #1183in PR bodyISSUES CLOSED: #1183in commit footersneeds feedbacklabelDecision: APPROVED — merging via squash