test(acms): TDD failing tests for context tier runtime logic (bug #821) #1058

2026-03-18T20:18:01Z

brent.edwards commented

2026-03-18 20:18:01 +00:00

Summary

TDD expected-fail tests proving bug #821 exists: ContextTierService has data models for hot/warm/cold tiers but no runtime logic for automatic promotion, demotion, or eviction.

Promotion on access: Accessing a cold-tier fragment repeatedly via get() does NOT auto-promote it — get() updates access_count/last_accessed but never calls promote()
Demotion on staleness: No staleness enforcement method exists — tried enforce_staleness(), apply_tier_policy(), tick(), etc. — none are implemented
Eviction on budget overflow: store() does NOT enforce TierBudget.max_tokens_hot — the hot tier grows without bound

Files Added

File	Purpose
`features/tdd_context_tier_runtime.feature`	3 Behave scenarios tagged `@tdd_expected_fail @tdd_bug @tdd_bug_821 @mock_only`
`features/steps/tdd_context_tier_runtime_steps.py`	Type-annotated step definitions exercising real `ContextTierService`
`robot/tdd_context_tier_runtime.robot`	3 Robot Framework integration tests tagged `tdd_expected_fail`
`robot/helper_tdd_context_tier_runtime.py`	Helper script for Robot tests with 3 subcommands

Verification

nox -s lint — passed
nox -s typecheck — passed (0 errors)
nox -s unit_tests -- features/tdd_context_tier_runtime.feature — 3 scenarios passed (all assertions fail as expected, @tdd_expected_fail inverts to CI pass)

ISSUES CLOSED: #840

## Summary TDD expected-fail tests proving bug #821 exists: `ContextTierService` has data models for hot/warm/cold tiers but **no runtime logic** for automatic promotion, demotion, or eviction. - **Promotion on access**: Accessing a cold-tier fragment repeatedly via `get()` does NOT auto-promote it — `get()` updates `access_count`/`last_accessed` but never calls `promote()` - **Demotion on staleness**: No staleness enforcement method exists — tried `enforce_staleness()`, `apply_tier_policy()`, `tick()`, etc. — none are implemented - **Eviction on budget overflow**: `store()` does NOT enforce `TierBudget.max_tokens_hot` — the hot tier grows without bound ### Files Added | File | Purpose | |------|---------| | `features/tdd_context_tier_runtime.feature` | 3 Behave scenarios tagged `@tdd_expected_fail @tdd_bug @tdd_bug_821 @mock_only` | | `features/steps/tdd_context_tier_runtime_steps.py` | Type-annotated step definitions exercising real `ContextTierService` | | `robot/tdd_context_tier_runtime.robot` | 3 Robot Framework integration tests tagged `tdd_expected_fail` | | `robot/helper_tdd_context_tier_runtime.py` | Helper script for Robot tests with 3 subcommands | ### Verification - `nox -s lint` — passed - `nox -s typecheck` — passed (0 errors) - `nox -s unit_tests -- features/tdd_context_tier_runtime.feature` — **3 scenarios passed** (all assertions fail as expected, `@tdd_expected_fail` inverts to CI pass) ISSUES CLOSED: #840

brent.edwards referenced this pull request

2026-03-18 20:18:50 +00:00

TDD: context tier service has no runtime promotion/demotion/eviction logic (bug #821) #840

freemo approved these changes 2026-03-19 04:56:00 +00:00

freemo left a comment

Code Review — PR #1058 `test(acms): TDD failing tests for context tier runtime logic (bug #821)`

Clean TDD test PR following the project's bug fix workflow correctly. Three scenarios (promotion on access, demotion on staleness, eviction on budget overflow) effectively prove bug #821 exists. TDD tags @tdd_expected_fail @tdd_bug @tdd_bug_821 are correctly applied.

The step definitions are thorough — the staleness scenario probes for multiple method names (enforce_staleness(), apply_tier_policy(), tick(), etc.) which clearly demonstrates the absence of any runtime enforcement method. Robot integration tests mirror the Behave scenarios for full coverage.

Approved. No issues found.

## Code Review — PR #1058 `test(acms): TDD failing tests for context tier runtime logic (bug #821)` Clean TDD test PR following the project's bug fix workflow correctly. Three scenarios (promotion on access, demotion on staleness, eviction on budget overflow) effectively prove bug #821 exists. TDD tags `@tdd_expected_fail @tdd_bug @tdd_bug_821` are correctly applied. The step definitions are thorough — the staleness scenario probes for multiple method names (`enforce_staleness()`, `apply_tier_policy()`, `tick()`, etc.) which clearly demonstrates the absence of any runtime enforcement method. Robot integration tests mirror the Behave scenarios for full coverage. **Approved.** No issues found.

freemo requested review from freemo 2026-03-19 05:16:42 +00:00

freemo requested review from hamza.khyari 2026-03-19 05:16:42 +00:00

freemo added this to the v3.4.0 milestone 2026-03-19 05:28:34 +00:00

brent.edwards force-pushed tdd/m5-context-tier-runtime from 30a8828770 to 1125e49d4f

2026-03-19 20:58:19 +00:00

Compare

brent.edwards added 1 commit 2026-03-19 22:14:58 +00:00

Merge branch 'master' into tdd/m5-context-tier-runtime

CI / lint (pull_request) Successful in 26s

Details

CI / security (pull_request) Successful in 37s

Details

CI / quality (pull_request) Successful in 43s

Details

CI / typecheck (pull_request) Successful in 1m15s

Details

CI / benchmark-publish (pull_request) Has been skipped

Details

CI / build (pull_request) Successful in 22s

Details

CI / unit_tests (pull_request) Successful in 2m17s

Details

CI / integration_tests (pull_request) Successful in 2m48s

Details

CI / docker (pull_request) Successful in 2m8s

Details

CI / e2e_tests (pull_request) Successful in 6m34s

Details

CI / coverage (pull_request) Successful in 4m47s

Details

CI / benchmark-regression (pull_request) Successful in 38m11s

Details

b5b45a6ff4

brent.edwards merged commit 202c9bfe75 into master

2026-03-19 22:24:42 +00:00

brent.edwards referenced this issue from a commit

2026-03-19 22:24:43 +00:00

test(acms): TDD failing tests for context tier runtime logic (bug #821) (#1058)

brent.edwards deleted branch tdd/m5-context-tier-runtime

2026-03-19 22:24:44 +00:00

Sign in to join this conversation.

2 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: cleveragents/cleveragents-core#1058

test(acms): TDD failing tests for context tier runtime logic (bug #821) #1058

Summary

Files Added

Verification

Code Review — PR #1058 test(acms): TDD failing tests for context tier runtime logic (bug #821)

Code Review — PR #1058 `test(acms): TDD failing tests for context tier runtime logic (bug #821)`