feat(cli): implement context show and context clear CLI commands for ACMS - Closes #9586 #9675

SUGGESTION -- HAL 9000 appears twice in the contributors list. This should be deduplicated.

features/steps/acms_context_cli_steps.py Outdated

						
				@@ -0,0 +335,4 @@

				    )

				    context.command_exit_code = 0

HAL9001 commented

SUGGESTION -- The help-flag step definitions don't actually invoke the Typer subcommand via CLI; they read function docstrings directly. This means the --help output is never tested through Typer's real help generation path.

Suggested fix: Use typer.testing.CliRunner.
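
A minimal sketch of a help-flag check routed through Typer's real help generation. The app below is a hypothetical stand-in, not the project's actual ACMS sub-app; only the `--yes` option and the docstring wording are taken from this PR.

```python
from typing import Annotated

import typer
from typer.testing import CliRunner

# Hypothetical stand-in for the real ACMS context sub-app.
app = typer.Typer()


@app.command("show")
def context_show() -> None:
    """Show context fragments and budget utilization."""


@app.command("clear")
def context_clear(
    yes: Annotated[
        bool, typer.Option("--yes", "-y", help="Skip confirmation prompt")
    ] = False,
) -> None:
    """Removes context fragments by path pattern, metadata tag, or tier."""


runner = CliRunner()
# Goes through Typer's help generation rather than reading __doc__ directly.
result = runner.invoke(app, ["clear", "--help"])
```

A step definition could then assert on `result.exit_code` and `result.output` instead of on the function's docstring.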

src/cleveragents/cli/commands/acms_context.py Outdated

						
				@@ -0,0 +112,4 @@

				        console.print(f"  Warm tier: {metrics.warm_count} fragments")

				        console.print(f"  Cold tier: {metrics.cold_count} fragments")

				        console.print(f"  Hot utilization: {utilization}")

HAL9001 commented

BUG -- Budget utilization calculation mixes fragment count with token budget. This line:
utilization = _format_budget_utilization(
    metrics.hot_count * 100, budget.max_tokens_hot or 1)
uses hot_count * 100 (a fragment count multiplied by an arbitrary factor) as a proxy for token utilization against budget.max_tokens_hot (a token limit). These are fundamentally different units -- fragment count multiplied by 100 does not represent actual token usage.

Suggested fix: Use the actual total_tokens value already computed earlier in the function, or track token usage per tier in the metrics object.
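
One way the second option (tracking token usage in the metrics object) might look. `TierMetrics`, `record_hot`, and `format_budget_utilization` are hypothetical names for illustration; the PR's real metrics class and `_format_budget_utilization` helper may differ.

```python
from dataclasses import dataclass


@dataclass
class TierMetrics:
    """Hypothetical metrics object that tracks tokens, not just fragment counts."""

    hot_count: int = 0
    hot_tokens: int = 0

    def record_hot(self, token_count: int) -> None:
        self.hot_count += 1
        self.hot_tokens += token_count


def format_budget_utilization(used_tokens: int, max_tokens: int) -> str:
    """Format tokens used against the budget; both arguments are in tokens."""
    percent = 100.0 * used_tokens / max_tokens
    return f"{used_tokens:,}/{max_tokens:,} tokens ({percent:.1f}%)"


metrics = TierMetrics()
for token_count in (1200, 350, 4000):
    metrics.record_hot(token_count)

proxy = metrics.hot_count * 100  # fragment count scaled by an arbitrary factor
actual = metrics.hot_tokens      # real token usage for the hot tier
summary = format_budget_utilization(actual, 8000)
```

With three fragments, the count-based proxy bears no relation to the token total, which is the unit mismatch described above.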

src/cleveragents/cli/commands/acms_context.py Outdated

						
				@@ -0,0 +145,4 @@

				    yes: Annotated[

				        bool, typer.Option("--yes", "-y", help="Skip confirmation prompt")

				    ] = False,

				) -> None:

HAL9001 commented

SUGGESTION -- The except typer.Exit(0) handler is overly generic and will catch any typer.Exit(0) raised anywhere in the try block, not just from user cancellation.

Suggested fix: Move the cancellation handling outside the try/except block with an early return, then remove the except typer.Exit(0) handler entirely.
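
A sketch of the suggested control flow, written without Typer so it runs standalone. The `confirm` callable stands in for `typer.confirm`, and `remove` for the real removal call; both are assumptions for illustration. In a Typer command, returning normally already yields exit code 0, so no `typer.Exit(0)` is needed for cancellation.

```python
from typing import Callable


def clear_fragments(
    fragment_ids: list[str],
    yes: bool,
    confirm: Callable[[str], bool],
    remove: Callable[[str], None],
) -> int:
    """Return the number of fragments removed (0 if the user cancels)."""
    # Cancellation is handled before the try block with a plain early return,
    # so no exit-style exception has to be caught and interpreted later.
    if not yes and not confirm(f"Remove {len(fragment_ids)} fragment(s)?"):
        return 0

    removed = 0
    try:
        for fragment_id in fragment_ids:
            remove(fragment_id)
            removed += 1
    except KeyError as exc:
        # Only genuine removal errors are handled here.
        raise RuntimeError(f"Failed to remove fragment: {exc}") from exc
    return removed
```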

src/cleveragents/cli/commands/acms_context.py Outdated

						
				@@ -0,0 +185,4 @@

				        table = Table()

				        table.add_column("Resource", style="cyan")

				        table.add_column("Tier", style="green")

				        table.add_column("Project", style="yellow")

HAL9001 commented

2026-04-30 14:08:42 +00:00

SUGGESTION -- The tier_service parameter in _remove_fragments has type object. This bypasses type safety for the entire function body.

Suggested fix: Use a Protocol with _remove_from_all and _lock fields.

HAL9001 commented

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

HAL9001 requested changes 2026-04-30 18:24:32 +00:00

Dismissed

HAL9001 left a comment

Re-Review of PR #9675 — feat(cli): implement context show and context clear CLI commands for ACMS

Prior Feedback Verification

No previous REQUEST_CHANGES reviews were found on this PR (the reviews list API returned empty). CI previously flagged the PR as failing, but there are no formal review comments whose resolution could be verified. This re-review is therefore a fresh, comprehensive evaluation.

CI Status — BLOCKING

Three required CI checks are failing:

unit_tests: Failing after 5m1s
integration_tests: Failing after 5m20s
security: Failing after 11m55s

Lint (success) and typecheck (success) are passing. Per company policy, all CI gates (lint, typecheck, security, unit_tests, coverage) must pass before a PR can be approved and merged. The CI failures need to be investigated and fixed before this PR can proceed.

Code Review Findings

BLOCKING Issues

Lock acquired per fragment in _remove_fragments — The removal loop acquires and releases the lock for EACH fragment individually (one lock operation per fragment). This is an N+1 locking pattern. If the lock is intended to protect the entire batch removal atomically, all removal operations must execute under a single lock acquisition. See inline comment on _remove_fragments.

Non-Blocking Suggestions

tier parameter should be renamed -- tier is not a Python builtin, so nothing is shadowed, but the tier parameter name in acms_context_clear is inconsistent with _filter_fragments. Suggest renaming to tier_filter for clarity and to match the parameter name in _filter_fragments.
_remove_fragments uses object type annotation — The tier_service: object type is too loose to provide any type safety. Should use a Protocol defining the expected interface (get_all_fragments, budget, _lock, _remove_from_all), or the concrete ContextTierService type.
_remove_fragments uses getattr on private attributes — Dynamically accessing _remove_from_all and _lock via getattr makes the code depend on private implementation details that may change. This pattern was used to accommodate mocking but creates fragility.
Parameter naming inconsistency — _filter_fragments accepts tier_filter but is called with a parameter named tier. This inconsistency makes the data flow harder to follow.
Commit message under-describes scope — The commit message says "wire up ACMS context sub-app to main CLI entry point" but the changes include a 305-line new source file, 355-line step definitions, 98-line mocks, 65-line feature, 310-line Robot helper, 81-line Robot test, and 177-line benchmarks. The message significantly under-sells the scope of changes.

Summary

CI is failing on 3/5 required checks (unit_tests, integration_tests, security)
Lock granularity issue in _remove_fragments needs architectural correction
Several code quality suggestions provided inline below

Recommend fixing CI failures and addressing the lock granularity issue before resubmission for review.

src/cleveragents/cli/commands/acms_context.py Outdated

						
				@@ -0,0 +150,4 @@

				    Removes context fragments by path pattern, metadata tag, or tier.

				    Requires confirmation unless --yes flag is provided.

				    """

HAL9001 commented

Suggestion: rename the tier parameter. tier is not a Python builtin, so nothing is shadowed, but the name is inconsistent with _filter_fragments, which already uses tier_filter as the parameter name. Consider renaming to tier_filter and updating the call site consistently.

src/cleveragents/cli/commands/acms_context.py Outdated

						
				@@ -0,0 +273,4 @@

				    Returns:

				        Number of fragments successfully removed.

				    """

HAL9001 commented

Suggestion: tier_service: object type provides no type safety. Consider using a Protocol:

import threading
from typing import Protocol

# TieredFragment and TierBudget are the project's existing types.
class ContextTierServiceOps(Protocol):
    def get_all_fragments(self) -> list[TieredFragment]: ...
    @property
    def budget(self) -> TierBudget: ...
    @property
    def _lock(self) -> threading.Lock | None: ...
    def _remove_from_all(self, fragment_id: str) -> None: ...

Or use the concrete type if available.

src/cleveragents/cli/commands/acms_context.py Outdated

						
				@@ -0,0 +277,4 @@

				    remove_fn = getattr(tier_service, "_remove_from_all", None)

				    lock = getattr(tier_service, "_lock", None)

				    if remove_fn is None:

				        return 0

HAL9001 commented

Suggestion: Using getattr on private attributes (_remove_from_all, _lock) makes this function depend on internal implementation details. Consider defining a Protocol or accepting a specific interface type instead. This pattern also masks potential bugs at type-check time.
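
As an illustration of the "specific interface type" option, the sketch below types the function against a small public Protocol instead of probing private names. `FragmentRemover`, `remove_fragments`, and `InMemoryTierService` are hypothetical; the real tier service may not expose a public batch method.

```python
import threading
from typing import Iterable, Protocol


class FragmentRemover(Protocol):
    """Hypothetical public interface; not part of the PR's actual service."""

    def remove_fragments(self, fragment_ids: Iterable[str]) -> int: ...


class InMemoryTierService:
    """Toy implementation that keeps its lock and storage private."""

    def __init__(self, fragment_ids: Iterable[str]) -> None:
        self._lock = threading.Lock()
        self._fragments = set(fragment_ids)

    def remove_fragments(self, fragment_ids: Iterable[str]) -> int:
        with self._lock:
            before = len(self._fragments)
            self._fragments.difference_update(fragment_ids)
            return before - len(self._fragments)


def remove_via_interface(service: FragmentRemover, fragment_ids: list[str]) -> int:
    # No getattr on private names: a type checker can verify this call.
    return service.remove_fragments(fragment_ids)


service = InMemoryTierService(["a", "b", "c"])
removed = remove_via_interface(service, ["a", "c", "missing"])
```

A test double only needs a `remove_fragments` method to satisfy the Protocol, so mocking stays simple without reaching into private attributes.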

src/cleveragents/cli/commands/acms_context.py Outdated

						
				@@ -0,0 +280,4 @@

				        return 0

				    removed = 0

				    for fragment in fragments:

HAL9001 commented

2026-04-30 18:42:41 +00:00

BLOCKING: Lock acquired per fragment. This loop acquires and releases the lock for EACH fragment individually:

for fragment in fragments:
    if lock is not None:
        with lock:
            remove_fn(fragment.fragment_id)

If the lock is intended to protect the entire batch atomically, move the lock acquisition outside the loop:

if lock is not None:
    with lock:
        for fragment in fragments:
            remove_fn(fragment.fragment_id)
else:
    for fragment in fragments:
        remove_fn(fragment.fragment_id)

Alternatively, the method could use a single batch remove call on the tier_service if the real implementation supports it.
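
The duplicated loop in the fix above can be avoided with `contextlib.nullcontext`, which acts as a no-op context manager when no lock is present. This is a sketch under the assumption that the lock is an ordinary context manager such as `threading.Lock`; `remove_batch` is a hypothetical name.

```python
import threading
from contextlib import nullcontext
from typing import Callable, ContextManager, Iterable, Optional


def remove_batch(
    fragment_ids: Iterable[str],
    remove_fn: Callable[[str], None],
    lock: Optional[ContextManager] = None,
) -> int:
    """Remove all fragments under a single lock acquisition, if a lock exists."""
    removed = 0
    # nullcontext() does nothing on enter/exit, so the loop is written once.
    with lock if lock is not None else nullcontext():
        for fragment_id in fragment_ids:
            remove_fn(fragment_id)
            removed += 1
    return removed


store = {"a", "b", "c"}
removed_with_lock = remove_batch(["a", "b"], store.remove, threading.Lock())
removed_without_lock = remove_batch(["c"], store.remove)
```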

HAL9001 commented

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

HAL9001 requested changes 2026-04-30 18:45:55 +00:00

Dismissed

HAL9001 left a comment

Review Summary

Reviewed PR #9675 (feat(cli): implement context show and context clear CLI commands for ACMS) on branch feat/v3.4.0-context-show-clear-cli for milestone v3.4.0.

CI Status: FAILING (Blocking)

Three required CI checks are failing:

CI / unit_tests (pull_request) — Failing after 5m1s
CI / integration_tests (pull_request) — Failing after 5m20s
CI / security (pull_request) — Failing after 11m55s

Per company policy, all CI gates (lint, typecheck, security, unit_tests, coverage) must pass before a PR can be approved and merged. These failures must be addressed before this PR can be merged.

Blocking Issues

1. Bug: Incorrect budget utilization calculation
In acms_context.py, the budget utilization is calculated as metrics.hot_count * 100, which multiplies the fragment count by 100 rather than summing actual token usage. This produces nonsensical utilization percentages. For example, 5 hot-tier fragments produce "500 tokens used" regardless of their actual token counts. The fix should compute sum(f.token_count for f in hot_tier_fragments) instead.

2. Missing warm/cold tier utilization
The budget utilization summary claims to show a full breakdown, but only hot tier utilization is displayed. There is no warm or cold tier utilization. Per the issue spec, users need a "budget utilization summary (tokens used vs. budget, per-tier breakdown)" — this is incomplete.

3. Non-atomic commits
The PR contains 3 tightly coupled commits for a single feature:

620ed7d1 feat(cli): implement context show and context clear CLI commands for ACMS
dfdbb721 feat(cli): implement context show and context clear CLI commands for ACMS - Closes #9586
89df021b feat(cli): wire up ACMS context sub-app to main CLI entry point

The first two commits both implement the same feature. The second only rewrites the first to add the Closes keyword. These should be squashed into a single atomic commit per the project convention.

.git Outdated

HAL9001 commented

2026-04-30 18:45:55 +00:00

SUGGESTION: The constant 10 used as the max number of entries to display in the pre-clear summary table (entries_to_remove[:10]) is a magic number. Consider defining it as a module-level constant like MAX_DISPLAY_ENTRIES = 10 for clarity.
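
A small sketch of the named-constant version. `summarize_entries` and the trailing "and N more" line are illustrative additions, not something the PR is known to contain.

```python
# Hypothetical module-level constant replacing the literal 10.
MAX_DISPLAY_ENTRIES = 10


def summarize_entries(entries_to_remove: list[str]) -> list[str]:
    """Return the pre-clear summary lines, truncated to the display limit."""
    lines = list(entries_to_remove[:MAX_DISPLAY_ENTRIES])
    hidden = len(entries_to_remove) - len(lines)
    if hidden > 0:
        # Make the truncation visible instead of silently dropping entries.
        lines.append(f"... and {hidden} more")
    return lines
```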

features/steps/acms_context_cli_steps.py Outdated

						
				@@ -0,0 +124,4 @@

				            return_value=mock_container,

				        ),

				        patch(

				            "cleveragents.cli.commands.acms_context._get_console",

HAL9001 commented

SUGGESTION: The step_run_context_clear_yes step captures the mock_typer_confirm but in step_run_context_clear_with_confirm, typer.confirm is patched with a flat return_value=True rather than being captured into the context. For consistency and better test verification, consider using the same pattern in both steps.
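
A sketch of the shared pattern, using stand-ins so it runs without Typer or behave. `fake_typer`, `run_clear`, and `step_run_clear` are hypothetical; in the real steps the patch target would be `typer.confirm` and `context` would be the behave context.

```python
from types import SimpleNamespace
from unittest.mock import patch

# Stand-in for the typer module so the sketch runs without dependencies.
fake_typer = SimpleNamespace(confirm=lambda message: False)


def run_clear(yes: bool) -> bool:
    """Toy command body: returns True when removal should proceed."""
    return yes or fake_typer.confirm("Remove fragments?")


def step_run_clear(context: SimpleNamespace, yes: bool) -> None:
    # Same pattern for both steps: capture the mock on the context so later
    # "then" steps can assert on how the prompt was used.
    with patch.object(fake_typer, "confirm", return_value=True) as mock_confirm:
        context.proceeded = run_clear(yes)
    context.mock_typer_confirm = mock_confirm


with_prompt = SimpleNamespace()
step_run_clear(with_prompt, yes=False)
with_prompt.mock_typer_confirm.assert_called_once()

skipped = SimpleNamespace()
step_run_clear(skipped, yes=True)
skipped.mock_typer_confirm.assert_not_called()
```

Capturing the mock in both steps lets the `--yes` scenario verify that no prompt was shown, and the confirm scenario verify that exactly one was.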

features/steps/acms_context_cli_steps.py Outdated

						
				@@ -0,0 +307,4 @@

				    with (

				        patch(

				            "cleveragents.cli.commands.acms_context.get_container",

HAL9001 commented

SUGGESTION: The _filter_fragments function is not tested independently: it is not a Typer command, so no CLI path invokes it directly. Consider whether it should be covered in isolation by a unit test, or whether the current integration-style test coverage is sufficient. Given that 355 lines of step definitions are dedicated to this single feature, consider consolidating where possible.
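
For reference, an isolated unit test of a filter helper could be as small as the sketch below. The `Fragment` type and this `filter_fragments` body are hypothetical reconstructions (glob matching on path plus an optional tier filter); the PR's actual `_filter_fragments` signature and matching rules may differ.

```python
from dataclasses import dataclass
from fnmatch import fnmatch
from typing import Optional


@dataclass(frozen=True)
class Fragment:
    """Hypothetical minimal fragment; the real TieredFragment has more fields."""

    path: str
    tier: str


def filter_fragments(
    fragments: list[Fragment],
    pattern: Optional[str] = None,
    tier_filter: Optional[str] = None,
) -> list[Fragment]:
    """Keep fragments matching the glob pattern and tier, when given."""
    return [
        f
        for f in fragments
        if (pattern is None or fnmatch(f.path, pattern))
        and (tier_filter is None or f.tier == tier_filter)
    ]


def test_filter_by_pattern_and_tier() -> None:
    fragments = [
        Fragment("src/a.py", "hot"),
        Fragment("src/b.py", "cold"),
        Fragment("docs/readme.md", "hot"),
    ]
    assert filter_fragments(fragments, pattern="src/*") == fragments[:2]
    assert filter_fragments(fragments, tier_filter="hot") == [fragments[0], fragments[2]]
    assert filter_fragments(fragments, "src/*", "hot") == [fragments[0]]


test_filter_by_pattern_and_tier()
```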

robot/helper_acms_context_cli.py Outdated

						
				@@ -0,0 +1,310 @@

				"""Robot Framework helper for ACMS context CLI integration tests.

HAL9001 commented

SUGGESTION: This 310-line Python helper script runs outside the normal test framework and uses sys.path manipulation to import from src/ and features/. Consider whether this could be simplified by using the proper test harness (e.g., through nox) instead of a custom runner.

src/cleveragents/cli/commands/acms_context.py Outdated

						
				@@ -0,0 +100,4 @@

				        console.print(table)

				        console.print(f"\n[bold]Total Tokens:[/bold] {total_tokens:,}")

HAL9001 commented

BLOCKER: Budget utilization calculation is incorrect here. metrics.hot_count * 100 multiplies the fragment count by 100, which is not the actual token usage. This produces meaningless utilization percentages.

Suggested fix: Compute the actual sum of token counts for hot tier fragments:

hot_tokens = sum(f.token_count for f in fragments if f.tier == ContextTier.HOT)
utilization = _format_budget_utilization(hot_tokens, budget.max_tokens_hot or 1)

Likewise, extend the utilization section to show warm and cold tier breakdowns as promised in the issue spec ("tokens used vs. budget, per-tier breakdown").
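
A sketch of what the per-tier breakdown might compute. The `Fragment` type, the `budgets` mapping, and the output wording are assumptions; only `ContextTier`, `token_count`, and the `or 1` guard come from this review, and the warm/cold budget field names are not known here.

```python
from dataclasses import dataclass
from enum import Enum


class ContextTier(Enum):
    HOT = "hot"
    WARM = "warm"
    COLD = "cold"


@dataclass
class Fragment:
    tier: ContextTier
    token_count: int


def tokens_per_tier(fragments: list[Fragment]) -> dict[ContextTier, int]:
    """Sum actual token counts for each tier."""
    totals = {tier: 0 for tier in ContextTier}
    for fragment in fragments:
        totals[fragment.tier] += fragment.token_count
    return totals


def utilization_lines(
    fragments: list[Fragment], budgets: dict[ContextTier, int]
) -> list[str]:
    """Build one 'used/budget (percent)' line per tier."""
    lines = []
    for tier, used in tokens_per_tier(fragments).items():
        budget = budgets.get(tier) or 1  # mirrors the `or 1` guard in the PR
        lines.append(
            f"{tier.value.capitalize()} utilization: "
            f"{used:,}/{budget:,} tokens ({100 * used / budget:.1f}%)"
        )
    return lines


fragments = [
    Fragment(ContextTier.HOT, 1200),
    Fragment(ContextTier.HOT, 800),
    Fragment(ContextTier.WARM, 500),
]
budgets = {ContextTier.HOT: 4000, ContextTier.WARM: 10000, ContextTier.COLD: 20000}
lines = utilization_lines(fragments, budgets)
```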

src/cleveragents/cli/main.py Outdated

HAL9001 commented