fix(cli): handle missing database in session list command #680

[M2 — MEDIUM: Stale documentation] This comment still says "expected to fail until the DI container fix lands" but the @tdd_expected_fail tags have been removed because the fix is applied. Update to describe these as regression tests for the resolved bug.


robot/tdd_session_create_di.robot Outdated

CoreRasurae commented

[M2 — MEDIUM: Stale documentation] Line 4 says "fails due to the DI container lacking a db provider" but the fix has been applied and @tdd_expected_fail tags removed. Update documentation to reflect these are now regression tests.


src/cleveragents/cli/commands/session.py Outdated

						
				@@ -46,0 +61,4 @@

				    if structlog.is_configured():

				        return

				    logging.basicConfig(

CoreRasurae commented

[M5 — MEDIUM: force=True may override application logging] logging.basicConfig(force=True) forcefully reconfigures the root logger, potentially overriding logging setup from other modules. Consider using force=False or adding a handler directly to avoid side effects.
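A minimal sketch of the "add a handler directly" option, assuming the CLI logs through a named logger. The logger name `cleveragents`, the helper name `attach_stderr_handler`, and the choice to disable propagation are all illustrative assumptions, not code from this PR:

```python
import logging
import sys


def attach_stderr_handler(logger_name: str = "cleveragents") -> logging.Logger:
    """Route one logger's output to stderr without touching the root logger."""
    logger = logging.getLogger(logger_name)
    # Avoid stacking duplicate handlers if this helper runs more than once.
    already_attached = any(
        isinstance(h, logging.StreamHandler) and getattr(h, "stream", None) is sys.stderr
        for h in logger.handlers
    )
    if not already_attached:
        handler = logging.StreamHandler(sys.stderr)
        handler.setFormatter(logging.Formatter("%(levelname)s %(name)s: %(message)s"))
        logger.addHandler(handler)
    # Assumption: CLI records should not also reach root handlers,
    # which may be writing to stdout.
    logger.propagate = False
    return logger
```

Because only the named logger is modified, logging configured by other modules on the root logger is left as it was.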


CoreRasurae commented

[H3 — HIGH: No error handling] list_sessions() is the only session subcommand without a try/except block. If the database is inaccessible or the DI container fails to resolve, this will crash with an unhandled exception instead of a clean error message. All other session subcommands (create, show, delete, tell) wrap their logic in try/except.


src/cleveragents/cli/commands/session.py Outdated

						
				@@ -181,3 +219,2 @@

				    if not sessions:

				        console.print("[yellow]No sessions found.[/yellow]")

				        console.print("Create one with 'agents session create'")

				        if fmt not in (OutputFormat.RICH.value, "plain"):

CoreRasurae commented

[H2 — HIGH: color format excluded from human-friendly path] This condition doesn't include OutputFormat.COLOR.value. The color format should show "No sessions found." (same layout as plain, per spec), but instead it falls through to format_output(empty_data, "color") which produces structured output like sessions:\ntotal: 0.

Suggested fix:

if fmt not in (OutputFormat.RICH.value, OutputFormat.PLAIN.value, OutputFormat.COLOR.value):
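To make the intent of the condition easier to audit, the human-friendly formats could be collected in one set. This is only a sketch: the `OutputFormat` enum below is a stand-in whose members are assumed from the names used in this thread, and `render_empty_list` is a hypothetical helper, not a function in the codebase:

```python
from enum import Enum


class OutputFormat(str, Enum):
    # Hypothetical stand-in; the real enum may define other members.
    RICH = "rich"
    PLAIN = "plain"
    COLOR = "color"
    JSON = "json"
    YAML = "yaml"


# Formats that should print the friendly "No sessions found." message.
HUMAN_FORMATS = frozenset(
    {OutputFormat.RICH.value, OutputFormat.PLAIN.value, OutputFormat.COLOR.value}
)


def render_empty_list(fmt: str) -> str:
    """Return what an empty session list should print for the given format."""
    if fmt in HUMAN_FORMATS:
        return "No sessions found."
    # Placeholder for the structured path (format_output in the real code).
    return f"<structured output for {fmt}>"
```

With a named set, adding or removing a human-friendly format is a one-line change in a single place.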


src/cleveragents/infrastructure/database/repositories.py Outdated

CoreRasurae commented

[C1 — CRITICAL: Silent data loss] delete() only calls flush() without commit(). Since _get_session_service() bypasses UnitOfWork and the session-factory creates a new SQLAlchemy session per method call, this transaction is never committed. The deletion is silently rolled back when the session is garbage-collected.

The same issue affects update() and SessionMessageRepository.append(). Only create() was patched with commit() in this PR.

This means agents session delete will report success but the session remains, and agents session tell will display the response but messages are never persisted.
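The failure mode can be reproduced in miniature with the standard-library `sqlite3` module. This is an analogy rather than the SQLAlchemy code path under review: an uncommitted `DELETE` on one connection is discarded when that connection closes, so a fresh connection still sees the row. The table name and helper are made up for the demonstration:

```python
import os
import sqlite3
import tempfile


def rows_left_after_delete(commit: bool) -> int:
    """Delete the only row, optionally commit, then count rows from a new connection."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "demo.db")

        setup = sqlite3.connect(path)
        setup.execute("CREATE TABLE sessions (id TEXT PRIMARY KEY)")
        setup.execute("INSERT INTO sessions (id) VALUES ('s1')")
        setup.commit()
        setup.close()

        writer = sqlite3.connect(path)
        writer.execute("DELETE FROM sessions WHERE id = 's1'")
        if commit:
            writer.commit()
        writer.close()  # an open transaction is discarded here

        reader = sqlite3.connect(path)
        (count,) = reader.execute("SELECT COUNT(*) FROM sessions").fetchone()
        reader.close()
        return count
```

Calling `rows_left_after_delete(False)` leaves the row in place, which mirrors a `delete` command that reports success while the session survives.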


src/cleveragents/infrastructure/database/repositories.py Outdated

						
				@@ -3899,6 +3903,7 @@ class SessionRepository:

				            db_model = SessionModel.from_domain(session)

				            db_session.add(db_model)

				            db_session.flush()

				            db_session.commit()

CoreRasurae commented

2026-03-11 19:19:39 +00:00

[H1 — HIGH: Contract violation] The class docstring (lines 3872-3873) states: "All mutating methods flush (but do NOT commit); the caller or a UnitOfWork wrapper is responsible for committing the transaction."

Adding commit() here breaks this contract. Either all mutating methods should commit (and the docstring should be updated), or the CLI should be changed to use UnitOfWork for transaction management.


freemo added a new dependency 2026-03-11 18:30:42 +00:00

#683 TDD: session list missing database handling (bug #680)

brent.edwards force-pushed fix/m3-session-list-error from d2ccd5a539 to a58cfd7b6b

2026-03-11 19:18:51 +00:00


brent.edwards commented

Review Response — Review #2139

Thank you for the thorough review, @CoreRasurae. I've addressed the findings as follows:

Fixed in this push

| ID | Severity | Summary | Action |
|----|----------|---------|--------|
| C1 | Critical | Silent data loss in `delete()`/`update()`/`append()` — `flush()` without `commit()` | Added `db_session.commit()` after `flush()` in all three methods |
| H1 | High | Class docstrings claim "flush but do NOT commit" | Updated `SessionRepository` and `SessionMessageRepository` docstrings to document the commit-per-method contract |
| H2 | High | Empty-list format branch excludes `color` format | Changed condition to include `OutputFormat.COLOR.value` and `OutputFormat.PLAIN.value` (using enum values consistently) |
| H3 | High | No error handling in `list_sessions()` | Wrapped service calls in `try/except Exception` matching the pattern used by other subcommands |
| M2 | Medium | Stale "expected to fail" documentation in 7+ files | Updated all feature descriptions, robot docs, and step definition docstrings to describe these as regression tests for the resolved bug |
| M4 | Medium | Engine leak in `session_create_error_steps.py` | Wrapped `create_all(engine)` in `try/finally` with `engine.dispose()`, matching the pattern in `session_list_error_steps.py` |
| M5 | Medium | `_ensure_cli_logging()` uses `force=True` and `cache_logger_on_first_use=False` | Changed to `force=False` and `cache_logger_on_first_use=True` |

Deferred to follow-up issues

| ID | Severity | Summary | Rationale |
|----|----------|---------|-----------|
| M1 | Medium | Engine proliferation — multiple `_build_*` functions create separate engines | Architectural concern outside scope of this bug fix |
| M3 | Medium | No tests for delete/tell/show through real DI path | Additional coverage; not a regression of this fix |
| L1 | Low | No test for `--format table` or `--format color` on session list | Additional coverage |
| L2 | Low | Missing `session_create_bench.py` per #570 acceptance criteria | Benchmark file; not blocking |
| L3 | Low | `Settings._instance = None` directly mutates private attribute | Pre-existing pattern; out of scope |
| L4 | Low | Spec compliance gaps in output structure | Pre-existing; out of scope |

CI results (all green)

lint: All checks passed
typecheck: 0 errors (1 pre-existing warning)
unit_tests: 10,422 scenarios passed, 0 failed
integration_tests: 1,447 tests passed, 0 failed
coverage: 98% (threshold: 97%)


CoreRasurae requested changes 2026-03-11 19:52:41 +00:00

CoreRasurae left a comment

Code Review Report -- PR #680 `fix(cli): handle missing database in session list command`

Branch: fix/m3-session-list-error
Issues: #554 (agents session list DI error), #570 (agents session create DI error)
Reviewer: Automated Code Review (3 full review cycles across all categories)
Reviewed commits by: Brent Edwards (key commit: a58cfd7b)

Executive Summary

The DI container fix correctly addresses the root cause of both #554 and #570 by adding a db Singleton provider to the Container class. The test coverage is thorough with Behave, Robot, and ASV suites. However, the review identified 25 findings across 6 categories, including 4 high-severity issues that should be addressed before merge.

Findings by Severity

CRITICAL / HIGH (4 findings -- must fix before merge)

H1. `_get_session_service()` never caches the built service [Bug]

File: src/cleveragents/cli/commands/session.py:82-107
Severity: HIGH

The function declares global _service and checks the cache, but never assigns the built service back to _service:

def _get_session_service() -> SessionService:
    global _service
    if _service is not None:
        return _service
    # ... builds service ...
    return PersistentSessionService(session_repo, message_repo)  # never assigned to _service!

This means:

Every CLI invocation re-resolves the full DI chain (container, sessionmaker, repos, service)
The _reset_session_service() function and the _service global are dead code in production
In the interactive REPL (commit f48eb5a0), repeated session commands re-create everything each time

Fix: Add _service = PersistentSessionService(session_repo, message_repo) before the return.

H2. SQLAlchemy sessions are never closed -- resource leak [Bug]

File: src/cleveragents/infrastructure/database/repositories.py:3904, 3925, 3946, 3966, 3996, 4068, 4099, 4126
Severity: HIGH

Every method in SessionRepository and SessionMessageRepository calls self._session() to get a new SQLAlchemy session but never closes it:

db_session = self._session()
try:
    # ... operations ...
    db_session.commit()
    return session
except ...:
    db_session.rollback()
    raise
# No finally: db_session.close()

There are 8 methods across both repositories that leak sessions. With SQLite, this keeps the database file locked and accumulates unclosed connections. Over many operations, this will exhaust resources.

Fix: Add finally: db_session.close() blocks to all methods, or use a context manager pattern.
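One way to express the context manager variant is a small `session_scope` helper. This is a sketch under assumptions: `FakeSession` is a stand-in that only records calls, and `session_scope` is a hypothetical name, though the commit/rollback/close methods match what a SQLAlchemy session exposes:

```python
from contextlib import contextmanager
from typing import Callable, Iterator, List


class FakeSession:
    """Hypothetical stand-in that records lifecycle calls."""

    def __init__(self) -> None:
        self.events: List[str] = []

    def commit(self) -> None:
        self.events.append("commit")

    def rollback(self) -> None:
        self.events.append("rollback")

    def close(self) -> None:
        self.events.append("close")


@contextmanager
def session_scope(factory: Callable[[], FakeSession]) -> Iterator[FakeSession]:
    """Commit on success, roll back on error, and always close the session."""
    db_session = factory()
    try:
        yield db_session
        db_session.commit()
    except Exception:
        db_session.rollback()
        raise
    finally:
        db_session.close()
```

Each repository method body would then sit inside `with session_scope(self._session) as db_session:`, so the close cannot be forgotten in any of the 8 methods.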

H3. Commit-per-method breaks the Unit of Work pattern [Architecture]

File: src/cleveragents/infrastructure/database/repositories.py:3872-3876
Severity: HIGH

The change from "flush-only, caller commits" to "flush-and-commit per method" breaks the UoW contract that every other repository in this file follows. The commit message acknowledges this: "the CLI's _get_session_service() bypasses UnitOfWork".

Consequences:

If any future consumer uses SessionRepository within a UnitOfWork, auto-commits will prematurely commit partial transactions
All 20+ other repository classes in the file follow flush-only; this creates an inconsistency that will confuse developers
Atomic multi-step operations involving sessions become impossible

Recommendation: Instead of changing the repository contract, make the CLI use the UnitOfWork properly, or create a dedicated CLISessionRepository subclass that commits. Alternatively, add an auto_commit flag to the repository constructor defaulting to False.
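The `auto_commit` alternative might look roughly like this. The constructor signature, the `RecordingSession` stand-in, and the simplified `create()` body are assumptions for illustration; only the flag name and its `False` default come from the recommendation:

```python
from typing import Callable, List


class RecordingSession:
    """Hypothetical stand-in that records which session methods were called."""

    def __init__(self) -> None:
        self.calls: List[str] = []

    def add(self, obj: object) -> None:
        self.calls.append("add")

    def flush(self) -> None:
        self.calls.append("flush")

    def commit(self) -> None:
        self.calls.append("commit")


class SessionRepository:
    """Flush-only by default; commits per method only when explicitly asked."""

    def __init__(
        self,
        session_factory: Callable[[], RecordingSession],
        auto_commit: bool = False,
    ) -> None:
        self._session = session_factory
        self._auto_commit = auto_commit

    def create(self, obj: object) -> object:
        db_session = self._session()
        db_session.add(obj)
        db_session.flush()
        if self._auto_commit:
            # Opt-in path for callers (such as the CLI) with no UnitOfWork.
            db_session.commit()
        return obj
```

The default keeps the flush-only contract that the other repositories follow, while the CLI could pass `auto_commit=True` at its single construction site.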

H4. `SessionService` built outside DI container, inconsistent with codebase [Architecture]

File: src/cleveragents/cli/commands/session.py:103-107
Severity: HIGH

The session service is manually wired outside the DI container:

container = get_container()
db = container.db()
session_repo = SessionRepository(db)
message_repo = SessionMessageRepository(db)
return PersistentSessionService(session_repo, message_repo)

Every other service (ProjectService, ActorService, PlanService, etc.) is registered as a proper container provider. The session service should follow the same pattern with a providers.Factory(PersistentSessionService, ...) registration in Container.

This would:

Make the DI container the single source of truth
Enable proper testing via provider.override() instead of private _service manipulation
Eliminate the need for H1's caching fix

MEDIUM (8 findings -- should fix)

M1. `list_sessions` catches bare `Exception` [Bug]

File: src/cleveragents/cli/commands/session.py:219

except Exception as exc:
    console.print(f"[red]Error:[/red] {exc}")
    raise typer.Exit(1) from exc

Catching Exception is overly broad. It could mask unexpected bugs in the service layer (e.g., TypeError from wrong arguments, ImportError from missing deps). Should catch specific expected exceptions (DatabaseError, OperationalError).
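A sketch of the narrower handler, using the standard-library `sqlite3` exception classes as stand-ins for whichever database exception types the service layer actually raises. `run_list_command` and its `fetch` callable are hypothetical, and the function returns an exit code instead of raising `typer.Exit` so that it stays self-contained:

```python
import sqlite3
import sys
from typing import Callable, Sequence


def run_list_command(fetch: Callable[[], Sequence[object]]) -> int:
    """Return a process exit code; only expected database errors are handled."""
    try:
        sessions = fetch()
    except sqlite3.DatabaseError as exc:
        # sqlite3.OperationalError is a subclass, so it is covered as well.
        print(f"Error: {exc}", file=sys.stderr)
        return 1
    print(f"{len(sessions)} session(s)")
    return 0
```

A `TypeError` or `ImportError` raised inside `fetch` is not caught here, so programming errors still surface with a traceback.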

M2. `create` command missing general error handling [Bug]

File: src/cleveragents/cli/commands/session.py:176-197

The create command only catches SessionNotFoundError, but the same DI/DB errors that affected list also affect create. A DatabaseError during creation (disk full, permission denied) would show an ugly traceback instead of a friendly CLI error.

M3. `Base.metadata.create_all()` bypasses migration system [Bug]

File: src/cleveragents/application/container.py:304

_build_db_session_factory() calls Base.metadata.create_all(engine) but other container builders do not. This:

Bypasses Alembic migrations, potentially creating tables with outdated schemas
Creates inconsistency: session tables auto-create, other tables don't
Could cause subtle schema mismatches in production databases that use migrations

M4. Inconsistent error handling between `create` and `list` commands [Spec Compliance]

File: src/cleveragents/cli/commands/session.py

list_sessions has a broad except Exception handler (M1), but create only catches SessionNotFoundError. Both commands call _get_session_service() which can fail the same ways. The error handling strategy should be uniform across all session subcommands.

M5. No negative/error path tests for `create` command [Test Coverage]

File: features/session_create_error.feature

All 4 scenarios test happy paths. There are no tests for:

Database inaccessible/corrupted during create
Disk full during create
Concurrent create race conditions

Compare with session_list_error.feature which at least tests empty-list edge cases.

M6. Tests rely on private `_service` attribute manipulation [Test Flaw]

Files: features/steps/session_list_error_steps.py:62,91, features/steps/session_create_error_steps.py:62,88, features/steps/tdd_session_shared_steps.py:37, robot/helper_tdd_session_di_common.py:33,44

The test files listed above directly access session_mod._service (a private module attribute) for setup/teardown. While documented in docstrings, this creates fragile coupling. If H4 is implemented (proper DI registration), these tests can use provider.override() instead.

M7. `Settings._instance = None` direct manipulation in tests [Test Flaw]

Files: features/steps/tdd_session_shared_steps.py:52, robot/helper_tdd_session_di_common.py:48

Tests directly reset a private class attribute of Settings. This indicates Settings lacks a public reset() API, forcing tests to reach into internals.
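A public reset hook could be as small as the following. The `instance()` accessor is an assumed name for however `Settings` exposes its singleton; only `_instance` and the idea of a `reset()` API come from this finding:

```python
from typing import Optional


class Settings:
    """Hypothetical singleton sketch with a public reset hook for tests."""

    _instance: Optional["Settings"] = None

    @classmethod
    def instance(cls) -> "Settings":
        # Lazily build the shared instance on first access.
        if cls._instance is None:
            cls._instance = cls()
        return cls._instance

    @classmethod
    def reset(cls) -> None:
        """Discard the cached instance so the next access rebuilds it."""
        cls._instance = None
```

Tests would then call `Settings.reset()` in teardown instead of assigning to `Settings._instance`.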

M8. Inconsistent schema creation across container builders [Architecture]

File: src/cleveragents/application/container.py

_build_db_session_factory() calls Base.metadata.create_all() but the 6 other _build_* functions do not. This means session operations work on a fresh DB, but other services may fail without explicit migration. The schema creation strategy should be consistent.

LOW (13 findings -- consider for future improvement)

L1. `_ensure_cli_logging()` is session-module-specific but should be global [Architecture]

File: src/cleveragents/cli/commands/session.py:46-79

The structlog-to-stderr fix should be in the CLI entrypoint, not just the session module. Other CLI commands likely have the same stdout contamination issue.

L2. Redundant `flush()` before `commit()` [Code Quality]

File: src/cleveragents/infrastructure/database/repositories.py:3908-3909, 3974-3975, 4026-4027, 4073-4074

commit() implies a flush. The explicit flush() calls before commit() are redundant.

L3. Missing `table` output format test [Test Coverage]

File: features/session_list_error.feature

The spec lists table as a valid format. Tests cover rich, json, yaml, plain but not table.

L4. Missing `color` output format test [Test Coverage]

File: features/session_list_error.feature

The spec lists color as a valid format. The code checks OutputFormat.COLOR.value in the empty-list branch but no test exercises this path.

L5. No DI regression tests for `show`, `delete`, `tell`, `export`, `import` [Test Coverage]

All session commands use _get_session_service(), but only list and create have DI regression tests. The other 5 commands could regress if the DI wiring changes.

L6. Double cleanup in `session_create_error_steps.py` [Test Flaw]

File: features/steps/session_create_error_steps.py:102-104

_reset_session_service() sets _service = None, then line 104 overwrites with the original value. The reset call is redundant.

L7. Pre-populated session test double-commits [Test Flaw]

File: features/steps/session_list_error_steps.py:136

After svc.create() (which now commits internally per H3's change), the code also calls factory().commit(). This redundant commit masks whether the repository's own commit works correctly.

L8. Redundant scenarios in `session_list_error.feature` [Test Flaw]

File: features/session_list_error.feature:13-24

Scenarios "Session list returns empty list when no sessions exist" and "Session list after init does not raise DI error" test nearly identical paths. The second adds no value beyond the first.

L9. `_ensure_cli_logging` `force=False` is fragile [Code Quality]

File: src/cleveragents/cli/commands/session.py:68

force=False is already the default for logging.basicConfig(). If the root logger already has handlers (for example, because another library called basicConfig first), this call becomes a no-op and structlog may still write to stdout.
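The no-op behaviour is easy to confirm with the standard library alone. The helper below is a throwaway demonstration, not project code; it temporarily installs one handler on the root logger, calls `basicConfig` without `force`, and restores the original handlers afterwards:

```python
import io
import logging


def basic_config_is_noop_when_configured() -> bool:
    """Return True if basicConfig() leaves an already-configured root logger alone."""
    root = logging.getLogger()
    saved = root.handlers[:]
    try:
        existing = logging.StreamHandler(io.StringIO())
        root.handlers = [existing]
        # force defaults to False, so this call should change nothing.
        logging.basicConfig(stream=io.StringIO())
        return root.handlers == [existing]
    finally:
        # Put the original handlers back so the demo has no side effects.
        root.handlers = saved
```

This is why relying on `basicConfig` alone cannot guarantee that log output ends up on stderr.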

L10. Raw exception details exposed in error output [Security]

File: src/cleveragents/cli/commands/session.py:220

console.print(f"[red]Error:[/red] {exc}") could expose file paths, SQL statements, or internal state. Acceptable for CLI-only usage but a concern if server mode shares this code path.

L11. Global mutable state pattern (`_service`) [Architecture]

File: src/cleveragents/cli/commands/session.py:43

Module-level mutable _service is technical debt. Would be resolved by H4 (proper DI registration).

L12. `create_all` adds latency to first access [Performance]

File: src/cleveragents/application/container.py:304

Base.metadata.create_all(engine) checks the database for existing tables, and creates any that are missing, the first time the Singleton is resolved. With the Singleton pattern this happens once per process, but it adds startup latency.

L13. Temp file cleanup with `ignore_errors=True` [Resource Leak]

File: features/steps/session_list_error_steps.py:104

shutil.rmtree(..., ignore_errors=True) silently ignores failures. If SQLAlchemy sessions leak (H2), database files stay locked and temp directories persist across test runs.

Summary Table

| Severity | Count | Categories |
|----------|-------|------------|
| HIGH | 4 | Bug (2), Architecture (2) |
| MEDIUM | 8 | Bug (3), Architecture (1), Spec Compliance (1), Test Coverage (1), Test Flaw (2) |
| LOW | 13 | Architecture (2), Code Quality (2), Test Coverage (3), Test Flaw (3), Security (1), Performance (1), Resource Leak (1) |
| Total | 25 | |

Recommendation

Request Changes. The 4 HIGH findings (H1-H4) represent real bugs and architectural regressions that should be resolved before merge. H1 (missing cache assignment) is a one-line fix. H2 (session leak) requires adding finally: close() blocks. H3 and H4 are more structural but could be addressed incrementally -- at minimum, H3 should document the UoW bypass as a known architectural decision (ADR) rather than silently changing the repository contract.


src/cleveragents/application/container.py

						
				@@ -284,0 +301,4 @@

				    from cleveragents.infrastructure.database.models import Base

				    engine = create_engine(database_url, echo=False)

				    Base.metadata.create_all(engine)

CoreRasurae commented

**M3/M8 (MEDIUM):** `_build_db_session_factory()` calls `Base.metadata.create_all(engine)` but none of the other 6 `_build_*` functions do. This creates an inconsistency: session tables auto-create on a fresh DB, but other services won't work without explicit migration. Additionally, `create_all` bypasses Alembic migrations, potentially creating tables with outdated schemas in production databases.

						
				@@ -43,6 +43,42 @@ _FORMAT_HELP = "Output format: json, yaml, plain, table, or rich (default: rich)

				_service: SessionService | None = None

				def _ensure_cli_logging() -> None:

CoreRasurae commented

**L1 (LOW - Architecture):** This logging fix addresses a real stdout contamination issue, but it's scoped only to the session module. If other CLI command modules have the same problem, they'll need their own copies. Consider moving this to the CLI entrypoint (e.g., the main Typer app callback) so all commands benefit.

CoreRasurae commented

**H4 (HIGH - Architecture):** The session service is manually wired outside the DI container. Every other service (`ProjectService`, `ActorService`, `PlanService`, etc.) is registered as a proper container provider. This should follow the same pattern with a `providers.Factory(PersistentSessionService, ...)` registration in `Container`. This would also eliminate the need for the private `_service` global and make testing cleaner via `provider.override()`.

CoreRasurae commented

**H1 (HIGH - Bug):** `_get_session_service()` declares `global _service` and checks it, but never assigns the newly built service to `_service`. This means the service is rebuilt on every call. Add `_service = PersistentSessionService(session_repo, message_repo)` before the return statement.

CoreRasurae commented

**M2 (MEDIUM - Bug):** The `create` command only catches `SessionNotFoundError`, but the same DI/DB errors that can affect `list` can also affect `create`. A `DatabaseError` during creation would show an ugly traceback instead of a friendly CLI error. Consider adding equivalent error handling.
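
One way to get equivalent handling on every subcommand is a shared decorator, sketched below under stated assumptions: `sqlite3.DatabaseError` stands in for whichever database exception types the service layer actually raises, `CliExit` is a hypothetical substitute for the CLI framework's exit exception, and the two command bodies only simulate failures.

```python
import functools
import sqlite3
import sys
from typing import Any, Callable

# Assumption: stand-in for the database errors the real service raises.
EXPECTED_ERRORS = (sqlite3.DatabaseError,)


class CliExit(Exception):
    """Hypothetical stand-in for the CLI framework's exit exception."""

    def __init__(self, code: int) -> None:
        super().__init__(code)
        self.code = code


def handle_db_errors(func: Callable[..., Any]) -> Callable[..., Any]:
    """Turn expected database errors into a clean message and exit code."""

    @functools.wraps(func)
    def wrapper(*args: Any, **kwargs: Any) -> Any:
        try:
            return func(*args, **kwargs)
        except EXPECTED_ERRORS as exc:
            # Only the expected error types are translated; anything
            # else (TypeError, ImportError, ...) still propagates.
            print(f"Error: {exc}", file=sys.stderr)
            raise CliExit(1) from exc

    return wrapper


@handle_db_errors
def create_session(name: str) -> str:
    # Simulated failure: the database file cannot be opened.
    raise sqlite3.OperationalError("unable to open database file")


@handle_db_errors
def list_sessions() -> list:
    # Simulated programming error: deliberately not caught.
    raise TypeError("unexpected bug in service layer")
```

Applying the same decorator to `create`, `list`, and the remaining subcommands would keep the handling uniform while leaving genuine bugs visible as tracebacks.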

						
				@@ -180,0 +216,4 @@

				    try:

				        service = _get_session_service()

				        sessions = service.list()

				    except Exception as exc:

CoreRasurae commented

**M1 (MEDIUM - Bug):** Catching bare `Exception` is overly broad. This could mask unexpected bugs (`TypeError`, `ImportError`, etc.) in the service layer. Consider catching specific expected exceptions such as `DatabaseError` and `OperationalError`.

src/cleveragents/infrastructure/database/repositories.py

CoreRasurae commented

**H2 (HIGH - Bug / Resource Leak):** This method (and all 7 other methods across `SessionRepository` and `SessionMessageRepository`) creates a new SQLAlchemy session via `self._session()` but **never closes it**. There is no `finally: db_session.close()` block. This leaks database connections and keeps SQLite files locked. Add `finally: db_session.close()` to every method.
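
The shape of the fix, sketched with a stand-in session so it runs without a database. `FakeSession`, `query_all()`, and `list_all()` are hypothetical names, and the repository below is a reduced illustration, not the actual `SessionRepository`:

```python
from typing import Callable, List


class FakeSession:
    """Hypothetical stand-in for a SQLAlchemy session; records close()."""

    def __init__(self) -> None:
        self.closed = False

    def query_all(self) -> List[str]:
        # Simulate a query that fails partway through.
        raise RuntimeError("simulated database failure")

    def close(self) -> None:
        self.closed = True


class SessionRepository:
    """Reduced illustration of a repository that owns its sessions."""

    def __init__(self, session_factory: Callable[[], FakeSession]) -> None:
        self._session = session_factory

    def list_all(self) -> List[str]:
        db_session = self._session()
        try:
            return db_session.query_all()
        finally:
            # Runs on success and on error, so the session is always
            # released and the SQLite file does not stay locked.
            db_session.close()
```

If `self._session` is a SQLAlchemy 1.4+ `sessionmaker`, using the session as a context manager (`with self._session() as db_session:`) should have the same effect, since the session closes when the block exits.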

src/cleveragents/infrastructure/database/repositories.py

						
				@@ -3871,3 +3871,2 @@

				    All mutating methods flush (but do NOT commit); the caller or a

				    ``UnitOfWork`` wrapper is responsible for committing the transaction.

				    All mutating methods flush **and commit** within the same call.  This

CoreRasurae commented