feat(cli): final CLI polish and UX consistency pass
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 24s
CI / lint (pull_request) Successful in 3m20s
CI / typecheck (pull_request) Successful in 4m4s
CI / quality (pull_request) Successful in 4m3s
CI / security (pull_request) Successful in 4m38s
CI / integration_tests (pull_request) Successful in 6m56s
CI / unit_tests (pull_request) Successful in 6m58s
CI / docker (pull_request) Successful in 1m13s
CI / e2e_tests (pull_request) Successful in 9m19s
CI / coverage (pull_request) Successful in 12m14s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Failing after 18m11s
Points
5
Type
Task
feat(sandbox): implement overlay filesystem sandbox strategy
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 24s
CI / lint (pull_request) Successful in 3m21s
CI / typecheck (pull_request) Successful in 3m56s
CI / quality (pull_request) Successful in 4m15s
CI / security (pull_request) Successful in 4m37s
CI / e2e_tests (pull_request) Successful in 8m47s
CI / integration_tests (pull_request) Successful in 9m1s
CI / unit_tests (pull_request) Successful in 9m16s
CI / docker (pull_request) Successful in 1m13s
CI / coverage (pull_request) Successful in 11m14s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Failing after 29m0s
MoSCoW
Could have
Points
8
Type
Feature
feat(perf): large project scaling tests
All checks were successful
CI / build (pull_request) Successful in 28s
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 3m56s
CI / quality (pull_request) Successful in 4m15s
CI / typecheck (pull_request) Successful in 4m30s
CI / security (pull_request) Successful in 4m44s
CI / integration_tests (pull_request) Successful in 7m23s
CI / e2e_tests (pull_request) Successful in 8m30s
CI / unit_tests (pull_request) Successful in 8m49s
CI / docker (pull_request) Successful in 1m9s
CI / coverage (pull_request) Successful in 11m19s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 47m35s
Points
8
Type
Testing
feat(ci): CI/CD pipeline definitions
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 17s
CI / lint (pull_request) Successful in 3m34s
CI / quality (pull_request) Successful in 3m54s
CI / typecheck (pull_request) Successful in 4m10s
CI / security (pull_request) Successful in 4m12s
CI / integration_tests (pull_request) Successful in 6m46s
CI / unit_tests (pull_request) Successful in 7m30s
CI / e2e_tests (pull_request) Successful in 8m31s
CI / docker (pull_request) Successful in 1m1s
CI / coverage (pull_request) Successful in 9m41s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Failing after 17m14s
Points
8
Type
Task
feat(cli): repo indexing CLI functional
Some checks failed
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 28s
CI / typecheck (pull_request) Successful in 1m13s
CI / quality (pull_request) Successful in 48s
CI / security (pull_request) Successful in 49s
CI / build (pull_request) Successful in 20s
CI / integration_tests (pull_request) Successful in 5m31s
CI / unit_tests (pull_request) Successful in 5m42s
CI / docker (pull_request) Successful in 1m5s
CI / coverage (pull_request) Successful in 7m50s
CI / e2e_tests (pull_request) Successful in 8m6s
CI / benchmark-regression (pull_request) Failing after 22m0s
Points
5
Type
Feature
feat(tool): add tool-level execution environment preferences
All checks were successful
CI / lint (pull_request) Successful in 36s
CI / benchmark-publish (pull_request) Has been skipped
CI / typecheck (pull_request) Successful in 1m0s
CI / quality (pull_request) Successful in 43s
CI / build (pull_request) Successful in 23s
CI / security (pull_request) Successful in 1m9s
CI / integration_tests (pull_request) Successful in 4m0s
CI / unit_tests (pull_request) Successful in 4m1s
CI / docker (pull_request) Successful in 1m15s
CI / e2e_tests (pull_request) Successful in 7m5s
CI / coverage (pull_request) Successful in 7m18s
CI / benchmark-regression (pull_request) Successful in 36m55s
Points
5
Type
Feature
feat(tool): implement BuiltinAdapter class and MCP automatic resource slot creation
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 31s
CI / lint (pull_request) Successful in 4m23s
CI / typecheck (pull_request) Successful in 4m48s
CI / quality (pull_request) Successful in 4m47s
CI / security (pull_request) Successful in 5m1s
CI / integration_tests (pull_request) Successful in 8m55s
CI / e2e_tests (pull_request) Successful in 9m15s
CI / unit_tests (pull_request) Successful in 9m58s
CI / docker (pull_request) Successful in 1m8s
CI / coverage (pull_request) Successful in 11m12s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 57m48s
Points
5
Type
Feature
feat(actor): implement estimation actor type
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 26s
CI / lint (pull_request) Successful in 3m21s
CI / quality (pull_request) Successful in 3m44s
CI / typecheck (pull_request) Successful in 3m56s
CI / security (pull_request) Successful in 4m4s
CI / e2e_tests (pull_request) Successful in 4m51s
CI / unit_tests (pull_request) Successful in 6m5s
CI / integration_tests (pull_request) Successful in 6m44s
CI / docker (pull_request) Successful in 57s
CI / coverage (pull_request) Successful in 11m3s
CI / status-check (pull_request) Successful in 1s
CI / benchmark-regression (pull_request) Successful in 36m36s
Points
8
Type
Feature
test: add TDD bug-capture test for #932 — plan apply missing --yes flag
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 28s
CI / build (pull_request) Successful in 27s
CI / quality (pull_request) Successful in 45s
CI / typecheck (pull_request) Successful in 51s
CI / security (pull_request) Successful in 53s
CI / unit_tests (pull_request) Successful in 3m43s
CI / docker (pull_request) Successful in 1m1s
CI / integration_tests (pull_request) Successful in 5m11s
CI / e2e_tests (pull_request) Successful in 6m24s
CI / coverage (pull_request) Successful in 6m33s
CI / benchmark-regression (pull_request) Successful in 36m10s
MoSCoW
Must have
Points
2
Priority
Critical
Type
Testing
test(plan): TDD failing tests for subplan spawn orchestration (bug #823)
All checks were successful
CI / lint (pull_request) Successful in 1m2s
CI / quality (pull_request) Successful in 39s
CI / typecheck (pull_request) Successful in 2m21s
CI / benchmark-publish (pull_request) Has been skipped
CI / security (pull_request) Successful in 2m36s
CI / build (pull_request) Successful in 21s
CI / e2e_tests (pull_request) Successful in 2m6s
CI / integration_tests (pull_request) Successful in 3m57s
CI / unit_tests (pull_request) Successful in 4m5s
CI / docker (pull_request) Successful in 58s
CI / coverage (pull_request) Successful in 6m37s
CI / benchmark-regression (pull_request) Successful in 37m33s
MoSCoW
Must have
Points
2
Priority
Critical
State
In Review
Type
Testing
test(plan): TDD failing tests for checkpoint real rollback (bug #822)
All checks were successful
CI / lint (pull_request) Successful in 40s
CI / typecheck (pull_request) Successful in 43s
CI / security (pull_request) Successful in 52s
CI / quality (pull_request) Successful in 26s
CI / unit_tests (pull_request) Successful in 5m20s
CI / integration_tests (pull_request) Successful in 5m54s
CI / benchmark-publish (pull_request) Has been skipped
CI / build (pull_request) Successful in 15s
CI / e2e_tests (pull_request) Successful in 1m34s
CI / docker (pull_request) Successful in 1m0s
CI / coverage (pull_request) Successful in 6m12s
CI / benchmark-regression (pull_request) Successful in 37m16s
MoSCoW
Must have
Points
2
Priority
Critical
State
In Review
Type
Testing
fix(test): convert M1-M6 E2E suites to real subprocess CLI invocations (closes #658)
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / quality (pull_request) Successful in 17s
CI / lint (pull_request) Successful in 18s
CI / build (pull_request) Successful in 21s
CI / e2e_tests (pull_request) Successful in 33s
CI / security (pull_request) Successful in 1m4s
CI / typecheck (pull_request) Successful in 1m11s
CI / unit_tests (pull_request) Successful in 3m4s
CI / integration_tests (pull_request) Successful in 3m47s
CI / docker (pull_request) Successful in 43s
CI / coverage (pull_request) Successful in 6m36s
CI / benchmark-regression (pull_request) Successful in 37m38s
MoSCoW
Must have
Points
88
Priority
Critical
State
In Progress
Type
Bug
test(e2e): TDD failing tests for E2E mock-only coverage (bug #658)
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 14s
CI / build (pull_request) Successful in 15s
CI / quality (pull_request) Successful in 18s
CI / e2e_tests (pull_request) Successful in 34s
CI / security (pull_request) Successful in 37s
CI / typecheck (pull_request) Successful in 39s
CI / unit_tests (pull_request) Successful in 4m46s
CI / integration_tests (pull_request) Successful in 5m11s
CI / docker (pull_request) Successful in 42s
CI / coverage (pull_request) Successful in 5m35s
CI / benchmark-regression (pull_request) Successful in 36m16s
MoSCoW
Must have
Points
3
Priority
Critical
State
In Progress
Type
Testing
fix(cli): handle missing database in session list command
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 16s
CI / quality (pull_request) Successful in 20s
CI / build (pull_request) Successful in 20s
CI / e2e_tests (pull_request) Successful in 27s
CI / typecheck (pull_request) Successful in 40s
CI / security (pull_request) Successful in 52s
CI / unit_tests (pull_request) Successful in 3m9s
CI / integration_tests (pull_request) Successful in 3m32s
CI / docker (pull_request) Successful in 40s
CI / coverage (pull_request) Successful in 5m41s
CI / benchmark-regression (pull_request) Successful in 36m0s
MoSCoW
Must have
Points
8
Priority
High
State
In Review
Type
Bug
fix(cli): handle missing database in session list command
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 16s
CI / build (pull_request) Successful in 17s
CI / quality (pull_request) Successful in 18s
CI / security (pull_request) Successful in 37s
CI / typecheck (pull_request) Successful in 43s
CI / unit_tests (pull_request) Successful in 3m2s
CI / integration_tests (pull_request) Successful in 3m23s
CI / docker (pull_request) Successful in 40s
CI / coverage (pull_request) Successful in 6m27s
CI / benchmark-regression (pull_request) Successful in 33m46s
Priority
High
State
Unverified
Type
Bug
fix(test): resolve race condition in M4 validation integration test
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 25s
CI / quality (pull_request) Successful in 25s
CI / build (pull_request) Successful in 26s
CI / typecheck (pull_request) Successful in 1m9s
CI / security (pull_request) Successful in 1m18s
CI / integration_tests (pull_request) Successful in 3m43s
CI / unit_tests (pull_request) Successful in 4m8s
CI / docker (pull_request) Successful in 43s
CI / coverage (pull_request) Successful in 5m35s
CI / benchmark-regression (pull_request) Successful in 32m44s
MoSCoW
Must have
Points
3
Priority
Critical
State
In Review
Type
Bug
test(cli): add failing tests for session list DI container error (#554)
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 15s
CI / build (pull_request) Successful in 17s
CI / quality (pull_request) Successful in 18s
CI / typecheck (pull_request) Successful in 37s
CI / security (pull_request) Successful in 50s
CI / unit_tests (pull_request) Successful in 2m49s
CI / integration_tests (pull_request) Successful in 3m27s
CI / docker (pull_request) Successful in 40s
CI / coverage (pull_request) Successful in 5m16s
CI / benchmark-regression (pull_request) Successful in 34m45s
MoSCoW
Must have
Points
2
Priority
Medium
State
In Review
Type
Testing
test(cli): add failing tests for session create DI container error (#570)
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 16s
CI / build (pull_request) Successful in 18s
CI / quality (pull_request) Successful in 21s
CI / typecheck (pull_request) Successful in 39s
CI / security (pull_request) Successful in 39s
CI / unit_tests (pull_request) Successful in 4m22s
CI / integration_tests (pull_request) Successful in 5m2s
CI / docker (pull_request) Successful in 40s
CI / coverage (pull_request) Successful in 5m9s
CI / benchmark-regression (pull_request) Successful in 34m47s
MoSCoW
Must have
Points
2
Priority
Medium
State
In Review
Type
Testing
fix(actor): handle empty actor list without validation error
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 15s
CI / build (pull_request) Successful in 17s
CI / quality (pull_request) Successful in 21s
CI / security (pull_request) Successful in 36s
CI / typecheck (pull_request) Successful in 39s
CI / unit_tests (pull_request) Successful in 2m58s
CI / integration_tests (pull_request) Successful in 3m28s
CI / docker (pull_request) Successful in 40s
CI / coverage (pull_request) Successful in 5m27s
CI / benchmark-regression (pull_request) Successful in 36m5s
MoSCoW
Must have
Points
3
Priority
High
State
In Review
Type
Bug
test(e2e): add M6 autonomy acceptance suite
All checks were successful
CI / benchmark-publish (pull_request) Has been skipped
CI / lint (pull_request) Successful in 14s
CI / quality (pull_request) Successful in 16s
CI / build (pull_request) Successful in 23s
CI / typecheck (pull_request) Successful in 31s
CI / security (pull_request) Successful in 59s
CI / integration_tests (pull_request) Successful in 3m30s
CI / benchmark-regression (pull_request) Successful in 21m22s
CI / unit_tests (pull_request) Successful in 23m4s
CI / docker (pull_request) Successful in 16s
CI / coverage (pull_request) Successful in 43m38s
State
Completed
Type
Testing