fix(security): use relpath containment instead of startswith to prevent prefix-collision bypass #11242

2026-05-17T20:14:45Z

HAL9000 commented

2026-05-17 20:14:45 +00:00

Summary

Replaces insecure str().startswith() path containment checks with proper, canonicalized path containment using os.path.relpath(), Path.is_relative_to(), and Path.relative_to() across three files. This prevents prefix-collision attacks where a malicious path like /tmp/evil could bypass an intended check for paths starting with /tmp/e.

## Summary Replaces insecure `str().startswith()` path containment checks with proper, canonicalized path containment using `os.path.relpath()`, `Path.is_relative_to()`, and `Path.relative_to()` across three files. This prevents prefix-collision attacks where a malicious path like `/tmp/evil` could bypass an intended check for paths starting with `/tmp/e`.

HAL9000 added 1 commit 2026-05-17 20:14:45 +00:00

fix(security): use relpath containment instead of startswith to prevent prefix-collision bypass

CI / push-validation (pull_request) Successful in 31s

Details

CI / helm (pull_request) Successful in 43s

Details

CI / build (pull_request) Successful in 1m13s

Details

CI / lint (pull_request) Successful in 1m46s

Details

CI / quality (pull_request) Successful in 1m51s

Details

CI / security (pull_request) Successful in 1m51s

Details

CI / typecheck (pull_request) Successful in 2m3s

Details

CI / integration_tests (pull_request) Successful in 4m38s

Details

CI / unit_tests (pull_request) Failing after 6m42s

Details

CI / coverage (pull_request) Has been skipped

Details

CI / docker (pull_request) Has been skipped

Details

CI / status-check (pull_request) Failing after 3s

Details

52c7e96d6b

Replace str().startswith(root) checks with proper relpath-based
containment that prevents prefix-collision bypass attacks where a
sibling directory whose name starts with the root path would pass
an unchecked containment test.

Files changed:
- decomposition_clustering.py: Use os.path.relpath() instead of
  string-prefix matching for root-relative path computation.
- plan.py (_apply_sandbox_changes): Use Path.is_relative_to()
  instead of str.startswith() for sandbox-to-project root check.
- inline_executor.py: Use Path.relative_to() instead of
  str.startswith() for sandbox containment validation.

HAL9000 added the

State

In Review

label 2026-05-18 01:23:14 +00:00

HAL9000 added the controller-managed label 2026-05-28 03:48:50 +00:00

HAL9000 added 1 commit 2026-05-28 03:50:26 +00:00

chore: re-trigger CI [controller]

CI / lint (pull_request) Successful in 49s

Details

CI / typecheck (pull_request) Successful in 1m10s

Details

CI / push-validation (pull_request) Successful in 35s

Details

CI / helm (pull_request) Successful in 40s

Details

CI / build (pull_request) Successful in 47s

Details

CI / quality (pull_request) Successful in 1m7s

Details

CI / security (pull_request) Successful in 1m54s

Details

CI / integration_tests (pull_request) Failing after 4m10s

Details

CI / unit_tests (pull_request) Failing after 4m30s

Details

CI / coverage (pull_request) Has been skipped

Details

CI / docker (pull_request) Has been skipped

Details

CI / status-check (pull_request) Failing after 3s

Details

4a71e3c368

HAL9000 force-pushed feature/security-fix-relpath-pr-11217 from 4a71e3c368 to 9d0c5850b3

2026-05-28 04:09:45 +00:00

Compare

HAL9000 added 2 commits 2026-05-28 10:40:17 +00:00

fix(security): correct inverted relpath check and catch ValueError in path resolve ba1b2e20af

Two follow-up bugs from the relpath-containment refactor:

1. decomposition_clustering._directory_key had inverted boolean logic:
   `if not (rel == "" or not rel.startswith(".." + os.sep))` meant the
   relative path was used ONLY when it escaped the root, and discarded
   when the path was actually inside the root. Fixed to update
   `normalized` exactly when the path is inside (or equal to) root.

2. inline_executor._validate_paths split the original
   `except (OSError, ValueError)` so that ValueError from
   `Path(value).resolve()` (raised for invalid paths such as those
   containing null bytes) propagated instead of returning a graceful
   "Invalid path" error string.

ISSUES CLOSED: #11217

chore: worker ruff auto-fix (pre-push lint gate)

CI / push-validation (pull_request) Successful in 22s

Details

CI / build (pull_request) Successful in 33s

Details

CI / helm (pull_request) Successful in 34s

Details

CI / lint (pull_request) Successful in 36s

Details

CI / quality (pull_request) Successful in 1m14s

Details

CI / typecheck (pull_request) Successful in 1m18s

Details

CI / security (pull_request) Successful in 1m19s

Details

CI / integration_tests (pull_request) Failing after 2m54s

Details

CI / unit_tests (pull_request) Successful in 4m46s

Details

CI / docker (pull_request) Successful in 1m36s

Details

CI / coverage (pull_request) Successful in 11m13s

Details

CI / status-check (pull_request) Failing after 3s

Details

0aff794f69

HAL9000 commented

2026-05-28 13:46:14 +00:00

[CONTROLLER-DEFER:Gate 1:needs_evaluation]

This PR has been deferred for re-evaluation. The controller has stepped back
from processing it. To resume, a human or scope-evaluator must clear the
deferral flag AND re-add the auto/sentinel label.

Decision:

Gate: Gate 1
Reason category: needs_evaluation
Canonical: #-
LLM confidence: medium
LLM reasoning: PR #11242 (anchor) and PR #11217 have identical titles and both fix the same security issue (#7478) using the relpath containment approach. The anchor's head branch name explicitly references "#11217", indicating it builds on that work. However, #11242 has 4.5x larger scope (32 additions vs 7, covering 3 files vs 1). This could represent: (1) necessary comprehensive coverage of the vulnerability across the codebase, or (2) redundant extension beyond what #11217 sufficiently addresses. Human judgment required to determine if #11242's broader file coverage is essential or unnecessary.
Preserved value (when applicable): PR #11242 applies the security fix to 3 files across the codebase (32 additions, 16 deletions), while PR #11217 covers only 1 file (7 additions, 1 deletion). The broader scope in the anchor may be necessary for comprehensive security coverage against prefix-collision bypass attacks across all vulnerable code paths, or may represent over-engineering if the issue is adequately addressed by PR #11217's focused approach.

To clear the deferral (SQL):
UPDATE workflows SET deferred_reason=NULL,
deferred_at=NULL,
deferred_target_workflow_id=NULL
WHERE workflow_id = 8;

INSERT INTO controller_events
  (workflow_id, ts, event_type, payload, cause, forgejo_write_pending, replay_attempts)
VALUES (8, datetime('now'), 'deferral_cleared',
        json_object('cleared_by', 'operator', 'reason', '<your reason>'),
        'operator', 0, 0);

Audit ID: 1354

Automated by the CleverAgents controller pipeline.
Identity: HAL9000 (pipeline action)

[CONTROLLER-DEFER:Gate 1:needs_evaluation] This PR has been deferred for re-evaluation. The controller has stepped back from processing it. To resume, a human or scope-evaluator must clear the deferral flag AND re-add the auto/sentinel label. Decision: - Gate: Gate 1 - Reason category: needs_evaluation - Canonical: #- - LLM confidence: medium - LLM reasoning: PR #11242 (anchor) and PR #11217 have identical titles and both fix the same security issue (#7478) using the relpath containment approach. The anchor's head branch name explicitly references "#11217", indicating it builds on that work. However, #11242 has 4.5x larger scope (32 additions vs 7, covering 3 files vs 1). This could represent: (1) necessary comprehensive coverage of the vulnerability across the codebase, or (2) redundant extension beyond what #11217 sufficiently addresses. Human judgment required to determine if #11242's broader file coverage is essential or unnecessary. - Preserved value (when applicable): PR #11242 applies the security fix to 3 files across the codebase (32 additions, 16 deletions), while PR #11217 covers only 1 file (7 additions, 1 deletion). The broader scope in the anchor may be necessary for comprehensive security coverage against prefix-collision bypass attacks across all vulnerable code paths, or may represent over-engineering if the issue is adequately addressed by PR #11217's focused approach. To clear the deferral (SQL): UPDATE workflows SET deferred_reason=NULL, deferred_at=NULL, deferred_target_workflow_id=NULL WHERE workflow_id = 8; INSERT INTO controller_events (workflow_id, ts, event_type, payload, cause, forgejo_write_pending, replay_attempts) VALUES (8, datetime('now'), 'deferral_cleared', json_object('cleared_by', 'operator', 'reason', '<your reason>'), 'operator', 0, 0); Audit ID: 1354 --- Automated by the CleverAgents controller pipeline. Identity: HAL9000 (pipeline action)

HAL9000 added auto/needs-reevaluation

and removed

labels 2026-05-28 13:46:35 +00:00

HAL9000 commented

2026-05-31 16:07:56 +00:00

event occurred 2026-05-28T04:07:37.990738+00:00

📋 Estimate: tier 1.

3-file security fix replacing startswith with proper path containment (relpath/is_relative_to/relative_to). CI is failing with specific actor-related test regressions in both unit and integration gates — Unknown Actor Name Error and Actor App Unknown Name Error — indicating the path containment changes broke actor name/path resolution logic. Not flaky CI; the implementer must diagnose the regression across the actor subsystem, fix without undoing the security improvement, and validate across both test suites. Cross-file context and actor path resolution understanding required.

*event occurred 2026-05-28T04:07:37.990738+00:00* **📋 Estimate: tier 1.** 3-file security fix replacing startswith with proper path containment (relpath/is_relative_to/relative_to). CI is failing with specific actor-related test regressions in both unit and integration gates — Unknown Actor Name Error and Actor App Unknown Name Error — indicating the path containment changes broke actor name/path resolution logic. Not flaky CI; the implementer must diagnose the regression across the actor subsystem, fix without undoing the security improvement, and validate across both test suites. Cross-file context and actor path resolution understanding required.

HAL9000 commented

2026-05-31 16:08:02 +00:00

(attempt #3, tier 1)

event occurred 2026-05-28T04:12:01.850398+00:00

🔧 Implementer attempt — rebased.

Pushed 1 commit: 9d0c585.

_(attempt #3, tier 1)_ *event occurred 2026-05-28T04:12:01.850398+00:00* **🔧 Implementer attempt — `rebased`.** Pushed 1 commit: `9d0c585`.

HAL9000 commented

2026-05-31 16:08:03 +00:00

(attempt #4, tier 1)

event occurred 2026-05-28T04:28:00.712692+00:00

🔧 Implementer attempt — ci-not-ready.

_(attempt #4, tier 1)_ *event occurred 2026-05-28T04:28:00.712692+00:00* **🔧 Implementer attempt — `ci-not-ready`.**

HAL9000 commented

2026-05-31 16:08:04 +00:00

(attempt #8, tier 1)

event occurred 2026-05-28T10:42:21.506801+00:00

🔧 Implementer attempt — resolved.

Pushed 2 commits: ba1b2e2, 0aff794.

Files touched: src/cleveragents/application/services/decomposition_clustering.py, src/cleveragents/skills/inline_executor.py.

_(attempt #8, tier 1)_ *event occurred 2026-05-28T10:42:21.506801+00:00* **🔧 Implementer attempt — `resolved`.** Pushed 2 commits: `ba1b2e2`, `0aff794`. Files touched: `src/cleveragents/application/services/decomposition_clustering.py`, `src/cleveragents/skills/inline_executor.py`.

HAL9000 commented

2026-05-31 16:08:05 +00:00

(attempt #9, tier 2)

event occurred 2026-05-28T11:11:30.953087+00:00

🔧 Implementer attempt — blocked.

Blockers:

CI integration_tests reports 'Plan Use And Execute With Custom Actor' (robot/m2_e2e_verification.robot) FAILED on head 0aff794f, but the same test passes locally on the exact head SHA. Verified twice: (1) standalone -t 'Plan Use And Execute With Custom Actor' on robot/m2_e2e_verification.robot — PASS in 51s; (2) full pabot --processes 64 m2 suite + entire robot/ directory — 2014 tests, 2014 passed, this test PASSED in 98.7s. No Traceback or unhandled INTERNAL error is visible in the CI log excerpt; only the Robot framework FAIL summary line. The test exercises 'agents plan use' + 'agents plan execute' for a plan in strategize/queued state expecting a controlled 'not ready' rejection. The 3 files in this PR's diff (decomposition_clustering.py, plan.py, inline_executor.py) implement a relpath-containment security fix and are not exercised by this test path (plan is rejected before sandbox apply runs; decomposition_clustering._directory_key is not on the plan-use/execute path). No reproducible local failure to root-cause, and noop is not satisfiable because CI overall_state=failure. Suspected CI runner flake (act/docker environment, parallel-test contention, or external runner reaper). Recommend retrigger or inspect run 21299 job 5 raw log beyond the FAIL summary line for actual failure detail.

_(attempt #9, tier 2)_ *event occurred 2026-05-28T11:11:30.953087+00:00* **🔧 Implementer attempt — `blocked`.** Blockers: - CI integration_tests reports 'Plan Use And Execute With Custom Actor' (robot/m2_e2e_verification.robot) FAILED on head 0aff794f, but the same test passes locally on the exact head SHA. Verified twice: (1) standalone -t 'Plan Use And Execute With Custom Actor' on robot/m2_e2e_verification.robot — PASS in 51s; (2) full pabot --processes 64 m2 suite + entire robot/ directory — 2014 tests, 2014 passed, this test PASSED in 98.7s. No Traceback or unhandled INTERNAL error is visible in the CI log excerpt; only the Robot framework FAIL summary line. The test exercises 'agents plan use' + 'agents plan execute' for a plan in strategize/queued state expecting a controlled 'not ready' rejection. The 3 files in this PR's diff (decomposition_clustering.py, plan.py, inline_executor.py) implement a relpath-containment security fix and are not exercised by this test path (plan is rejected before sandbox apply runs; decomposition_clustering._directory_key is not on the plan-use/execute path). No reproducible local failure to root-cause, and noop is not satisfiable because CI overall_state=failure. Suspected CI runner flake (act/docker environment, parallel-test contention, or external runner reaper). Recommend retrigger or inspect run 21299 job 5 raw log beyond the FAIL summary line for actual failure detail.

HAL9000 commented

2026-06-18 12:02:05 +00:00

Closing as redundant: the relpath-containment security fix (issue #7478) has already merged; issue #7478 is closed.

Automated canonical-selection cleanup of the controller's duplicate deferred queue.

Closing as redundant: the relpath-containment security fix (issue #7478) has already merged; issue #7478 is closed. _Automated canonical-selection cleanup of the controller's duplicate deferred queue._

HAL9000 closed this pull request

2026-06-18 12:02:05 +00:00

HAL9000 removed the

State

Paused

label 2026-06-18 12:06:14 +00:00

CI / push-validation (pull_request) Successful in 22s

Details

CI / build (pull_request) Successful in 33s

Required

Details

CI / helm (pull_request) Successful in 34s

Details

CI / lint (pull_request) Successful in 36s

Required

Details

CI / quality (pull_request) Successful in 1m14s

Required

Details

CI / typecheck (pull_request) Successful in 1m18s

Required

Details

CI / security (pull_request) Successful in 1m19s

Required

Details

CI / integration_tests (pull_request) Failing after 2m54s

Required

Details

CI / unit_tests (pull_request) Successful in 4m46s

Required

Details

CI / docker (pull_request) Successful in 1m36s

Required

Details

CI / coverage (pull_request) Successful in 11m13s

Required

Details

CI / status-check (pull_request) Failing after 3s

Details

Pull request closed

Please reopen this pull request to perform a merge.

Sign in to join this conversation.

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: cleveragents/cleveragents-core#11242