ci(pipeline): guard integration/e2e jobs when LLM secrets unavailable #10996

BLOCKING — Guard exit 0 does NOT prevent the nox step from running

Calling exit 0 at the end of this step makes the step succeed, but it does NOT prevent the next "Run integration tests via nox" step from executing. Subsequent steps run unconditionally unless they have an if: expression checking a step output.

When LLM keys are absent, nox will still be invoked, attempt real provider calls, and fail — defeating the purpose of this guard.

Fix: Add id: secret-guard to this step and set a step output, then add if: steps.secret-guard.outputs.secrets_present == 'true' to the nox step:

- name: Verify LLM API keys for integration tests
  id: secret-guard
  env:
    ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
    OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
  run: |
    set -euo pipefail
    HAS_KEYS=false
    [ -n "${ANTHROPIC_API_KEY:-}" ] && HAS_KEYS=true
    [ -n "${OPENAI_API_KEY:-}" ] && HAS_KEYS=true
    echo "secrets_present=$HAS_KEYS" >> $GITHUB_OUTPUT
    if [ "$HAS_KEYS" = "false" ]; then
      echo "WARNING: No LLM API keys configured — skipping integration_tests."
    fi

- name: Run integration tests via nox
  if: steps.secret-guard.outputs.secrets_present == 'true'
  run: ...

The same fix must be applied to the e2e guard step.

**BLOCKING — Guard `exit 0` does NOT prevent the nox step from running** Calling `exit 0` at the end of this step makes the step succeed, but it does NOT prevent the next "Run integration tests via nox" step from executing. Subsequent steps run unconditionally unless they have an `if:` expression checking a step output. When LLM keys are absent, nox will still be invoked, attempt real provider calls, and fail — defeating the purpose of this guard. **Fix**: Add `id: secret-guard` to this step and set a step output, then add `if: steps.secret-guard.outputs.secrets_present == 'true'` to the nox step: ```yaml - name: Verify LLM API keys for integration tests id: secret-guard env: ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }} OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }} run: | set -euo pipefail HAS_KEYS=false [ -n "${ANTHROPIC_API_KEY:-}" ] && HAS_KEYS=true [ -n "${OPENAI_API_KEY:-}" ] && HAS_KEYS=true echo "secrets_present=$HAS_KEYS" >> $GITHUB_OUTPUT if [ "$HAS_KEYS" = "false" ]; then echo "WARNING: No LLM API keys configured — skipping integration_tests." fi - name: Run integration tests via nox if: steps.secret-guard.outputs.secrets_present == 'true' run: ... ``` The same fix must be applied to the e2e guard step.

.forgejo/workflows/ci.yml Outdated

						
				@@ -608,7 +676,7 @@ jobs:

				                  fi

HAL9001 commented

BLOCKING — Invalid YAML: wrong indentation on status-check: key

This line was changed from status-check: (4 spaces, matching all other top-level job keys) to status-check: (3 spaces). This breaks the YAML mapping structure — yaml.safe_load() raises a ParserError at this location.

This is the root cause of the CI failures on this PR. The Forgejo Actions runner cannot parse the workflow file at all.

Fix: Restore the correct 4-space indentation:

    status-check:

**BLOCKING — Invalid YAML: wrong indentation on `status-check:` key** This line was changed from ` status-check:` (4 spaces, matching all other top-level job keys) to ` status-check:` (3 spaces). This breaks the YAML mapping structure — `yaml.safe_load()` raises a `ParserError` at this location. This is the root cause of the CI failures on this PR. The Forgejo Actions runner cannot parse the workflow file at all. **Fix**: Restore the correct 4-space indentation: ```yaml status-check: ```

.forgejo/workflows/ci.yml Outdated

						
				@@ -648,0 +716,4 @@

				                  # LLM-dependent jobs (integration_tests, e2e_tests) may skip

				                  # gracefully when secrets are unavailable. Accept success or skip;

				                  # only fail on failure or cancelled.

				                  for job in integration_tests e2e_tests; do

HAL9001 commented

BLOCKING — ${{ needs.$job.result }} is a broken expression that always produces an empty string

${{ ... }} expressions are evaluated by Forgejo Actions at workflow parse/plan time, not at bash runtime. The variable $job exists only in the bash shell — it is not available in the Actions expression context. As a result, needs.$job.result resolves to an empty string for every iteration of the loop, meaning $result will always be "" and the failure check will never fire.

This guard is silently broken and provides no protection against actual failures in integration_tests or e2e_tests.

Fix: Reference each job result explicitly without relying on a bash variable:

# integration_tests
if [ "${{ needs.integration_tests.result }}" = "failure" ] || \
   [ "${{ needs.integration_tests.result }}" = "cancelled" ]; then
    echo "FAILED: integration_tests did not succeed or skip"
    exit 1
fi
echo "OK: integration_tests result is ${{ needs.integration_tests.result }}"

# e2e_tests
if [ "${{ needs.e2e_tests.result }}" = "failure" ] || \
   [ "${{ needs.e2e_tests.result }}" = "cancelled" ]; then
    echo "FAILED: e2e_tests did not succeed or skip"
    exit 1
fi
echo "OK: e2e_tests result is ${{ needs.e2e_tests.result }}"

**BLOCKING — `${{ needs.$job.result }}` is a broken expression that always produces an empty string** `${{ ... }}` expressions are evaluated by Forgejo Actions at workflow parse/plan time, not at bash runtime. The variable `$job` exists only in the bash shell — it is not available in the Actions expression context. As a result, `needs.$job.result` resolves to an empty string for every iteration of the loop, meaning `$result` will always be `""` and the failure check will never fire. This guard is silently broken and provides no protection against actual failures in `integration_tests` or `e2e_tests`. **Fix**: Reference each job result explicitly without relying on a bash variable: ```yaml # integration_tests if [ "${{ needs.integration_tests.result }}" = "failure" ] || \ [ "${{ needs.integration_tests.result }}" = "cancelled" ]; then echo "FAILED: integration_tests did not succeed or skip" exit 1 fi echo "OK: integration_tests result is ${{ needs.integration_tests.result }}" # e2e_tests if [ "${{ needs.e2e_tests.result }}" = "failure" ] || \ [ "${{ needs.e2e_tests.result }}" = "cancelled" ]; then echo "FAILED: e2e_tests did not succeed or skip" exit 1 fi echo "OK: e2e_tests result is ${{ needs.e2e_tests.result }}" ```

CHANGELOG.md Outdated

						
				@@ -10,0 +21,4 @@

				  merging when only non-LLM aspects of the change were modified. The `benchmark-regression`

				  job in `master.yml` already used this pattern; this brings consistency to all jobs that

				  gate on external secrets (see lines 98-110 of `master.yml`).

HAL9001 commented

BLOCKING — Duplicate ### Fixed heading in [Unreleased] section

A new ### Fixed heading was inserted here, but a ### Fixed heading already exists just below it (line ~25). Keep a Changelog format allows only one occurrence of each section header per release block. Having two ### Fixed sections in [Unreleased] is malformed.

Fix: Remove this extra ### Fixed heading. Move the CI pipeline bullet point to be a new list item under the single existing ### Fixed section.

**BLOCKING — Duplicate `### Fixed` heading in `[Unreleased]` section** A new `### Fixed` heading was inserted here, but a `### Fixed` heading already exists just below it (line ~25). Keep a Changelog format allows only one occurrence of each section header per release block. Having two `### Fixed` sections in `[Unreleased]` is malformed. **Fix**: Remove this extra `### Fixed` heading. Move the CI pipeline bullet point to be a new list item under the single existing `### Fixed` section.

features/steps/ci_llm_guard_steps.py Outdated

						
				@@ -0,0 +134,4 @@

				@then("its status-check result logic should handle \"skipped\" as acceptable")

				def step_then_status_check_handles_skipped(context):

				    """Verify that the status-check job treats skipped/success as acceptable for LLM-dependent jobs."""

				    import re as _re

HAL9001 commented

2026-05-07 18:54:26 +00:00

Non-blocking — Redundant local import of re inside function body

import re as _re is unnecessary here: re is already imported at module level on line 3, and _re is not actually used anywhere in this function body.

Fix: Remove this local import.

Also note: all step function parameters currently lack type annotations, which will fail Pyright strict mode. Consider annotating them:

from behave.runner import Context

def step_given_ci_workflow_file(context: Context, path: str) -> None:
    ...

**Non-blocking — Redundant local import of `re` inside function body** `import re as _re` is unnecessary here: `re` is already imported at module level on line 3, and `_re` is not actually used anywhere in this function body. **Fix**: Remove this local import. Also note: all step function parameters currently lack type annotations, which will fail Pyright strict mode. Consider annotating them: ```python from behave.runner import Context def step_given_ci_workflow_file(context: Context, path: str) -> None: ... ```

HAL9001 commented

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

--- Automated by CleverAgents Bot Supervisor: PR Review | Agent: pr-review-worker

HAL9001 requested changes 2026-05-07 21:34:26 +00:00

HAL9001 left a comment

Review Summary — PR #10996: `ci(pipeline): guard integration/e2e jobs when LLM secrets unavailable`

The intent of this PR is correct and the problem it solves is real. However there are three correctness defects in the implementation that make the guard non-functional, plus several conformance and quality issues that must be fixed before this can be approved.

❌ BLOCKING — Critical Implementation Defects

1. exit 0 in a step does NOT skip subsequent steps (integration_tests + e2e_tests)

In Forgejo/GitHub Actions, each step is an independent shell invocation. exit 0 terminates that step successfully but the runner unconditionally starts the next step. Both Run integration tests via nox and Run E2E tests via nox have no if: condition, so they execute regardless of whether the guard step exited 0 or continued. The guard is entirely non-functional as written.

Fix: add a step id to the guard step and use if: steps.<id>.conclusion == success && env.HAS_KEYS == true on the nox step — or better, use a step output:

- name: Verify LLM API keys for integration tests
  id: secret-guard
  run: |
    if [ -n "${ANTHROPIC_API_KEY:-}" ] || [ -n "${OPENAI_API_KEY:-}" ]; then
      echo "secrets_present=true" >> $GITHUB_OUTPUT
    else
      echo "secrets_present=false" >> $GITHUB_OUTPUT
    fi

- name: Run integration tests via nox
  if: steps.secret-guard.outputs.secrets_present == true
  run: ...

This also aligns with issue #9128 proposal item 2 ("Surface a boolean output such as integration_secrets_present") which is not implemented.

2. ${{ needs.$job.result }} is a dead expression in the status-check for loop

Forgejo Actions template expressions (${{ ... }}) are evaluated at YAML-parse / workflow-dispatch time — before any job runs. At that point, $job is a bash shell variable that does not exist in the Actions expression context. needs.$job resolves to nothing and the expression evaluates to an empty string. The for loop therefore always evaluates result="", which never equals "failure" or "cancelled", so the check is a no-op. The loop never validates either job result.

Fix: expand the expressions explicitly, one per job:

if [ "${{ needs.integration_tests.result }}" = "failure" ] || \
   [ "${{ needs.integration_tests.result }}" = "cancelled" ]; then
    echo "FAILED: integration_tests result: ${{ needs.integration_tests.result }}"
    exit 1
fi
if [ "${{ needs.e2e_tests.result }}" = "failure" ] || \
   [ "${{ needs.e2e_tests.result }}" = "cancelled" ]; then
    echo "FAILED: e2e_tests result: ${{ needs.e2e_tests.result }}"
    exit 1
fi

3. YAML indentation broken on status-check: job key

All other top-level job keys in jobs: are indented 4 spaces. This PR changes status-check: to 3 spaces, which makes the YAML structurally invalid — it may parse into an unexpected position in the document tree or cause the workflow to fail with a parse error. This is almost certainly why CI is failing immediately ("Failing after 0s").

Fix: restore 4-space indentation: status-check:

❌ BLOCKING — Test Quality

4. 8 of 9 BDD scenarios are missing the Given setup step — context.ci_workflow_path is never set

Only the first scenario (integration_tests job exists in CI workflow) has a Given step that sets context.ci_workflow_path. All remaining 8 scenarios begin with When I parse the CI workflow YAML, which calls context.ci_workflow_path on line 25 of the step file — but this attribute is unset in a fresh scenario context (Behave does NOT share state between scenarios). This will raise AttributeError on every scenario after the first.

Fix: add a Background: block at the top of the feature:

Background:
  Given the CI workflow file at ".forgejo/workflows/ci.yml"

5. In-function import re as _re at line 137 violates project import rules

The project rule is explicit: all imports must be at the top of the file. import re is already present at the top of the file (line 1 of the imports). The inner alias import re as _re inside step_then_status_check_handles_skipped is both redundant and a rule violation.

Fix: remove the in-function import; the top-level import re is sufficient.

⚠️ BLOCKING — Conformance Issues

6. Branch name fix/9222-guard-integration-e2e-jobs is non-conforming

Branch naming rules require feature/mN-<name> or bugfix/mN-<name> (with milestone number). The prefix fix/ is not a valid prefix, and there is no milestone number in the name. For a CI infrastructure task in v3.2.0, this should be feature/m2-guard-integration-e2e-jobs.

7. PR has no labels applied

CONTRIBUTING requires exactly one Type/ label on every PR. The PR description mentions Type/Ci and Type/Task but neither is actually applied. Please apply Type/Task.

8. Milestone mismatch: PR is assigned v3.9.0, issue #9128 is in v3.2.0

Per CONTRIBUTING rules, the PR must be assigned to the same milestone as the linked issue. Issue #9128 milestone is v3.2.0. The PR should be moved to v3.2.0.

9. docs/development/ci-cd.md update is missing

Issue #9128 proposal item 4 explicitly requires: "Document the behavior in docs/development/ci-cd.md, including how maintainers can trigger full runs when they need real-provider validation." The file exists (docs/development/ci-cd.md) but was not updated. The documentation must be included in the same commit as the code change.

10. CHANGELOG has a duplicate ### Fixed header and references wrong issue number

The PR introduces a second ### Fixed section header in the [Unreleased] block (line 25 in the new CHANGELOG), creating two consecutive ### Fixed sections instead of adding the entry to the existing one. Additionally, the entry references #9222 (which is a PR/issue for a previous attempt) rather than #9128 (the canonical issue this PR closes). Fix: merge into the existing ### Fixed block and change the reference to #9128.

CI Status

CI is currently failing on:

ci.yml / fix(ci): guard integration/e2e jobs when LLM secrets unavailable — failing immediately (0s), consistent with the YAML indentation defect on status-check:
CI / benchmark-regression — failing after 51s (pre-existing or introduced regression; must be investigated)

All required CI gates (lint, typecheck, security, unit_tests, coverage) must pass before this PR can be merged.

What is Good

The core idea is sound and the problem description is accurate.
The shell guard logic itself (checking for non-empty env vars) is correct.
The status-check conceptual change (removing hard != success for LLM jobs, replacing with a failure/cancelled check) is the right approach — just needs the expression bug fixed.
The CHANGELOG entry is well-written (content-wise) and the CONTRIBUTORS entry is correct.
The commit message is well-formed with ISSUES CLOSED: #9222 footer and a clear body.
The BDD step definitions have correct docstrings and overall structure.
Single commit for this change — good atomicity.

Required Changes Before Approval

Fix the exit 0 / no-condition bug: add id: secret-guard to guard steps and if: steps.secret-guard.outputs.secrets_present == true to nox steps
Fix ${{ needs.$job.result }} — expand to explicit per-job expressions
Fix status-check: YAML indentation (3 spaces → 4 spaces)
Add Background: block to the feature file so all scenarios have ci_workflow_path set
Remove in-function import re as _re (line 137 of step file)
Update docs/development/ci-cd.md as required by the issue
Fix the duplicate ### Fixed section in CHANGELOG and correct issue reference to #9128
Apply Type/Task label to this PR
Move milestone to v3.2.0 (matching issue #9128)

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

## Review Summary — PR #10996: `ci(pipeline): guard integration/e2e jobs when LLM secrets unavailable` The intent of this PR is correct and the problem it solves is real. However there are **three correctness defects in the implementation that make the guard non-functional**, plus several conformance and quality issues that must be fixed before this can be approved. --- ### ❌ BLOCKING — Critical Implementation Defects **1. `exit 0` in a step does NOT skip subsequent steps (integration_tests + e2e_tests)** In Forgejo/GitHub Actions, each `step` is an independent shell invocation. `exit 0` terminates *that step* successfully but the runner unconditionally starts the *next* step. Both `Run integration tests via nox` and `Run E2E tests via nox` have **no `if:` condition**, so they execute regardless of whether the guard step exited 0 or continued. The guard is entirely non-functional as written. Fix: add a step `id` to the guard step and use `if: steps.<id>.conclusion == success && env.HAS_KEYS == true` on the nox step — or better, use a step output: ```yaml - name: Verify LLM API keys for integration tests id: secret-guard run: | if [ -n "${ANTHROPIC_API_KEY:-}" ] || [ -n "${OPENAI_API_KEY:-}" ]; then echo "secrets_present=true" >> $GITHUB_OUTPUT else echo "secrets_present=false" >> $GITHUB_OUTPUT fi - name: Run integration tests via nox if: steps.secret-guard.outputs.secrets_present == true run: ... ``` This also aligns with issue #9128 proposal item 2 ("Surface a boolean output such as `integration_secrets_present`") which is not implemented. **2. `${{ needs.$job.result }}` is a dead expression in the status-check for loop** Forgejo Actions template expressions (`${{ ... }}`) are evaluated at YAML-parse / workflow-dispatch time — before any job runs. At that point, `$job` is a bash shell variable that does not exist in the Actions expression context. `needs.$job` resolves to nothing and the expression evaluates to an empty string. The for loop therefore always evaluates `result=""`, which never equals `"failure"` or `"cancelled"`, so the check is a no-op. The loop never validates either job result. Fix: expand the expressions explicitly, one per job: ```yaml if [ "${{ needs.integration_tests.result }}" = "failure" ] || \ [ "${{ needs.integration_tests.result }}" = "cancelled" ]; then echo "FAILED: integration_tests result: ${{ needs.integration_tests.result }}" exit 1 fi if [ "${{ needs.e2e_tests.result }}" = "failure" ] || \ [ "${{ needs.e2e_tests.result }}" = "cancelled" ]; then echo "FAILED: e2e_tests result: ${{ needs.e2e_tests.result }}" exit 1 fi ``` **3. YAML indentation broken on `status-check:` job key** All other top-level job keys in `jobs:` are indented 4 spaces. This PR changes `status-check:` to 3 spaces, which makes the YAML structurally invalid — it may parse into an unexpected position in the document tree or cause the workflow to fail with a parse error. This is almost certainly why CI is failing immediately ("Failing after 0s"). Fix: restore 4-space indentation: ` status-check:` --- ### ❌ BLOCKING — Test Quality **4. 8 of 9 BDD scenarios are missing the `Given` setup step — `context.ci_workflow_path` is never set** Only the first scenario (`integration_tests job exists in CI workflow`) has a `Given` step that sets `context.ci_workflow_path`. All remaining 8 scenarios begin with `When I parse the CI workflow YAML`, which calls `context.ci_workflow_path` on line 25 of the step file — but this attribute is unset in a fresh scenario context (Behave does NOT share state between scenarios). This will raise `AttributeError` on every scenario after the first. Fix: add a `Background:` block at the top of the feature: ```gherkin Background: Given the CI workflow file at ".forgejo/workflows/ci.yml" ``` **5. In-function `import re as _re` at line 137 violates project import rules** The project rule is explicit: all imports must be at the top of the file. `import re` is already present at the top of the file (line 1 of the imports). The inner alias `import re as _re` inside `step_then_status_check_handles_skipped` is both redundant and a rule violation. Fix: remove the in-function import; the top-level `import re` is sufficient. --- ### ⚠️ BLOCKING — Conformance Issues **6. Branch name `fix/9222-guard-integration-e2e-jobs` is non-conforming** Branch naming rules require `feature/mN-<name>` or `bugfix/mN-<name>` (with milestone number). The prefix `fix/` is not a valid prefix, and there is no milestone number in the name. For a CI infrastructure task in v3.2.0, this should be `feature/m2-guard-integration-e2e-jobs`. **7. PR has no labels applied** CONTRIBUTING requires exactly one `Type/` label on every PR. The PR description mentions `Type/Ci` and `Type/Task` but neither is actually applied. Please apply `Type/Task`. **8. Milestone mismatch: PR is assigned v3.9.0, issue #9128 is in v3.2.0** Per CONTRIBUTING rules, the PR must be assigned to the same milestone as the linked issue. Issue #9128 milestone is v3.2.0. The PR should be moved to v3.2.0. **9. `docs/development/ci-cd.md` update is missing** Issue #9128 proposal item 4 explicitly requires: *"Document the behavior in `docs/development/ci-cd.md`, including how maintainers can trigger full runs when they need real-provider validation."* The file exists (`docs/development/ci-cd.md`) but was not updated. The documentation must be included in the same commit as the code change. **10. CHANGELOG has a duplicate `### Fixed` header and references wrong issue number** The PR introduces a second `### Fixed` section header in the `[Unreleased]` block (line 25 in the new CHANGELOG), creating two consecutive `### Fixed` sections instead of adding the entry to the existing one. Additionally, the entry references `#9222` (which is a PR/issue for a previous attempt) rather than `#9128` (the canonical issue this PR closes). Fix: merge into the existing `### Fixed` block and change the reference to `#9128`. --- ### CI Status CI is currently **failing** on: - `ci.yml / fix(ci): guard integration/e2e jobs when LLM secrets unavailable` — failing immediately (0s), consistent with the YAML indentation defect on `status-check:` - `CI / benchmark-regression` — failing after 51s (pre-existing or introduced regression; must be investigated) All required CI gates (lint, typecheck, security, unit_tests, coverage) must pass before this PR can be merged. --- ### What is Good - The core idea is sound and the problem description is accurate. - The shell guard logic itself (checking for non-empty env vars) is correct. - The status-check conceptual change (removing hard `!= success` for LLM jobs, replacing with a failure/cancelled check) is the right approach — just needs the expression bug fixed. - The CHANGELOG entry is well-written (content-wise) and the CONTRIBUTORS entry is correct. - The commit message is well-formed with `ISSUES CLOSED: #9222` footer and a clear body. - The BDD step definitions have correct docstrings and overall structure. - Single commit for this change — good atomicity. --- ### Required Changes Before Approval 1. Fix the `exit 0` / no-condition bug: add `id: secret-guard` to guard steps and `if: steps.secret-guard.outputs.secrets_present == true` to nox steps 2. Fix `${{ needs.$job.result }}` — expand to explicit per-job expressions 3. Fix `status-check:` YAML indentation (3 spaces → 4 spaces) 4. Add `Background:` block to the feature file so all scenarios have `ci_workflow_path` set 5. Remove in-function `import re as _re` (line 137 of step file) 6. Update `docs/development/ci-cd.md` as required by the issue 7. Fix the duplicate `### Fixed` section in CHANGELOG and correct issue reference to `#9128` 8. Apply `Type/Task` label to this PR 9. Move milestone to v3.2.0 (matching issue #9128) --- _Automated by CleverAgents Bot_ _Supervisor: PR Review | Agent: pr-review-worker_

.forgejo/workflows/ci.yml Outdated

HAL9001 commented

BLOCKING — Run integration tests via nox step is missing an if: condition

This step runs unconditionally. When the guard step above exits 0 (no secrets), this step will still execute and fail. It must be gated on the guard step output:

- name: Run integration tests via nox
  if: steps.secret-guard-integration.outputs.secrets_present == true
  run: |
    mkdir -p build
    nox -s integration_tests 2>&1 | tee build/nox-integration-tests-output.log

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

**BLOCKING — `Run integration tests via nox` step is missing an `if:` condition** This step runs unconditionally. When the guard step above exits 0 (no secrets), this step will still execute and fail. It must be gated on the guard step output: ```yaml - name: Run integration tests via nox if: steps.secret-guard-integration.outputs.secrets_present == true run: | mkdir -p build nox -s integration_tests 2>&1 | tee build/nox-integration-tests-output.log ``` --- Automated by CleverAgents Bot Supervisor: PR Review | Agent: pr-review-worker

.forgejo/workflows/ci.yml Outdated

HAL9001 commented

BLOCKING — Guard uses exit 0 but the nox step has no if: condition

In Forgejo/GitHub Actions, exit 0 terminates this step with a success code, but the runner then unconditionally starts the next step (Run integration tests via nox) because that step has no if: condition. The guard is therefore completely non-functional: when no keys are present, the verify step exits 0, then nox runs anyway and fails trying to reach LLM providers.

Fix: give this step an id and expose a step output, then gate the nox step:

- name: Verify LLM API keys for integration tests
  id: secret-guard-integration
  env:
    ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
    OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
  run: |
    if [ -n "${ANTHROPIC_API_KEY:-}" ] || [ -n "${OPENAI_API_KEY:-}" ]; then
      echo "secrets_present=true" >> $GITHUB_OUTPUT
    else
      echo "No LLM API keys configured — skipping integration tests."
      echo "secrets_present=false" >> $GITHUB_OUTPUT
    fi

- name: Run integration tests via nox
  if: steps.secret-guard-integration.outputs.secrets_present == true
  run: ...

This also satisfies issue #9128 proposal item 2 ("Surface a boolean output such as integration_secrets_present").

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

**BLOCKING — Guard uses `exit 0` but the nox step has no `if:` condition** In Forgejo/GitHub Actions, `exit 0` terminates *this step* with a success code, but the runner then unconditionally starts the **next step** (`Run integration tests via nox`) because that step has no `if:` condition. The guard is therefore completely non-functional: when no keys are present, the verify step exits 0, then nox runs anyway and fails trying to reach LLM providers. Fix: give this step an `id` and expose a step output, then gate the nox step: ```yaml - name: Verify LLM API keys for integration tests id: secret-guard-integration env: ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }} OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }} run: | if [ -n "${ANTHROPIC_API_KEY:-}" ] || [ -n "${OPENAI_API_KEY:-}" ]; then echo "secrets_present=true" >> $GITHUB_OUTPUT else echo "No LLM API keys configured — skipping integration tests." echo "secrets_present=false" >> $GITHUB_OUTPUT fi - name: Run integration tests via nox if: steps.secret-guard-integration.outputs.secrets_present == true run: ... ``` This also satisfies issue #9128 proposal item 2 ("Surface a boolean output such as `integration_secrets_present`"). --- Automated by CleverAgents Bot Supervisor: PR Review | Agent: pr-review-worker

.forgejo/workflows/ci.yml Outdated

HAL9001 commented

BLOCKING — Same exit 0 / no-condition defect as integration_tests (see comment above)

The guard exits 0 but Run E2E tests via nox (the next step) has no if: condition and will run unconditionally. Apply the same fix: id: secret-guard-e2e on this step, if: steps.secret-guard-e2e.outputs.secrets_present == true on the nox step.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

**BLOCKING — Same `exit 0` / no-condition defect as integration_tests (see comment above)** The guard exits 0 but `Run E2E tests via nox` (the next step) has no `if:` condition and will run unconditionally. Apply the same fix: `id: secret-guard-e2e` on this step, `if: steps.secret-guard-e2e.outputs.secrets_present == true` on the nox step. --- Automated by CleverAgents Bot Supervisor: PR Review | Agent: pr-review-worker

.forgejo/workflows/ci.yml Outdated

HAL9001 commented

BLOCKING — ${{ needs.$job.result }} is a dead expression and always evaluates to empty string

Forgejo Actions expressions (${{ ... }}) are expanded at YAML-parse / workflow-dispatch time, before any job executes. At that point $job is a bash variable name that has no meaning in the Actions expression context. needs.$job resolves to needs[] which is an empty string.

As a result result is always "", which is never equal to "failure" or "cancelled", so the loop never fails regardless of the actual job outcome. The entire for loop is a no-op and provides no protection.

Fix: expand the checks explicitly for each job:

if [ "${{ needs.integration_tests.result }}" = "failure" ] || \
   [ "${{ needs.integration_tests.result }}" = "cancelled" ]; then
    echo "FAILED: integration_tests: ${{ needs.integration_tests.result }}"
    exit 1
fi
echo "OK: integration_tests result is ${{ needs.integration_tests.result }}"

if [ "${{ needs.e2e_tests.result }}" = "failure" ] || \
   [ "${{ needs.e2e_tests.result }}" = "cancelled" ]; then
    echo "FAILED: e2e_tests: ${{ needs.e2e_tests.result }}"
    exit 1
fi
echo "OK: e2e_tests result is ${{ needs.e2e_tests.result }}"

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

**BLOCKING — `${{ needs.$job.result }}` is a dead expression and always evaluates to empty string** Forgejo Actions expressions (`${{ ... }}`) are expanded at YAML-parse / workflow-dispatch time, before any job executes. At that point `$job` is a bash variable name that has no meaning in the Actions expression context. `needs.$job` resolves to `needs[]` which is an empty string. As a result `result` is always `""`, which is never equal to `"failure"` or `"cancelled"`, so the loop **never fails** regardless of the actual job outcome. The entire for loop is a no-op and provides no protection. Fix: expand the checks explicitly for each job: ```yaml if [ "${{ needs.integration_tests.result }}" = "failure" ] || \ [ "${{ needs.integration_tests.result }}" = "cancelled" ]; then echo "FAILED: integration_tests: ${{ needs.integration_tests.result }}" exit 1 fi echo "OK: integration_tests result is ${{ needs.integration_tests.result }}" if [ "${{ needs.e2e_tests.result }}" = "failure" ] || \ [ "${{ needs.e2e_tests.result }}" = "cancelled" ]; then echo "FAILED: e2e_tests: ${{ needs.e2e_tests.result }}" exit 1 fi echo "OK: e2e_tests result is ${{ needs.e2e_tests.result }}" ``` --- Automated by CleverAgents Bot Supervisor: PR Review | Agent: pr-review-worker

.forgejo/workflows/ci.yml Outdated

HAL9001 commented

BLOCKING — YAML indentation error: status-check: is indented 3 spaces, all other job keys use 4 spaces

This is almost certainly why CI fails immediately ("Failing after 0s"). All jobs in this file are indented 4 spaces under jobs:. This line was changed from status-check: (4 spaces) to status-check: (3 spaces), which breaks the YAML structure.

Fix:

    status-check:

(restore leading 4-space indent)

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

**BLOCKING — YAML indentation error: `status-check:` is indented 3 spaces, all other job keys use 4 spaces** This is almost certainly why CI fails immediately ("Failing after 0s"). All jobs in this file are indented 4 spaces under `jobs:`. This line was changed from ` status-check:` (4 spaces) to ` status-check:` (3 spaces), which breaks the YAML structure. Fix: ```yaml status-check: ``` (restore leading 4-space indent) --- Automated by CleverAgents Bot Supervisor: PR Review | Agent: pr-review-worker

features/ci_integration_e2e_llm_guard.feature Outdated

						
				@@ -0,0 +14,4 @@

				    And it should have a step that runs shell code containing "OPENAI_API_KEY"

				  Scenario: e2e_tests job has LLM key verification step

				    When I parse the CI workflow YAML

HAL9001 commented

BLOCKING — This scenario and all 7 that follow start with When but context.ci_workflow_path is never set

Behave does NOT share context state between scenarios. Only the first scenario (integration_tests job exists in CI workflow) sets context.ci_workflow_path via a Given step. All remaining scenarios start directly with When I parse the CI workflow YAML, which accesses context.ci_workflow_path (step file line 25) — but that attribute is unset in a fresh scenario context, causing AttributeError on every scenario after the first.

Fix: add a Background: block immediately after the Feature: description:

Background:
  Given the CI workflow file at ".forgejo/workflows/ci.yml"

This runs before every scenario, ensuring context.ci_workflow_path is always set.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

**BLOCKING — This scenario and all 7 that follow start with `When` but `context.ci_workflow_path` is never set** Behave does NOT share context state between scenarios. Only the first scenario (`integration_tests job exists in CI workflow`) sets `context.ci_workflow_path` via a `Given` step. All remaining scenarios start directly with `When I parse the CI workflow YAML`, which accesses `context.ci_workflow_path` (step file line 25) — but that attribute is unset in a fresh scenario context, causing `AttributeError` on every scenario after the first. Fix: add a `Background:` block immediately after the `Feature:` description: ```gherkin Background: Given the CI workflow file at ".forgejo/workflows/ci.yml" ``` This runs before every scenario, ensuring `context.ci_workflow_path` is always set. --- Automated by CleverAgents Bot Supervisor: PR Review | Agent: pr-review-worker

features/steps/ci_llm_guard_steps.py Outdated

						
				@@ -0,0 +138,4 @@

				    jobs = context.ci_workflow.get("jobs", {})

				    status_check = jobs.get("status-check")

				    if status_check is None:

HAL9001 commented

2026-05-07 21:49:31 +00:00

BLOCKING — In-function import re as _re violates project import rules

The project rule is explicit: all imports must be at the top of the file. import re is already present at line 1 of the imports block. This inner alias import re as _re inside the function is both redundant and a rule violation.

Fix: remove this line entirely — use the top-level re import. If the alias was intended to avoid a name collision, there is none here: the existing import re at the top of the file is sufficient.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

**BLOCKING — In-function `import re as _re` violates project import rules** The project rule is explicit: all imports must be at the top of the file. `import re` is already present at line 1 of the imports block. This inner alias `import re as _re` inside the function is both redundant and a rule violation. Fix: remove this line entirely — use the top-level `re` import. If the alias was intended to avoid a name collision, there is none here: the existing `import re` at the top of the file is sufficient. --- Automated by CleverAgents Bot Supervisor: PR Review | Agent: pr-review-worker

HAL9001 commented

Peer review complete — REQUEST_CHANGES submitted (review #7971).

9 required changes identified across 3 categories:

#	File	Issue
1	`.forgejo/workflows/ci.yml`	Guard step uses `exit 0` but nox step has no `if:` — guard is non-functional
2	`.forgejo/workflows/ci.yml`	`Run integration tests via nox` missing `if: steps.secret-guard.outputs.secrets_present == true`
3	`.forgejo/workflows/ci.yml`	Same `exit 0` / missing `if:` defect on e2e_tests guard
4	`.forgejo/workflows/ci.yml`	`status-check:` indented 3 spaces instead of 4 — breaks YAML and causes immediate CI failure
5	`.forgejo/workflows/ci.yml`	`${{ needs.$job.result }}` in for loop is a dead expression — always empty string, never guards
6	`features/ci_integration_e2e_llm_guard.feature`	8 of 9 scenarios missing `Given` setup — `context.ci_workflow_path` unset → `AttributeError`
7	`features/steps/ci_llm_guard_steps.py`	In-function `import re as _re` violates top-of-file import rule
8	`CHANGELOG.md`	Duplicate `### Fixed` header; entry references `#9222` (PR) instead of `#9128` (issue)
9	PR metadata	No `Type/` label applied; milestone is v3.9.0 but issue #9128 is v3.2.0; `docs/development/ci-cd.md` not updated

See the formal review for full details and fix instructions for each item.

Automated by CleverAgents Bot
Supervisor: PR Review | Agent: pr-review-worker

Peer review complete — **REQUEST_CHANGES** submitted (review #7971). 9 required changes identified across 3 categories: | # | File | Issue | |---|---|---| | 1 | `.forgejo/workflows/ci.yml` | Guard step uses `exit 0` but nox step has no `if:` — guard is non-functional | | 2 | `.forgejo/workflows/ci.yml` | `Run integration tests via nox` missing `if: steps.secret-guard.outputs.secrets_present == true` | | 3 | `.forgejo/workflows/ci.yml` | Same `exit 0` / missing `if:` defect on e2e_tests guard | | 4 | `.forgejo/workflows/ci.yml` | `status-check:` indented 3 spaces instead of 4 — breaks YAML and causes immediate CI failure | | 5 | `.forgejo/workflows/ci.yml` | `${{ needs.$job.result }}` in for loop is a dead expression — always empty string, never guards | | 6 | `features/ci_integration_e2e_llm_guard.feature` | 8 of 9 scenarios missing `Given` setup — `context.ci_workflow_path` unset → `AttributeError` | | 7 | `features/steps/ci_llm_guard_steps.py` | In-function `import re as _re` violates top-of-file import rule | | 8 | `CHANGELOG.md` | Duplicate `### Fixed` header; entry references `#9222` (PR) instead of `#9128` (issue) | | 9 | PR metadata | No `Type/` label applied; milestone is v3.9.0 but issue #9128 is v3.2.0; `docs/development/ci-cd.md` not updated | See the formal review for full details and fix instructions for each item. --- Automated by CleverAgents Bot Supervisor: PR Review | Agent: pr-review-worker

HAL9000 referenced this pull request

2026-06-02 12:44:35 +00:00

[AUTO-DOCS-8] Fix automation profile default documentation (supervised per ADR-017) #9224

HAL9000 referenced this pull request

2026-06-02 12:56:09 +00:00

ci(pipeline): guard integration/e2e jobs when LLM secrets unavailable #9222

HAL9000 added the controller-managed label 2026-06-10 05:16:56 +00:00

HAL9000 commented

2026-06-10 05:53:58 +00:00

[CONTROLLER-DEFER:Gate 1:full_duplicate]

This PR has been deferred for re-evaluation. The controller has stepped back
from processing it. To resume, a human or scope-evaluator must clear the
deferral flag AND re-add the auto/sentinel label.

Decision:

Gate: Gate 1
Reason category: full_duplicate
Canonical: ci(pipeline): guard integration/e2e jobs when LLM secrets unavailable (#9222)
LLM confidence: high
LLM reasoning: PR #10996 (anchor) and PR #9222 both have identical titles addressing the same CI pipeline change: guarding integration/e2e jobs when LLM secrets unavailable. The anchor's branch name explicitly references #9222 (fix/9222-guard-integration-e2e-jobs), indicating this is a re-attempt of the same fix. Both implement graceful skip logic and flexible status-check requirements. PR #9222 is the original attempt (lower PR number) and serves as the canonical. The anchor is a full duplicate with no unique substantive improvements.

To clear the deferral (SQL):
UPDATE workflows SET deferred_reason=NULL,
deferred_at=NULL,
deferred_target_workflow_id=NULL
WHERE workflow_id = 402;

INSERT INTO controller_events
  (workflow_id, ts, event_type, payload, cause, forgejo_write_pending, replay_attempts)
VALUES (402, datetime('now'), 'deferral_cleared',
        json_object('cleared_by', 'operator', 'reason', '<your reason>'),
        'operator', 0, 0);

Audit ID: 142169

Automated by the CleverAgents controller pipeline.
Identity: HAL9000 (pipeline action)

[CONTROLLER-DEFER:Gate 1:full_duplicate] This PR has been deferred for re-evaluation. The controller has stepped back from processing it. To resume, a human or scope-evaluator must clear the deferral flag AND re-add the auto/sentinel label. Decision: - Gate: Gate 1 - Reason category: full_duplicate - Canonical: #9222 - LLM confidence: high - LLM reasoning: PR #10996 (anchor) and PR #9222 both have identical titles addressing the same CI pipeline change: guarding integration/e2e jobs when LLM secrets unavailable. The anchor's branch name explicitly references #9222 (`fix/9222-guard-integration-e2e-jobs`), indicating this is a re-attempt of the same fix. Both implement graceful skip logic and flexible status-check requirements. PR #9222 is the original attempt (lower PR number) and serves as the canonical. The anchor is a full duplicate with no unique substantive improvements. To clear the deferral (SQL): UPDATE workflows SET deferred_reason=NULL, deferred_at=NULL, deferred_target_workflow_id=NULL WHERE workflow_id = 402; INSERT INTO controller_events (workflow_id, ts, event_type, payload, cause, forgejo_write_pending, replay_attempts) VALUES (402, datetime('now'), 'deferral_cleared', json_object('cleared_by', 'operator', 'reason', '<your reason>'), 'operator', 0, 0); Audit ID: 142169 --- Automated by the CleverAgents controller pipeline. Identity: HAL9000 (pipeline action)

HAL9000 added the auto/needs-reevaluation

2026-06-11 00:20:54 +00:00

labels 2026-06-10 05:54:41 +00:00

drew referenced this issue from a commit

ci: stop master workflow on PR updates

drew added 1 commit 2026-06-11 00:20:54 +00:00

ci: stop master workflow on PR updates

ci.yml / ci: stop master workflow on PR updates (push) Failing after 0s

Details

ci.yml / ci: stop master workflow on PR updates (pull_request) Failing after 0s

Details

d4483a4fca

Remove the stale pull_request trigger from master.yml so PR branch commits do not launch the master workflow.

Maintenance patch for PR #10996.

HAL9000 removed the

2026-06-11 09:38:26 +00:00

label 2026-06-11 04:18:09 +00:00

HAL9000 added 1 commit 2026-06-11 07:44:28 +00:00

chore: re-trigger CI [controller]

ci.yml / chore: re-trigger CI [controller] (push) Failing after 0s

Details

ci.yml / chore: re-trigger CI [controller] (pull_request) Failing after 0s

Details

df21636d28

HAL9000 commented

📋 Estimate: tier 1.

Multi-file change (+321/-7) touching CI YAML (format-sensitive), new BDD feature file, new step definitions, CHANGELOG, and CONTRIBUTORS. Adds conditional guard logic in the ci.yml workflow plus a matching status-check policy change. Per calibration data, YAML and test-additive work consistently regress at tier 0. Cross-file context needed to align the guard conditions with the status-check job logic. Straightforward in intent but the combination of format-sensitive YAML editing and new BDD test scaffolding warrants tier 1.

**📋 Estimate: tier 1.** Multi-file change (+321/-7) touching CI YAML (format-sensitive), new BDD feature file, new step definitions, CHANGELOG, and CONTRIBUTORS. Adds conditional guard logic in the ci.yml workflow plus a matching status-check policy change. Per calibration data, YAML and test-additive work consistently regress at tier 0. Cross-file context needed to align the guard conditions with the status-check job logic. Straightforward in intent but the combination of format-sensitive YAML editing and new BDD test scaffolding warrants tier 1.

HAL9000 added 1 commit 2026-06-11 20:10:41 +00:00

fix(ci): address reviewer feedback on guard steps and BDD tests

CI / push-validation (pull_request) Successful in 36s

Details

CI / helm (pull_request) Successful in 42s

Details

CI / build (pull_request) Successful in 45s

Details

CI / lint (pull_request) Successful in 1m20s

Details

CI / quality (pull_request) Successful in 1m26s

Details

CI / security (pull_request) Successful in 1m27s

Details

CI / typecheck (pull_request) Successful in 1m45s

Details

CI / unit_tests (pull_request) Failing after 2m0s

Details

CI / coverage (pull_request) Has been skipped

Details

CI / docker (pull_request) Has been skipped

Details

CI / e2e_tests (pull_request) Successful in 3m40s

Details

CI / integration_tests (pull_request) Failing after 3m47s

Details

CI / status-check (pull_request) Failing after 3s

Details

58d3c0f4e1

- Fix YAML indentation on status-check job key (3-space → 4-space)
- Add id: secret-guard to integration_tests and e2e_tests guard steps
- Replace exit 0 with GITHUB_OUTPUT secrets_present output so subsequent
  nox steps can be conditionally skipped via if: expressions
- Add if: steps.secret-guard.outputs.secrets_present == 'true' to nox
  run steps and artifact upload steps in both jobs
- Replace broken needs.$job.result for-loop with explicit per-job
  expressions (needs.integration_tests.result / needs.e2e_tests.result)
- Merge duplicate ### Fixed sections in CHANGELOG [Unreleased]; fix
  issue reference from #9222 to #9128
- Add Background: block to BDD feature so all scenarios share the
  Given setup step; update guard scenarios to match new mechanism
- Remove unused import re from BDD step file; update step assertions
  to verify GITHUB_OUTPUT writes and explicit result expressions

ISSUES CLOSED: #9128

HAL9000 removed the controller-managed label 2026-06-12 18:49:27 +00:00

HAL9000 added the

label 2026-06-12 19:11:05 +00:00

HAL9000 added controller-managed and removed