UAT: Specification Workflow Examples 1, 9, 10, 11, 13, and 15 have no Robot Framework integration tests #4072

Open
opened 2026-04-06 09:56:31 +00:00 by freemo · 0 comments
Owner

Metadata

  • Branch: feat/integration-tests-missing-workflows
  • Commit Message: test(robot): add integration tests for workflow examples 1, 9, 10, 11, 13, 15
  • Milestone: (none — backlog)
  • Parent Epic: (Integration Testing Epic)

Background and Context

CONTRIBUTING.md (line 53–54) states:

Multi-Level Testing Mandate: Every coding task must include or update tests at multiple levels: unit tests, integration tests, and performance benchmarks. Testing is non-optional and is part of the definition of done for any task.

The specification (docs/specification.md) defines 18 workflow examples (Examples 1–18) that represent the primary user-facing workflows. Each workflow example should have a corresponding Robot Framework integration test to verify the end-to-end behavior.

What Was Tested

Cross-reference of all 18 workflow examples in docs/specification.md against the robot/ directory for corresponding integration test files.

Expected Behavior

All 18 workflow examples defined in the specification should have corresponding Robot Framework integration tests in the robot/ directory.

Actual Behavior

6 out of 18 workflow examples (33%) have NO integration tests at all:

Example Title Integration Test Status
Example 1 Hello World — Fix a Single Bug None MISSING
Example 2 Automated Test Generation robot/wf02_test_generation_integration.robot
Example 3 Multi-File Refactoring with Invariants None MISSING
Example 4 Multi-Project Dependency Update robot/wf04_multi_project_dependency.robot
Example 5 Database Schema Migration robot/int_wf05_db_migration.robot
Example 6 Writing Technical Documentation robot/int_wf06_doc_generation.robot
Example 7 CI/CD Integration robot/wf07_cicd_integration.robot
Example 8 Cloud Infrastructure Management robot/wf08_cloud_infra_supervised.robot
Example 9 Session-Driven Interactive Exploration None MISSING
Example 10 Full-Auto Batch Operations None MISSING
Example 11 Complex Graph Actor for Multi-Stage Code Review None MISSING
Example 12 Large-Scale Feature Implementation robot/large_project_decompose.robot
Example 13 Custom Automation Profile with Semantic Escalation None MISSING
Example 14 Server Mode — Team Collaboration robot/wf14_server_mode_integration.robot
Example 15 Disaster Recovery — Rollback a Failed Apply None MISSING
Example 16 Devcontainer-Driven Development robot/devcontainer_lifecycle.robot
Example 17 Explicit Container with Directory Mount robot/e2e/wf17_explicit_container.robot
Example 18 Container with Remote Repo Clone robot/e2e/wf18_container_clone.robot

Missing Workflow Details

Example 1: Hello World — Fix a Single Bug

  • The most fundamental workflow: agents initresource add git-checkoutproject createvalidation addaction createplan useplan executeplan diffplan apply
  • This is the primary "getting started" workflow and has NO integration test
  • Critical because it exercises the complete plan lifecycle with a real git repository resource

Example 3: Multi-File Refactoring with Invariants

  • Exercises invariant registration, invariant checking during plan execution, and correction flows
  • No integration test exists for the invariant-gated workflow

Example 9: Session-Driven Interactive Exploration

  • Exercises agents session create, agents session list, agents session show, and session-based plan execution
  • No integration test exists for session management workflows

Example 10: Full-Auto Batch Operations

  • Exercises the trusted automation profile for fully automated plan execution without human approval
  • No integration test exists for the trusted/full-auto automation profile workflow

Example 11: Complex Graph Actor for Multi-Stage Code Review

  • Exercises custom actor configuration with graph-based execution
  • No integration test exists for complex actor workflows

Example 13: Custom Automation Profile with Semantic Escalation

  • Exercises custom automation profile creation and semantic escalation triggers
  • No integration test exists for custom automation profile workflows

Example 15: Disaster Recovery — Rollback a Failed Apply

  • Exercises agents plan rollback with a checkpoint ID to revert a failed apply
  • No integration test exists for the rollback/disaster recovery workflow
  • This is particularly critical given the existing bug #4068 about plan rollback accepting missing CHECKPOINT_ID

Impact

Without integration tests for these workflows:

  • Regressions in these critical user workflows go undetected until they reach production
  • The agents init command (Example 1) has no integration test — the most basic user action
  • Session management (Example 9) has no integration test
  • The rollback/disaster recovery workflow (Example 15) has no integration test, which is especially concerning given the existing rollback bug

Subtasks

  • Create robot/wf01_hello_world.robot + robot/helper_wf01_hello_world.py for Example 1 (init → resource → project → validation → action → plan lifecycle)
  • Create robot/wf03_refactoring_invariants.robot + helper for Example 3 (invariant-gated refactoring)
  • Create robot/wf09_session_exploration.robot + helper for Example 9 (session create/list/show/use)
  • Create robot/wf10_full_auto_batch.robot + helper for Example 10 (trusted automation profile)
  • Create robot/wf11_graph_actor_review.robot + helper for Example 11 (complex graph actor)
  • Create robot/wf13_custom_automation_profile.robot + helper for Example 13 (custom profile + semantic escalation)
  • Create robot/wf15_disaster_recovery.robot + helper for Example 15 (plan rollback)
  • All new integration tests must use real services (no mocks) per CONTRIBUTING.md
  • Run nox -s integration_tests to verify all new tests pass

Definition of Done

  • All 18 workflow examples have corresponding Robot Framework integration tests
  • All new tests use real CLI subprocess invocations with real SQLite database
  • nox -s integration_tests passes
  • Coverage >= 97%

Supporting Information

  • Spec reference: docs/specification.md lines 36260–46597 (Workflow Examples section)
  • Correct pattern: robot/helper_m1_e2e_verification.py — uses run_cli() with real subprocess and real database
  • Related bug: #4068agents plan rollback missing CHECKPOINT_ID validation (Example 15 would catch this)

Automated by CleverAgents Bot
Supervisor: UAT Testing | Agent: ca-uat-tester

## Metadata - **Branch**: `feat/integration-tests-missing-workflows` - **Commit Message**: `test(robot): add integration tests for workflow examples 1, 9, 10, 11, 13, 15` - **Milestone**: (none — backlog) - **Parent Epic**: (Integration Testing Epic) ## Background and Context `CONTRIBUTING.md` (line 53–54) states: > **Multi-Level Testing Mandate:** Every coding task must include or update tests at multiple levels: unit tests, integration tests, and performance benchmarks. Testing is non-optional and is part of the definition of done for any task. The specification (`docs/specification.md`) defines 18 workflow examples (Examples 1–18) that represent the primary user-facing workflows. Each workflow example should have a corresponding Robot Framework integration test to verify the end-to-end behavior. ## What Was Tested Cross-reference of all 18 workflow examples in `docs/specification.md` against the `robot/` directory for corresponding integration test files. ## Expected Behavior All 18 workflow examples defined in the specification should have corresponding Robot Framework integration tests in the `robot/` directory. ## Actual Behavior **6 out of 18 workflow examples (33%) have NO integration tests at all:** | Example | Title | Integration Test | Status | |---------|-------|-----------------|--------| | Example 1 | Hello World — Fix a Single Bug | None | ❌ MISSING | | Example 2 | Automated Test Generation | `robot/wf02_test_generation_integration.robot` | ✅ | | Example 3 | Multi-File Refactoring with Invariants | None | ❌ MISSING | | Example 4 | Multi-Project Dependency Update | `robot/wf04_multi_project_dependency.robot` | ✅ | | Example 5 | Database Schema Migration | `robot/int_wf05_db_migration.robot` | ✅ | | Example 6 | Writing Technical Documentation | `robot/int_wf06_doc_generation.robot` | ✅ | | Example 7 | CI/CD Integration | `robot/wf07_cicd_integration.robot` | ✅ | | Example 8 | Cloud Infrastructure Management | `robot/wf08_cloud_infra_supervised.robot` | ✅ | | Example 9 | Session-Driven Interactive Exploration | None | ❌ MISSING | | Example 10 | Full-Auto Batch Operations | None | ❌ MISSING | | Example 11 | Complex Graph Actor for Multi-Stage Code Review | None | ❌ MISSING | | Example 12 | Large-Scale Feature Implementation | `robot/large_project_decompose.robot` | ✅ | | Example 13 | Custom Automation Profile with Semantic Escalation | None | ❌ MISSING | | Example 14 | Server Mode — Team Collaboration | `robot/wf14_server_mode_integration.robot` | ✅ | | Example 15 | Disaster Recovery — Rollback a Failed Apply | None | ❌ MISSING | | Example 16 | Devcontainer-Driven Development | `robot/devcontainer_lifecycle.robot` | ✅ | | Example 17 | Explicit Container with Directory Mount | `robot/e2e/wf17_explicit_container.robot` | ✅ | | Example 18 | Container with Remote Repo Clone | `robot/e2e/wf18_container_clone.robot` | ✅ | ### Missing Workflow Details **Example 1: Hello World — Fix a Single Bug** - The most fundamental workflow: `agents init` → `resource add git-checkout` → `project create` → `validation add` → `action create` → `plan use` → `plan execute` → `plan diff` → `plan apply` - This is the primary "getting started" workflow and has NO integration test - Critical because it exercises the complete plan lifecycle with a real git repository resource **Example 3: Multi-File Refactoring with Invariants** - Exercises invariant registration, invariant checking during plan execution, and correction flows - No integration test exists for the invariant-gated workflow **Example 9: Session-Driven Interactive Exploration** - Exercises `agents session create`, `agents session list`, `agents session show`, and session-based plan execution - No integration test exists for session management workflows **Example 10: Full-Auto Batch Operations** - Exercises the `trusted` automation profile for fully automated plan execution without human approval - No integration test exists for the trusted/full-auto automation profile workflow **Example 11: Complex Graph Actor for Multi-Stage Code Review** - Exercises custom actor configuration with graph-based execution - No integration test exists for complex actor workflows **Example 13: Custom Automation Profile with Semantic Escalation** - Exercises custom automation profile creation and semantic escalation triggers - No integration test exists for custom automation profile workflows **Example 15: Disaster Recovery — Rollback a Failed Apply** - Exercises `agents plan rollback` with a checkpoint ID to revert a failed apply - No integration test exists for the rollback/disaster recovery workflow - This is particularly critical given the existing bug #4068 about `plan rollback` accepting missing CHECKPOINT_ID ## Impact Without integration tests for these workflows: - Regressions in these critical user workflows go undetected until they reach production - The `agents init` command (Example 1) has no integration test — the most basic user action - Session management (Example 9) has no integration test - The rollback/disaster recovery workflow (Example 15) has no integration test, which is especially concerning given the existing rollback bug ## Subtasks - [ ] Create `robot/wf01_hello_world.robot` + `robot/helper_wf01_hello_world.py` for Example 1 (init → resource → project → validation → action → plan lifecycle) - [ ] Create `robot/wf03_refactoring_invariants.robot` + helper for Example 3 (invariant-gated refactoring) - [ ] Create `robot/wf09_session_exploration.robot` + helper for Example 9 (session create/list/show/use) - [ ] Create `robot/wf10_full_auto_batch.robot` + helper for Example 10 (trusted automation profile) - [ ] Create `robot/wf11_graph_actor_review.robot` + helper for Example 11 (complex graph actor) - [ ] Create `robot/wf13_custom_automation_profile.robot` + helper for Example 13 (custom profile + semantic escalation) - [ ] Create `robot/wf15_disaster_recovery.robot` + helper for Example 15 (plan rollback) - [ ] All new integration tests must use real services (no mocks) per CONTRIBUTING.md - [ ] Run `nox -s integration_tests` to verify all new tests pass ## Definition of Done - [ ] All 18 workflow examples have corresponding Robot Framework integration tests - [ ] All new tests use real CLI subprocess invocations with real SQLite database - [ ] `nox -s integration_tests` passes - [ ] Coverage >= 97% ## Supporting Information - **Spec reference**: `docs/specification.md` lines 36260–46597 (Workflow Examples section) - **Correct pattern**: `robot/helper_m1_e2e_verification.py` — uses `run_cli()` with real subprocess and real database - **Related bug**: #4068 — `agents plan rollback` missing CHECKPOINT_ID validation (Example 15 would catch this) --- **Automated by CleverAgents Bot** Supervisor: UAT Testing | Agent: ca-uat-tester
HAL9000 added this to the v3.5.0 milestone 2026-04-09 03:11:20 +00:00
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
cleveragents/cleveragents-core#4072
No description provided.