feat/multi-agent #5

					
				@ -0,0 +2,4 @@

				A powerful, reactive Agent Framework using RxPy streams for complex AI agent orchestration and message routing.

				[![PyPI Package](https://img.shields.io/pypi/v/cleveragents.svg)](https://pypi.org/project/cleveragents/)

brent.edwards commented

2025-10-03 17:08:15 +00:00

This leads to a 404 error.

aditya commented

2025-10-17 18:11:34 +00:00

@brent.edwards The 404 link has been removed.

brent.edwards marked this conversation as resolved

README.md

					
				@ -0,0 +3,4 @@

				A powerful, reactive Agent Framework using RxPy streams for complex AI agent orchestration and message routing.

				[![PyPI Package](https://img.shields.io/pypi/v/cleveragents.svg)](https://pypi.org/project/cleveragents/)

				[![Travis-CI Build Status](https://img.shields.io/travis/cleverthis/cleveragents.svg)](https://travis-ci.org/cleverthis/cleveragents)

brent.edwards commented

2025-10-03 17:10:34 +00:00

This also leads to a 404 error.

aditya commented

2025-10-17 18:11:52 +00:00

@brent.edwards The 404 link has been removed.

brent.edwards marked this conversation as resolved

README.md Outdated

					
				@ -0,0 +29,4 @@

				### Install from PyPI

				```bash

				pip install cleveragents

brent.edwards commented

2025-10-03 17:26:21 +00:00

Here's what happens when I try this command:

❯ pip install cleveragents
ERROR: Could not find a version that satisfies the requirement cleveragents (from versions: none)

[notice] A new release of pip is available: 25.0.1 -> 25.2
[notice] To update, run: pip install --upgrade pip
ERROR: No matching distribution found for cleveragents

Here's what happens when I try this command: ``` ❯ pip install cleveragents ERROR: Could not find a version that satisfies the requirement cleveragents (from versions: none) [notice] A new release of pip is available: 25.0.1 -> 25.2 [notice] To update, run: pip install --upgrade pip ERROR: No matching distribution found for cleveragents ```

aditya commented

2025-10-17 18:12:57 +00:00

@brent.edwards updated the install instructions in README.md

brent.edwards marked this conversation as resolved

README.md

					
				@ -0,0 +210,4 @@

				      provider: openai

				      model: gpt-4

				      api_key: "sk-your-key-here"

				```

brent.edwards commented

2025-10-03 21:07:28 +00:00

Information about configuration files can already be found in lines 92-110.

aditya commented

2025-10-17 18:14:13 +00:00

@brent.edwards removed the duplicate information.

brent.edwards marked this conversation as resolved

README.md Outdated

					
				@ -0,0 +273,4 @@

				```bash

				# Run all tests

				python -m behave tests/features

brent.edwards commented

2025-10-03 21:11:29 +00:00

Bug in the software:

With the installation as above, this command fails.

To fix it, we need to type pip install behave.

When I run the tests, I get the following results:

Failing scenarios:
  tests/features/application_missing_lines_coverage.feature:154  Subscription re-setup after operations lines 644-648
  tests/features/tool_agent.feature:84  Tool Agent shell command execution

73 features passed, 2 failed, 0 skipped
1217 scenarios passed, 2 failed, 0 skipped
6559 steps passed, 2 failed, 0 skipped
Took 0min 49.345s

The tests need to pass before we can move forward.

Another question --

Not all of the tests are in the format for behave. Examples include:

test_json_sanitization.py
scripts/test_single_turn_writer.sh
test_multi_agent_interactive.sh

I can't see how the one line runs all of the tests.

Bug in the software: With the installation as above, this command fails. To fix it, we need to type `pip install behave`. When I run the tests, I get the following results: ``` Failing scenarios: tests/features/application_missing_lines_coverage.feature:154 Subscription re-setup after operations lines 644-648 tests/features/tool_agent.feature:84 Tool Agent shell command execution 73 features passed, 2 failed, 0 skipped 1217 scenarios passed, 2 failed, 0 skipped 6559 steps passed, 2 failed, 0 skipped Took 0min 49.345s ``` The tests need to pass before we can move forward. Another question -- Not all of the tests are in the format for `behave`. Examples include: * `test_json_sanitization.py` * `scripts/test_single_turn_writer.sh` * `test_multi_agent_interactive.sh` I can't see how the one line runs all of the tests.

brent.edwards commented

2025-10-06 20:11:17 +00:00

Update:

When I run the tests, I still see one failing:

Failing scenarios:
  tests/features/tool_agent.feature:84  Tool Agent shell command execution

74 features passed, 1 failed, 0 skipped
1218 scenarios passed, 1 failed, 0 skipped
6560 steps passed, 1 failed, 0 skipped
Took 0min 49.185s

Update: When I run the tests, I still see one failing: ``` Failing scenarios: tests/features/tool_agent.feature:84 Tool Agent shell command execution 74 features passed, 1 failed, 0 skipped 1218 scenarios passed, 1 failed, 0 skipped 6560 steps passed, 1 failed, 0 skipped Took 0min 49.185s ```

brent.edwards commented

2025-10-09 20:56:17 +00:00

Hi Aditya --

When I run python -m behave tests/features/, here is the summary that I now get:

Failing scenarios:
  tests/features/application_direct_coverage.feature:8  Direct prompt template processing
  tests/features/tool_agent.feature:84  Tool Agent shell command execution

Errored scenarios:
  tests/features/application_direct_coverage.feature:13  Direct single-shot with message content
  tests/features/application_direct_coverage.feature:18  Direct interactive session setup
  tests/features/application_direct_coverage.feature:23  Direct application disposal
  tests/features/application_direct_coverage.feature:28  Direct visualization generation
  tests/features/application_direct_coverage.feature:33  Direct configuration conversion
  tests/features/application_direct_coverage.feature:38  Direct template registration
  tests/features/application_direct_coverage.feature:43  Direct agent creation paths
  tests/features/application_direct_coverage.feature:48  Direct route setup
  tests/features/application_direct_coverage.feature:53  Direct stream operations
  tests/features/application_direct_coverage.feature:58  Direct interactive session full workflow
  tests/features/application_direct_coverage.feature:63  Direct timeout and error handling
  tests/features/application_direct_coverage.feature:68  Direct complex template processing
  tests/features/application_missing_lines_coverage.feature:10  Single-shot with no config loaded error on line 220
  tests/features/application_missing_lines_coverage.feature:15  Single-shot with None message content handling lines 231-236
  tests/features/application_missing_lines_coverage.feature:20  Single-shot with message without content attribute lines 234-236
  tests/features/application_missing_lines_coverage.feature:25  Interactive session with no config loaded error on line 288
  tests/features/application_missing_lines_coverage.feature:30  Interactive help command lines 320-321
  tests/features/application_missing_lines_coverage.feature:35  Interactive stream command processing lines 322-324
  tests/features/application_missing_lines_coverage.feature:40  Interactive graph command processing lines 325-327
  tests/features/application_missing_lines_coverage.feature:45  Interactive empty input handling lines 328-329
  tests/features/application_missing_lines_coverage.feature:50  Interactive keyboard interrupt handling lines 340-342
  tests/features/application_missing_lines_coverage.feature:55  Interactive EOF handling lines 343-344
  tests/features/application_missing_lines_coverage.feature:60  Config to dict conversion with no config line 358-359
  tests/features/application_missing_lines_coverage.feature:65  Template registration with no config early return line 376-377
  tests/features/application_missing_lines_coverage.feature:70  Template registration with non-dict templates lines 384-385
  tests/features/application_missing_lines_coverage.feature:75  Enhanced registry with raw template processing lines 414-441
  tests/features/application_missing_lines_coverage.feature:80  Agent creation error with no factory line 455-456
  tests/features/application_missing_lines_coverage.feature:85  Agent creation with enhanced registry template instances line 474-491
  tests/features/application_missing_lines_coverage.feature:90  Agent creation with regular registry fallback lines 490-491
  tests/features/application_missing_lines_coverage.feature:95  Agent creation with regular registry instantiation lines 495-498
  tests/features/application_missing_lines_coverage.feature:100  Agent creation with no instantiate capability lines 501-502
  tests/features/application_missing_lines_coverage.feature:105  Route setup with no routes early return lines 518-519
  tests/features/application_missing_lines_coverage.feature:110  Route setup bridge initialization lines 522-527
  tests/features/application_missing_lines_coverage.feature:115  Route template with no registry fallback lines 538-544
  tests/features/application_missing_lines_coverage.feature:120  Stream route template field updates lines 548-561
  tests/features/application_missing_lines_coverage.feature:125  Graph route template field updates lines 562-567
  tests/features/application_missing_lines_coverage.feature:130  State class resolution failure lines 582-591
  tests/features/application_missing_lines_coverage.feature:135  Bridge route registration lines 607-609
  tests/features/application_missing_lines_coverage.feature:140  Stream operations with no config early return line 615-616
  tests/features/application_missing_lines_coverage.feature:145  Merge operations processing lines 619-624
  tests/features/application_missing_lines_coverage.feature:150  Split operations processing lines 627-632
  tests/features/application_missing_lines_coverage.feature:155  Subscription re-setup after operations lines 644-648
  tests/features/application_missing_lines_coverage.feature:160  Pipeline setup with no config early return line 654-655
  tests/features/application_missing_lines_coverage.feature:165  Pipeline configuration conversion lines 658-667
  tests/features/application_missing_lines_coverage.feature:170  Stream command with invalid stream lines 682-683
  tests/features/application_missing_lines_coverage.feature:175  Graph command with invalid graph lines 702-703
  tests/features/application_missing_lines_coverage.feature:180  Graph command usage error line 720
  tests/features/application_missing_lines_coverage.feature:185  Graph execution with no messages lines 736-739
  tests/features/application_missing_lines_coverage.feature:190  Graph execution error handling lines 741-742
  tests/features/application_missing_lines_coverage.feature:195  Application disposal lines 746-747
  tests/features/application_missing_lines_coverage.feature:200  Network visualization with complex configuration lines 760-795
  tests/features/application_missing_lines_coverage.feature:205  Network visualization with unsupported format lines 798

72 features passed, 2 failed, 1 error, 0 skipped
1171 scenarios passed, 2 failed, 52 error, 0 skipped
6449 steps passed, 2 failed, 52 error, 104 skipped
Took 0min 43.403s

Hi Aditya -- When I run `python -m behave tests/features/`, here is the summary that I now get: ``` Failing scenarios: tests/features/application_direct_coverage.feature:8 Direct prompt template processing tests/features/tool_agent.feature:84 Tool Agent shell command execution Errored scenarios: tests/features/application_direct_coverage.feature:13 Direct single-shot with message content tests/features/application_direct_coverage.feature:18 Direct interactive session setup tests/features/application_direct_coverage.feature:23 Direct application disposal tests/features/application_direct_coverage.feature:28 Direct visualization generation tests/features/application_direct_coverage.feature:33 Direct configuration conversion tests/features/application_direct_coverage.feature:38 Direct template registration tests/features/application_direct_coverage.feature:43 Direct agent creation paths tests/features/application_direct_coverage.feature:48 Direct route setup tests/features/application_direct_coverage.feature:53 Direct stream operations tests/features/application_direct_coverage.feature:58 Direct interactive session full workflow tests/features/application_direct_coverage.feature:63 Direct timeout and error handling tests/features/application_direct_coverage.feature:68 Direct complex template processing tests/features/application_missing_lines_coverage.feature:10 Single-shot with no config loaded error on line 220 tests/features/application_missing_lines_coverage.feature:15 Single-shot with None message content handling lines 231-236 tests/features/application_missing_lines_coverage.feature:20 Single-shot with message without content attribute lines 234-236 tests/features/application_missing_lines_coverage.feature:25 Interactive session with no config loaded error on line 288 tests/features/application_missing_lines_coverage.feature:30 Interactive help command lines 320-321 tests/features/application_missing_lines_coverage.feature:35 Interactive stream command processing lines 322-324 tests/features/application_missing_lines_coverage.feature:40 Interactive graph command processing lines 325-327 tests/features/application_missing_lines_coverage.feature:45 Interactive empty input handling lines 328-329 tests/features/application_missing_lines_coverage.feature:50 Interactive keyboard interrupt handling lines 340-342 tests/features/application_missing_lines_coverage.feature:55 Interactive EOF handling lines 343-344 tests/features/application_missing_lines_coverage.feature:60 Config to dict conversion with no config line 358-359 tests/features/application_missing_lines_coverage.feature:65 Template registration with no config early return line 376-377 tests/features/application_missing_lines_coverage.feature:70 Template registration with non-dict templates lines 384-385 tests/features/application_missing_lines_coverage.feature:75 Enhanced registry with raw template processing lines 414-441 tests/features/application_missing_lines_coverage.feature:80 Agent creation error with no factory line 455-456 tests/features/application_missing_lines_coverage.feature:85 Agent creation with enhanced registry template instances line 474-491 tests/features/application_missing_lines_coverage.feature:90 Agent creation with regular registry fallback lines 490-491 tests/features/application_missing_lines_coverage.feature:95 Agent creation with regular registry instantiation lines 495-498 tests/features/application_missing_lines_coverage.feature:100 Agent creation with no instantiate capability lines 501-502 tests/features/application_missing_lines_coverage.feature:105 Route setup with no routes early return lines 518-519 tests/features/application_missing_lines_coverage.feature:110 Route setup bridge initialization lines 522-527 tests/features/application_missing_lines_coverage.feature:115 Route template with no registry fallback lines 538-544 tests/features/application_missing_lines_coverage.feature:120 Stream route template field updates lines 548-561 tests/features/application_missing_lines_coverage.feature:125 Graph route template field updates lines 562-567 tests/features/application_missing_lines_coverage.feature:130 State class resolution failure lines 582-591 tests/features/application_missing_lines_coverage.feature:135 Bridge route registration lines 607-609 tests/features/application_missing_lines_coverage.feature:140 Stream operations with no config early return line 615-616 tests/features/application_missing_lines_coverage.feature:145 Merge operations processing lines 619-624 tests/features/application_missing_lines_coverage.feature:150 Split operations processing lines 627-632 tests/features/application_missing_lines_coverage.feature:155 Subscription re-setup after operations lines 644-648 tests/features/application_missing_lines_coverage.feature:160 Pipeline setup with no config early return line 654-655 tests/features/application_missing_lines_coverage.feature:165 Pipeline configuration conversion lines 658-667 tests/features/application_missing_lines_coverage.feature:170 Stream command with invalid stream lines 682-683 tests/features/application_missing_lines_coverage.feature:175 Graph command with invalid graph lines 702-703 tests/features/application_missing_lines_coverage.feature:180 Graph command usage error line 720 tests/features/application_missing_lines_coverage.feature:185 Graph execution with no messages lines 736-739 tests/features/application_missing_lines_coverage.feature:190 Graph execution error handling lines 741-742 tests/features/application_missing_lines_coverage.feature:195 Application disposal lines 746-747 tests/features/application_missing_lines_coverage.feature:200 Network visualization with complex configuration lines 760-795 tests/features/application_missing_lines_coverage.feature:205 Network visualization with unsupported format lines 798 72 features passed, 2 failed, 1 error, 0 skipped 1171 scenarios passed, 2 failed, 52 error, 0 skipped 6449 steps passed, 2 failed, 52 error, 104 skipped Took 0min 43.403s ```

brent.edwards commented

2025-10-10 20:04:45 +00:00

Hi Aditya --

When I run python -m behave tests/features now, this is what I get:

Failing scenarios:
  tests/features/application_direct_coverage.feature:8  Direct prompt template processing
  tests/features/tool_agent.feature:84  Tool Agent shell command execution
  tests/features/tool_agent.feature:608  Tool agent file write append mode

Errored scenarios:
  tests/features/application_direct_coverage.feature:13  Direct single-shot with message content
  tests/features/application_direct_coverage.feature:18  Direct interactive session setup
  tests/features/application_direct_coverage.feature:23  Direct application disposal
  tests/features/application_direct_coverage.feature:28  Direct visualization generation
  tests/features/application_direct_coverage.feature:33  Direct configuration conversion
  tests/features/application_direct_coverage.feature:38  Direct template registration
  tests/features/application_direct_coverage.feature:43  Direct agent creation paths
  tests/features/application_direct_coverage.feature:48  Direct route setup
  tests/features/application_direct_coverage.feature:53  Direct stream operations
  tests/features/application_direct_coverage.feature:58  Direct interactive session full workflow
  tests/features/application_direct_coverage.feature:63  Direct timeout and error handling
  tests/features/application_direct_coverage.feature:68  Direct complex template processing
  tests/features/application_missing_lines_coverage.feature:10  Single-shot with no config loaded error on line 220
  tests/features/application_missing_lines_coverage.feature:15  Single-shot with None message content handling lines 231-236
  tests/features/application_missing_lines_coverage.feature:20  Single-shot with message without content attribute lines 234-236
  tests/features/application_missing_lines_coverage.feature:25  Interactive session with no config loaded error on line 288
  tests/features/application_missing_lines_coverage.feature:30  Interactive help command lines 320-321
  tests/features/application_missing_lines_coverage.feature:35  Interactive stream command processing lines 322-324
  tests/features/application_missing_lines_coverage.feature:40  Interactive graph command processing lines 325-327
  tests/features/application_missing_lines_coverage.feature:45  Interactive empty input handling lines 328-329
  tests/features/application_missing_lines_coverage.feature:50  Interactive keyboard interrupt handling lines 340-342
  tests/features/application_missing_lines_coverage.feature:55  Interactive EOF handling lines 343-344
  tests/features/application_missing_lines_coverage.feature:60  Config to dict conversion with no config line 358-359
  tests/features/application_missing_lines_coverage.feature:65  Template registration with no config early return line 376-377
  tests/features/application_missing_lines_coverage.feature:70  Template registration with non-dict templates lines 384-385
  tests/features/application_missing_lines_coverage.feature:75  Enhanced registry with raw template processing lines 414-441
  tests/features/application_missing_lines_coverage.feature:80  Agent creation error with no factory line 455-456
  tests/features/application_missing_lines_coverage.feature:85  Agent creation with enhanced registry template instances line 474-491
  tests/features/application_missing_lines_coverage.feature:90  Agent creation with regular registry fallback lines 490-491
  tests/features/application_missing_lines_coverage.feature:95  Agent creation with regular registry instantiation lines 495-498
  tests/features/application_missing_lines_coverage.feature:100  Agent creation with no instantiate capability lines 501-502
  tests/features/application_missing_lines_coverage.feature:105  Route setup with no routes early return lines 518-519
  tests/features/application_missing_lines_coverage.feature:110  Route setup bridge initialization lines 522-527
  tests/features/application_missing_lines_coverage.feature:115  Route template with no registry fallback lines 538-544
  tests/features/application_missing_lines_coverage.feature:120  Stream route template field updates lines 548-561
  tests/features/application_missing_lines_coverage.feature:125  Graph route template field updates lines 562-567
  tests/features/application_missing_lines_coverage.feature:130  State class resolution failure lines 582-591
  tests/features/application_missing_lines_coverage.feature:135  Bridge route registration lines 607-609
  tests/features/application_missing_lines_coverage.feature:140  Stream operations with no config early return line 615-616
  tests/features/application_missing_lines_coverage.feature:145  Merge operations processing lines 619-624
  tests/features/application_missing_lines_coverage.feature:150  Split operations processing lines 627-632
  tests/features/application_missing_lines_coverage.feature:155  Subscription re-setup after operations lines 644-648
  tests/features/application_missing_lines_coverage.feature:160  Pipeline setup with no config early return line 654-655
  tests/features/application_missing_lines_coverage.feature:165  Pipeline configuration conversion lines 658-667
  tests/features/application_missing_lines_coverage.feature:170  Stream command with invalid stream lines 682-683
  tests/features/application_missing_lines_coverage.feature:175  Graph command with invalid graph lines 702-703
  tests/features/application_missing_lines_coverage.feature:180  Graph command usage error line 720
  tests/features/application_missing_lines_coverage.feature:185  Graph execution with no messages lines 736-739
  tests/features/application_missing_lines_coverage.feature:190  Graph execution error handling lines 741-742
  tests/features/application_missing_lines_coverage.feature:195  Application disposal lines 746-747
  tests/features/application_missing_lines_coverage.feature:200  Network visualization with complex configuration lines 760-795
  tests/features/application_missing_lines_coverage.feature:205  Network visualization with unsupported format lines 798

72 features passed, 2 failed, 1 error, 0 skipped
1170 scenarios passed, 3 failed, 52 error, 0 skipped
6448 steps passed, 3 failed, 52 error, 104 skipped
Took 0min 42.813s

Hi Aditya -- When I run `python -m behave tests/features` now, this is what I get: ``` Failing scenarios: tests/features/application_direct_coverage.feature:8 Direct prompt template processing tests/features/tool_agent.feature:84 Tool Agent shell command execution tests/features/tool_agent.feature:608 Tool agent file write append mode Errored scenarios: tests/features/application_direct_coverage.feature:13 Direct single-shot with message content tests/features/application_direct_coverage.feature:18 Direct interactive session setup tests/features/application_direct_coverage.feature:23 Direct application disposal tests/features/application_direct_coverage.feature:28 Direct visualization generation tests/features/application_direct_coverage.feature:33 Direct configuration conversion tests/features/application_direct_coverage.feature:38 Direct template registration tests/features/application_direct_coverage.feature:43 Direct agent creation paths tests/features/application_direct_coverage.feature:48 Direct route setup tests/features/application_direct_coverage.feature:53 Direct stream operations tests/features/application_direct_coverage.feature:58 Direct interactive session full workflow tests/features/application_direct_coverage.feature:63 Direct timeout and error handling tests/features/application_direct_coverage.feature:68 Direct complex template processing tests/features/application_missing_lines_coverage.feature:10 Single-shot with no config loaded error on line 220 tests/features/application_missing_lines_coverage.feature:15 Single-shot with None message content handling lines 231-236 tests/features/application_missing_lines_coverage.feature:20 Single-shot with message without content attribute lines 234-236 tests/features/application_missing_lines_coverage.feature:25 Interactive session with no config loaded error on line 288 tests/features/application_missing_lines_coverage.feature:30 Interactive help command lines 320-321 tests/features/application_missing_lines_coverage.feature:35 Interactive stream command processing lines 322-324 tests/features/application_missing_lines_coverage.feature:40 Interactive graph command processing lines 325-327 tests/features/application_missing_lines_coverage.feature:45 Interactive empty input handling lines 328-329 tests/features/application_missing_lines_coverage.feature:50 Interactive keyboard interrupt handling lines 340-342 tests/features/application_missing_lines_coverage.feature:55 Interactive EOF handling lines 343-344 tests/features/application_missing_lines_coverage.feature:60 Config to dict conversion with no config line 358-359 tests/features/application_missing_lines_coverage.feature:65 Template registration with no config early return line 376-377 tests/features/application_missing_lines_coverage.feature:70 Template registration with non-dict templates lines 384-385 tests/features/application_missing_lines_coverage.feature:75 Enhanced registry with raw template processing lines 414-441 tests/features/application_missing_lines_coverage.feature:80 Agent creation error with no factory line 455-456 tests/features/application_missing_lines_coverage.feature:85 Agent creation with enhanced registry template instances line 474-491 tests/features/application_missing_lines_coverage.feature:90 Agent creation with regular registry fallback lines 490-491 tests/features/application_missing_lines_coverage.feature:95 Agent creation with regular registry instantiation lines 495-498 tests/features/application_missing_lines_coverage.feature:100 Agent creation with no instantiate capability lines 501-502 tests/features/application_missing_lines_coverage.feature:105 Route setup with no routes early return lines 518-519 tests/features/application_missing_lines_coverage.feature:110 Route setup bridge initialization lines 522-527 tests/features/application_missing_lines_coverage.feature:115 Route template with no registry fallback lines 538-544 tests/features/application_missing_lines_coverage.feature:120 Stream route template field updates lines 548-561 tests/features/application_missing_lines_coverage.feature:125 Graph route template field updates lines 562-567 tests/features/application_missing_lines_coverage.feature:130 State class resolution failure lines 582-591 tests/features/application_missing_lines_coverage.feature:135 Bridge route registration lines 607-609 tests/features/application_missing_lines_coverage.feature:140 Stream operations with no config early return line 615-616 tests/features/application_missing_lines_coverage.feature:145 Merge operations processing lines 619-624 tests/features/application_missing_lines_coverage.feature:150 Split operations processing lines 627-632 tests/features/application_missing_lines_coverage.feature:155 Subscription re-setup after operations lines 644-648 tests/features/application_missing_lines_coverage.feature:160 Pipeline setup with no config early return line 654-655 tests/features/application_missing_lines_coverage.feature:165 Pipeline configuration conversion lines 658-667 tests/features/application_missing_lines_coverage.feature:170 Stream command with invalid stream lines 682-683 tests/features/application_missing_lines_coverage.feature:175 Graph command with invalid graph lines 702-703 tests/features/application_missing_lines_coverage.feature:180 Graph command usage error line 720 tests/features/application_missing_lines_coverage.feature:185 Graph execution with no messages lines 736-739 tests/features/application_missing_lines_coverage.feature:190 Graph execution error handling lines 741-742 tests/features/application_missing_lines_coverage.feature:195 Application disposal lines 746-747 tests/features/application_missing_lines_coverage.feature:200 Network visualization with complex configuration lines 760-795 tests/features/application_missing_lines_coverage.feature:205 Network visualization with unsupported format lines 798 72 features passed, 2 failed, 1 error, 0 skipped 1170 scenarios passed, 3 failed, 52 error, 0 skipped 6448 steps passed, 3 failed, 52 error, 104 skipped Took 0min 42.813s ```

aditya commented

2025-10-17 18:27:33 +00:00

@brent.edwards

all behave tests and unit tests are passing now.
Also
scripts/test_single_turn_writer.sh
test_multi_agent_interactive.sh
files have been removed.

@brent.edwards all behave tests and unit tests are passing now. Also scripts/test_single_turn_writer.sh test_multi_agent_interactive.sh files have been removed.

brent.edwards commented

2025-10-20 19:57:32 +00:00

When I ran it locally, I got the following failure:

Failing scenarios:
  tests/features/tool_agent.feature:84  Tool Agent shell command execution

74 features passed, 1 failed, 0 skipped
1224 scenarios passed, 1 failed, 0 skipped
6606 steps passed, 1 failed, 0 skipped
Took 0min 49.820s

When you're back, we can work together to figure out the difference between our systems.

Thanks for fixing most of the problems!

When I ran it locally, I got the following failure: ``` Failing scenarios: tests/features/tool_agent.feature:84 Tool Agent shell command execution 74 features passed, 1 failed, 0 skipped 1224 scenarios passed, 1 failed, 0 skipped 6606 steps passed, 1 failed, 0 skipped Took 0min 49.820s ``` When you're back, we can work together to figure out the difference between our systems. Thanks for fixing most of the problems!

brent.edwards marked this conversation as resolved

README.md Outdated

					
				@ -0,0 +276,4 @@

				python -m behave tests/features

				# Run with coverage

				python -m pytest --cov=cleveragents

brent.edwards commented

2025-10-03 22:58:22 +00:00

pytest is not installed.
After I install pytest, I get the following problems:

 python -m pytest --cov=cleveragents
ERROR: usage: __main__.py [options] [file_or_dir] [file_or_dir] [...]
__main__.py: error: unrecognized arguments: --cov=cleveragents
  inifile: /home/brent.edwards/Workspace/cleveragents-core/setup.cfg
  rootdir: /home/brent.edwards/Workspace/cleveragents-core

The tests appear to mostly be based on behave not pytest.

I guess that you could use coverage.py but I had problems running it.

1. `pytest` is not installed. 2. After I install `pytest`, I get the following problems: ``` python -m pytest --cov=cleveragents ERROR: usage: __main__.py [options] [file_or_dir] [file_or_dir] [...] __main__.py: error: unrecognized arguments: --cov=cleveragents inifile: /home/brent.edwards/Workspace/cleveragents-core/setup.cfg rootdir: /home/brent.edwards/Workspace/cleveragents-core ``` 3. The tests appear to mostly be based on `behave` not `pytest`. I guess that you could use `coverage.py` but I had problems running it.

brent.edwards commented

2025-10-08 18:36:53 +00:00

This command now runs tests/unit/core/test_json_sanitization.py and tests/unit/core/test_tool_command_processing.py. But that's not all of the tests.

This command now runs `tests/unit/core/test_json_sanitization.py` and `tests/unit/core/test_tool_command_processing.py`. But that's not all of the tests.

brent.edwards marked this conversation as resolved

README.md Outdated

					
				@ -0,0 +279,4 @@

				python -m pytest --cov=cleveragents

				# Run with tox for multiple environments

				tox

brent.edwards commented

2025-10-03 21:23:41 +00:00

Running tox gives the following problem in the middle:

lists of files in version control and sdist do not match!
missing from VCS:
  examples/documents/BusinessServiceAgreement.docx
  examples/documents/BusinessServiceAgreement.txt
missing from sdist:
  README.md
check: exit 1 (1.61 seconds) /home/brent.edwards/Workspace/cleveragents-core> check-manifest /home/brent.edwards/Workspace/cleveragents-core pid=469604

and the tests run endlessly but it occasionally pauses with the message

Failing scenarios:
  tests/features/application_missing_lines_coverage.feature:154  Subscription re-setup after operations lines 644-648
  tests/features/tool_agent.feature:84  Tool Agent shell command execution

73 features passed, 2 failed, 0 skipped
1217 scenarios passed, 2 failed, 0 skipped
6559 steps passed, 2 failed, 0 skipped
Took 0min 49.383s
py311-nocov: exit 1 (122.66 seconds) /home/brent.edwards/Workspace/cleveragents-core> behave tests/features pid=485307
py311-nocov: FAIL ✖ in 2 minutes 13.48 seconds

regularly.

Running `tox` gives the following problem in the middle: ``` lists of files in version control and sdist do not match! missing from VCS: examples/documents/BusinessServiceAgreement.docx examples/documents/BusinessServiceAgreement.txt missing from sdist: README.md check: exit 1 (1.61 seconds) /home/brent.edwards/Workspace/cleveragents-core> check-manifest /home/brent.edwards/Workspace/cleveragents-core pid=469604 ``` and the tests run endlessly but it occasionally pauses with the message ``` Failing scenarios: tests/features/application_missing_lines_coverage.feature:154 Subscription re-setup after operations lines 644-648 tests/features/tool_agent.feature:84 Tool Agent shell command execution 73 features passed, 2 failed, 0 skipped 1217 scenarios passed, 2 failed, 0 skipped 6559 steps passed, 2 failed, 0 skipped Took 0min 49.383s py311-nocov: exit 1 (122.66 seconds) /home/brent.edwards/Workspace/cleveragents-core> behave tests/features pid=485307 py311-nocov: FAIL ✖ in 2 minutes 13.48 seconds ``` regularly.

brent.edwards commented

2025-10-08 18:40:49 +00:00

When I run tox now, I get the following in the middle:

Failing scenarios:
  tests/features/tool_agent.feature:84  Tool Agent shell command execution

and it keeps rerunning.

When I run `tox` now, I get the following in the middle: ``` Failing scenarios: tests/features/tool_agent.feature:84 Tool Agent shell command execution ``` and it keeps rerunning.

brent.edwards commented

2025-10-09 21:46:09 +00:00

I just tried tox -e py312-nocov and here are the results:

Failing scenarios:
  tests/features/application_direct_coverage.feature:8  Direct prompt template processing
  tests/features/tool_agent.feature:84  Tool Agent shell command execution

Errored scenarios:
  tests/features/application_direct_coverage.feature:13  Direct single-shot with message content
  tests/features/application_direct_coverage.feature:18  Direct interactive session setup
  tests/features/application_direct_coverage.feature:23  Direct application disposal
  tests/features/application_direct_coverage.feature:28  Direct visualization generation
  tests/features/application_direct_coverage.feature:33  Direct configuration conversion
  tests/features/application_direct_coverage.feature:38  Direct template registration
  tests/features/application_direct_coverage.feature:43  Direct agent creation paths
  tests/features/application_direct_coverage.feature:48  Direct route setup
  tests/features/application_direct_coverage.feature:53  Direct stream operations
  tests/features/application_direct_coverage.feature:58  Direct interactive session full workflow
  tests/features/application_direct_coverage.feature:63  Direct timeout and error handling
  tests/features/application_direct_coverage.feature:68  Direct complex template processing
  tests/features/application_missing_lines_coverage.feature:10  Single-shot with no config loaded error on line 220
  tests/features/application_missing_lines_coverage.feature:15  Single-shot with None message content handling lines 231-236
  tests/features/application_missing_lines_coverage.feature:20  Single-shot with message without content attribute lines 234-236
  tests/features/application_missing_lines_coverage.feature:25  Interactive session with no config loaded error on line 288
  tests/features/application_missing_lines_coverage.feature:30  Interactive help command lines 320-321
  tests/features/application_missing_lines_coverage.feature:35  Interactive stream command processing lines 322-324
  tests/features/application_missing_lines_coverage.feature:40  Interactive graph command processing lines 325-327
  tests/features/application_missing_lines_coverage.feature:45  Interactive empty input handling lines 328-329
  tests/features/application_missing_lines_coverage.feature:50  Interactive keyboard interrupt handling lines 340-342
  tests/features/application_missing_lines_coverage.feature:55  Interactive EOF handling lines 343-344
  tests/features/application_missing_lines_coverage.feature:60  Config to dict conversion with no config line 358-359
  tests/features/application_missing_lines_coverage.feature:65  Template registration with no config early return line 376-377
  tests/features/application_missing_lines_coverage.feature:70  Template registration with non-dict templates lines 384-385
  tests/features/application_missing_lines_coverage.feature:75  Enhanced registry with raw template processing lines 414-441
  tests/features/application_missing_lines_coverage.feature:80  Agent creation error with no factory line 455-456
  tests/features/application_missing_lines_coverage.feature:85  Agent creation with enhanced registry template instances line 474-491
  tests/features/application_missing_lines_coverage.feature:90  Agent creation with regular registry fallback lines 490-491
  tests/features/application_missing_lines_coverage.feature:95  Agent creation with regular registry instantiation lines 495-498
  tests/features/application_missing_lines_coverage.feature:100  Agent creation with no instantiate capability lines 501-502
  tests/features/application_missing_lines_coverage.feature:105  Route setup with no routes early return lines 518-519
  tests/features/application_missing_lines_coverage.feature:110  Route setup bridge initialization lines 522-527
  tests/features/application_missing_lines_coverage.feature:115  Route template with no registry fallback lines 538-544
  tests/features/application_missing_lines_coverage.feature:120  Stream route template field updates lines 548-561
  tests/features/application_missing_lines_coverage.feature:125  Graph route template field updates lines 562-567
  tests/features/application_missing_lines_coverage.feature:130  State class resolution failure lines 582-591
  tests/features/application_missing_lines_coverage.feature:135  Bridge route registration lines 607-609
  tests/features/application_missing_lines_coverage.feature:140  Stream operations with no config early return line 615-616
  tests/features/application_missing_lines_coverage.feature:145  Merge operations processing lines 619-624
  tests/features/application_missing_lines_coverage.feature:150  Split operations processing lines 627-632
  tests/features/application_missing_lines_coverage.feature:155  Subscription re-setup after operations lines 644-648
  tests/features/application_missing_lines_coverage.feature:160  Pipeline setup with no config early return line 654-655
  tests/features/application_missing_lines_coverage.feature:165  Pipeline configuration conversion lines 658-667
  tests/features/application_missing_lines_coverage.feature:170  Stream command with invalid stream lines 682-683
  tests/features/application_missing_lines_coverage.feature:175  Graph command with invalid graph lines 702-703
  tests/features/application_missing_lines_coverage.feature:180  Graph command usage error line 720
  tests/features/application_missing_lines_coverage.feature:185  Graph execution with no messages lines 736-739
  tests/features/application_missing_lines_coverage.feature:190  Graph execution error handling lines 741-742
  tests/features/application_missing_lines_coverage.feature:195  Application disposal lines 746-747
  tests/features/application_missing_lines_coverage.feature:200  Network visualization with complex configuration lines 760-795
  tests/features/application_missing_lines_coverage.feature:205  Network visualization with unsupported format lines 798

72 features passed, 2 failed, 1 error, 0 skipped
1171 scenarios passed, 2 failed, 52 error, 0 skipped
6449 steps passed, 2 failed, 52 error, 104 skipped
Took 0min 42.873s
py312-nocov: exit 1 (116.51 seconds) /home/brent.edwards/Workspace/cleveragents-core> behave tests/features pid=46119
  py312-nocov: FAIL code 1 (168.51=setup[52.00]+cmd[116.51] seconds)
  evaluation failed :( (168.66 seconds)

I just tried `tox -e py312-nocov` and here are the results: ``` Failing scenarios: tests/features/application_direct_coverage.feature:8 Direct prompt template processing tests/features/tool_agent.feature:84 Tool Agent shell command execution Errored scenarios: tests/features/application_direct_coverage.feature:13 Direct single-shot with message content tests/features/application_direct_coverage.feature:18 Direct interactive session setup tests/features/application_direct_coverage.feature:23 Direct application disposal tests/features/application_direct_coverage.feature:28 Direct visualization generation tests/features/application_direct_coverage.feature:33 Direct configuration conversion tests/features/application_direct_coverage.feature:38 Direct template registration tests/features/application_direct_coverage.feature:43 Direct agent creation paths tests/features/application_direct_coverage.feature:48 Direct route setup tests/features/application_direct_coverage.feature:53 Direct stream operations tests/features/application_direct_coverage.feature:58 Direct interactive session full workflow tests/features/application_direct_coverage.feature:63 Direct timeout and error handling tests/features/application_direct_coverage.feature:68 Direct complex template processing tests/features/application_missing_lines_coverage.feature:10 Single-shot with no config loaded error on line 220 tests/features/application_missing_lines_coverage.feature:15 Single-shot with None message content handling lines 231-236 tests/features/application_missing_lines_coverage.feature:20 Single-shot with message without content attribute lines 234-236 tests/features/application_missing_lines_coverage.feature:25 Interactive session with no config loaded error on line 288 tests/features/application_missing_lines_coverage.feature:30 Interactive help command lines 320-321 tests/features/application_missing_lines_coverage.feature:35 Interactive stream command processing lines 322-324 tests/features/application_missing_lines_coverage.feature:40 Interactive graph command processing lines 325-327 tests/features/application_missing_lines_coverage.feature:45 Interactive empty input handling lines 328-329 tests/features/application_missing_lines_coverage.feature:50 Interactive keyboard interrupt handling lines 340-342 tests/features/application_missing_lines_coverage.feature:55 Interactive EOF handling lines 343-344 tests/features/application_missing_lines_coverage.feature:60 Config to dict conversion with no config line 358-359 tests/features/application_missing_lines_coverage.feature:65 Template registration with no config early return line 376-377 tests/features/application_missing_lines_coverage.feature:70 Template registration with non-dict templates lines 384-385 tests/features/application_missing_lines_coverage.feature:75 Enhanced registry with raw template processing lines 414-441 tests/features/application_missing_lines_coverage.feature:80 Agent creation error with no factory line 455-456 tests/features/application_missing_lines_coverage.feature:85 Agent creation with enhanced registry template instances line 474-491 tests/features/application_missing_lines_coverage.feature:90 Agent creation with regular registry fallback lines 490-491 tests/features/application_missing_lines_coverage.feature:95 Agent creation with regular registry instantiation lines 495-498 tests/features/application_missing_lines_coverage.feature:100 Agent creation with no instantiate capability lines 501-502 tests/features/application_missing_lines_coverage.feature:105 Route setup with no routes early return lines 518-519 tests/features/application_missing_lines_coverage.feature:110 Route setup bridge initialization lines 522-527 tests/features/application_missing_lines_coverage.feature:115 Route template with no registry fallback lines 538-544 tests/features/application_missing_lines_coverage.feature:120 Stream route template field updates lines 548-561 tests/features/application_missing_lines_coverage.feature:125 Graph route template field updates lines 562-567 tests/features/application_missing_lines_coverage.feature:130 State class resolution failure lines 582-591 tests/features/application_missing_lines_coverage.feature:135 Bridge route registration lines 607-609 tests/features/application_missing_lines_coverage.feature:140 Stream operations with no config early return line 615-616 tests/features/application_missing_lines_coverage.feature:145 Merge operations processing lines 619-624 tests/features/application_missing_lines_coverage.feature:150 Split operations processing lines 627-632 tests/features/application_missing_lines_coverage.feature:155 Subscription re-setup after operations lines 644-648 tests/features/application_missing_lines_coverage.feature:160 Pipeline setup with no config early return line 654-655 tests/features/application_missing_lines_coverage.feature:165 Pipeline configuration conversion lines 658-667 tests/features/application_missing_lines_coverage.feature:170 Stream command with invalid stream lines 682-683 tests/features/application_missing_lines_coverage.feature:175 Graph command with invalid graph lines 702-703 tests/features/application_missing_lines_coverage.feature:180 Graph command usage error line 720 tests/features/application_missing_lines_coverage.feature:185 Graph execution with no messages lines 736-739 tests/features/application_missing_lines_coverage.feature:190 Graph execution error handling lines 741-742 tests/features/application_missing_lines_coverage.feature:195 Application disposal lines 746-747 tests/features/application_missing_lines_coverage.feature:200 Network visualization with complex configuration lines 760-795 tests/features/application_missing_lines_coverage.feature:205 Network visualization with unsupported format lines 798 72 features passed, 2 failed, 1 error, 0 skipped 1171 scenarios passed, 2 failed, 52 error, 0 skipped 6449 steps passed, 2 failed, 52 error, 104 skipped Took 0min 42.873s py312-nocov: exit 1 (116.51 seconds) /home/brent.edwards/Workspace/cleveragents-core> behave tests/features pid=46119 py312-nocov: FAIL code 1 (168.51=setup[52.00]+cmd[116.51] seconds) evaluation failed :( (168.66 seconds) ```

brent.edwards commented

2025-10-10 20:13:23 +00:00

When I try tox -e py312-nocov, I get the following:

Failing scenarios:
  tests/features/application_direct_coverage.feature:8  Direct prompt template processing
  tests/features/tool_agent.feature:84  Tool Agent shell command execution
  tests/features/tool_agent.feature:608  Tool agent file write append mode

Errored scenarios:
  tests/features/application_direct_coverage.feature:13  Direct single-shot with message content
  tests/features/application_direct_coverage.feature:18  Direct interactive session setup
  tests/features/application_direct_coverage.feature:23  Direct application disposal
  tests/features/application_direct_coverage.feature:28  Direct visualization generation
  tests/features/application_direct_coverage.feature:33  Direct configuration conversion
  tests/features/application_direct_coverage.feature:38  Direct template registration
  tests/features/application_direct_coverage.feature:43  Direct agent creation paths
  tests/features/application_direct_coverage.feature:48  Direct route setup
  tests/features/application_direct_coverage.feature:53  Direct stream operations
  tests/features/application_direct_coverage.feature:58  Direct interactive session full workflow
  tests/features/application_direct_coverage.feature:63  Direct timeout and error handling
  tests/features/application_direct_coverage.feature:68  Direct complex template processing
  tests/features/application_missing_lines_coverage.feature:10  Single-shot with no config loaded error on line 220
  tests/features/application_missing_lines_coverage.feature:15  Single-shot with None message content handling lines 231-236
  tests/features/application_missing_lines_coverage.feature:20  Single-shot with message without content attribute lines 234-236
  tests/features/application_missing_lines_coverage.feature:25  Interactive session with no config loaded error on line 288
  tests/features/application_missing_lines_coverage.feature:30  Interactive help command lines 320-321
  tests/features/application_missing_lines_coverage.feature:35  Interactive stream command processing lines 322-324
  tests/features/application_missing_lines_coverage.feature:40  Interactive graph command processing lines 325-327
  tests/features/application_missing_lines_coverage.feature:45  Interactive empty input handling lines 328-329
  tests/features/application_missing_lines_coverage.feature:50  Interactive keyboard interrupt handling lines 340-342
  tests/features/application_missing_lines_coverage.feature:55  Interactive EOF handling lines 343-344
  tests/features/application_missing_lines_coverage.feature:60  Config to dict conversion with no config line 358-359
  tests/features/application_missing_lines_coverage.feature:65  Template registration with no config early return line 376-377
  tests/features/application_missing_lines_coverage.feature:70  Template registration with non-dict templates lines 384-385
  tests/features/application_missing_lines_coverage.feature:75  Enhanced registry with raw template processing lines 414-441
  tests/features/application_missing_lines_coverage.feature:80  Agent creation error with no factory line 455-456
  tests/features/application_missing_lines_coverage.feature:85  Agent creation with enhanced registry template instances line 474-491
  tests/features/application_missing_lines_coverage.feature:90  Agent creation with regular registry fallback lines 490-491
  tests/features/application_missing_lines_coverage.feature:95  Agent creation with regular registry instantiation lines 495-498
  tests/features/application_missing_lines_coverage.feature:100  Agent creation with no instantiate capability lines 501-502
  tests/features/application_missing_lines_coverage.feature:105  Route setup with no routes early return lines 518-519
  tests/features/application_missing_lines_coverage.feature:110  Route setup bridge initialization lines 522-527
  tests/features/application_missing_lines_coverage.feature:115  Route template with no registry fallback lines 538-544
  tests/features/application_missing_lines_coverage.feature:120  Stream route template field updates lines 548-561
  tests/features/application_missing_lines_coverage.feature:125  Graph route template field updates lines 562-567
  tests/features/application_missing_lines_coverage.feature:130  State class resolution failure lines 582-591
  tests/features/application_missing_lines_coverage.feature:135  Bridge route registration lines 607-609
  tests/features/application_missing_lines_coverage.feature:140  Stream operations with no config early return line 615-616
  tests/features/application_missing_lines_coverage.feature:145  Merge operations processing lines 619-624
  tests/features/application_missing_lines_coverage.feature:150  Split operations processing lines 627-632
  tests/features/application_missing_lines_coverage.feature:155  Subscription re-setup after operations lines 644-648
  tests/features/application_missing_lines_coverage.feature:160  Pipeline setup with no config early return line 654-655
  tests/features/application_missing_lines_coverage.feature:165  Pipeline configuration conversion lines 658-667
  tests/features/application_missing_lines_coverage.feature:170  Stream command with invalid stream lines 682-683
  tests/features/application_missing_lines_coverage.feature:175  Graph command with invalid graph lines 702-703
  tests/features/application_missing_lines_coverage.feature:180  Graph command usage error line 720
  tests/features/application_missing_lines_coverage.feature:185  Graph execution with no messages lines 736-739
  tests/features/application_missing_lines_coverage.feature:190  Graph execution error handling lines 741-742
  tests/features/application_missing_lines_coverage.feature:195  Application disposal lines 746-747
  tests/features/application_missing_lines_coverage.feature:200  Network visualization with complex configuration lines 760-795
  tests/features/application_missing_lines_coverage.feature:205  Network visualization with unsupported format lines 798

72 features passed, 2 failed, 1 error, 0 skipped
1170 scenarios passed, 3 failed, 52 error, 0 skipped
6448 steps passed, 3 failed, 52 error, 104 skipped
Took 0min 42.628s
py312-nocov: exit 1 (116.96 seconds) /home/brent.edwards/Workspace/cleveragents-core> behave tests/features pid=57746
  py312-nocov: FAIL code 1 (132.58=setup[15.62]+cmd[116.96] seconds)
  evaluation failed :( (132.93 seconds)

When I try ` tox -e py312-nocov`, I get the following: ``` Failing scenarios: tests/features/application_direct_coverage.feature:8 Direct prompt template processing tests/features/tool_agent.feature:84 Tool Agent shell command execution tests/features/tool_agent.feature:608 Tool agent file write append mode Errored scenarios: tests/features/application_direct_coverage.feature:13 Direct single-shot with message content tests/features/application_direct_coverage.feature:18 Direct interactive session setup tests/features/application_direct_coverage.feature:23 Direct application disposal tests/features/application_direct_coverage.feature:28 Direct visualization generation tests/features/application_direct_coverage.feature:33 Direct configuration conversion tests/features/application_direct_coverage.feature:38 Direct template registration tests/features/application_direct_coverage.feature:43 Direct agent creation paths tests/features/application_direct_coverage.feature:48 Direct route setup tests/features/application_direct_coverage.feature:53 Direct stream operations tests/features/application_direct_coverage.feature:58 Direct interactive session full workflow tests/features/application_direct_coverage.feature:63 Direct timeout and error handling tests/features/application_direct_coverage.feature:68 Direct complex template processing tests/features/application_missing_lines_coverage.feature:10 Single-shot with no config loaded error on line 220 tests/features/application_missing_lines_coverage.feature:15 Single-shot with None message content handling lines 231-236 tests/features/application_missing_lines_coverage.feature:20 Single-shot with message without content attribute lines 234-236 tests/features/application_missing_lines_coverage.feature:25 Interactive session with no config loaded error on line 288 tests/features/application_missing_lines_coverage.feature:30 Interactive help command lines 320-321 tests/features/application_missing_lines_coverage.feature:35 Interactive stream command processing lines 322-324 tests/features/application_missing_lines_coverage.feature:40 Interactive graph command processing lines 325-327 tests/features/application_missing_lines_coverage.feature:45 Interactive empty input handling lines 328-329 tests/features/application_missing_lines_coverage.feature:50 Interactive keyboard interrupt handling lines 340-342 tests/features/application_missing_lines_coverage.feature:55 Interactive EOF handling lines 343-344 tests/features/application_missing_lines_coverage.feature:60 Config to dict conversion with no config line 358-359 tests/features/application_missing_lines_coverage.feature:65 Template registration with no config early return line 376-377 tests/features/application_missing_lines_coverage.feature:70 Template registration with non-dict templates lines 384-385 tests/features/application_missing_lines_coverage.feature:75 Enhanced registry with raw template processing lines 414-441 tests/features/application_missing_lines_coverage.feature:80 Agent creation error with no factory line 455-456 tests/features/application_missing_lines_coverage.feature:85 Agent creation with enhanced registry template instances line 474-491 tests/features/application_missing_lines_coverage.feature:90 Agent creation with regular registry fallback lines 490-491 tests/features/application_missing_lines_coverage.feature:95 Agent creation with regular registry instantiation lines 495-498 tests/features/application_missing_lines_coverage.feature:100 Agent creation with no instantiate capability lines 501-502 tests/features/application_missing_lines_coverage.feature:105 Route setup with no routes early return lines 518-519 tests/features/application_missing_lines_coverage.feature:110 Route setup bridge initialization lines 522-527 tests/features/application_missing_lines_coverage.feature:115 Route template with no registry fallback lines 538-544 tests/features/application_missing_lines_coverage.feature:120 Stream route template field updates lines 548-561 tests/features/application_missing_lines_coverage.feature:125 Graph route template field updates lines 562-567 tests/features/application_missing_lines_coverage.feature:130 State class resolution failure lines 582-591 tests/features/application_missing_lines_coverage.feature:135 Bridge route registration lines 607-609 tests/features/application_missing_lines_coverage.feature:140 Stream operations with no config early return line 615-616 tests/features/application_missing_lines_coverage.feature:145 Merge operations processing lines 619-624 tests/features/application_missing_lines_coverage.feature:150 Split operations processing lines 627-632 tests/features/application_missing_lines_coverage.feature:155 Subscription re-setup after operations lines 644-648 tests/features/application_missing_lines_coverage.feature:160 Pipeline setup with no config early return line 654-655 tests/features/application_missing_lines_coverage.feature:165 Pipeline configuration conversion lines 658-667 tests/features/application_missing_lines_coverage.feature:170 Stream command with invalid stream lines 682-683 tests/features/application_missing_lines_coverage.feature:175 Graph command with invalid graph lines 702-703 tests/features/application_missing_lines_coverage.feature:180 Graph command usage error line 720 tests/features/application_missing_lines_coverage.feature:185 Graph execution with no messages lines 736-739 tests/features/application_missing_lines_coverage.feature:190 Graph execution error handling lines 741-742 tests/features/application_missing_lines_coverage.feature:195 Application disposal lines 746-747 tests/features/application_missing_lines_coverage.feature:200 Network visualization with complex configuration lines 760-795 tests/features/application_missing_lines_coverage.feature:205 Network visualization with unsupported format lines 798 72 features passed, 2 failed, 1 error, 0 skipped 1170 scenarios passed, 3 failed, 52 error, 0 skipped 6448 steps passed, 3 failed, 52 error, 104 skipped Took 0min 42.628s py312-nocov: exit 1 (116.96 seconds) /home/brent.edwards/Workspace/cleveragents-core> behave tests/features pid=57746 py312-nocov: FAIL code 1 (132.58=setup[15.62]+cmd[116.96] seconds) evaluation failed :( (132.93 seconds) ```

aditya commented

2025-10-17 18:46:07 +00:00

All tests are passing now.

brent.edwards commented

2025-10-20 20:02:19 +00:00

When I run tox -e py312-nocov, I still get:

Failing scenarios:
  tests/features/tool_agent.feature:84  Tool Agent shell command execution

74 features passed, 1 failed, 0 skipped
1224 scenarios passed, 1 failed, 0 skipped
6606 steps passed, 1 failed, 0 skipped
Took 0min 49.164s
py312-nocov: exit 1 (122.16 seconds) /home/brent.edwards/Workspace/cleveragents-core> behave tests/features pid=2539995
  py312-nocov: FAIL code 1 (138.74=setup[16.59]+cmd[122.16] seconds)
  evaluation failed :( (138.84 seconds)

When I run ` tox -e py312-nocov`, I still get: ``` Failing scenarios: tests/features/tool_agent.feature:84 Tool Agent shell command execution 74 features passed, 1 failed, 0 skipped 1224 scenarios passed, 1 failed, 0 skipped 6606 steps passed, 1 failed, 0 skipped Took 0min 49.164s py312-nocov: exit 1 (122.16 seconds) /home/brent.edwards/Workspace/cleveragents-core> behave tests/features pid=2539995 py312-nocov: FAIL code 1 (138.74=setup[16.59]+cmd[122.16] seconds) evaluation failed :( (138.84 seconds) ```

docs/YAML_SYNTAX.md

					
				@ -0,0 +41,4 @@

				  - sources: [source_streams]

				    target: target_stream

				splits:

brent.edwards commented

2025-10-03 21:24:48 +00:00

Is splits required or optional?

Is `splits` required or optional?

aditya commented

2025-10-17 18:59:45 +00:00

splits is optional. Also mentioned the same in the document

docs/YAML_SYNTAX.md

					
				@ -0,0 +61,4 @@

				  global:

				    variable_name: value

				```

brent.edwards commented

2025-10-03 21:29:01 +00:00

There should be a section about global configuration here.

It should include links to documentation of Jinja and Mustache.

There should be a section about global configuration here. It should include links to documentation of Jinja and Mustache.

aditya commented

2025-10-17 19:00:10 +00:00

added global configuration section and links to documentation of Jinja and Mustache

docs/YAML_SYNTAX.md

					
				@ -0,0 +104,4 @@

				      safe_mode: true

				      timeout: 30

				```

brent.edwards commented

2025-10-03 21:33:32 +00:00

Can you explain what a tool agent is? The only definition so far is in README.md line 241.

You list several tools but don't explain what they are or what they do. This is the best place to explain the basic tools.

Can you explain what a tool agent is? The only definition so far is in `README.md` line 241. You list several tools but don't explain what they are or what they do. This is the best place to explain the basic tools.

aditya commented

2025-10-17 19:21:42 +00:00

Added the details for tool agent. @brent.edwards

docs/YAML_SYNTAX.md

					
				@ -0,0 +130,4 @@

				        input: sub_agent1

				        output: sub_stream1

				```

brent.edwards commented

2025-10-03 21:34:55 +00:00

Could you explain here what a composite agent is?

What does it do, why is it important?

Could you explain here what a composite agent is? What does it do, why is it important?

aditya commented

2025-10-17 19:23:33 +00:00

Added the details for composite agent. @brent.edwards

docs/YAML_SYNTAX.md Outdated

					
				@ -0,0 +145,4 @@

				    operators:

				      - type: map

				        params:

				          agent: agent_name

brent.edwards commented

2025-10-03 21:39:40 +00:00

I guess that it would be more helpful to see an example function here than an agent.

There seem to be other ways to run other agents, but I haven't yet seen a way to run simple code.

I guess that it would be more helpful to see an example function here than an agent. There seem to be other ways to run other agents, but I haven't yet seen a way to run simple code.

aditya commented

2025-10-17 19:25:05 +00:00

@brent.edwards added the same here

docs/YAML_SYNTAX.md Outdated

					
				@ -0,0 +177,4 @@

				- `replay`: Replay all previous values

				**Operators:**

				- `map`: Transform messages using agents or functions

brent.edwards commented

2025-10-03 21:41:12 +00:00

Could you link to where there's more documentation for the functions? I suspect that will be very important to programmers.

aditya commented

2025-10-17 19:26:49 +00:00

added @brent.edwards

docs/YAML_SYNTAX.md Outdated

					
				@ -0,0 +220,4 @@

				    parallel_execution: true

				    state_class: "my_module.MyState"

				```

brent.edwards commented

2025-10-03 21:44:20 +00:00

Could you point to where someone can find more details on entry_point and edges -- the central point of a graph?

Could you point to where someone can find more details on `entry_point` and `edges` -- the central point of a graph?

docs/YAML_SYNTAX.md

					
				@ -0,0 +253,4 @@

				      preserve_subscriptions: true

				      preserve_checkpointing: true

				```

brent.edwards commented

2025-10-03 21:44:57 +00:00

Could you point to where people can find more information about bridge routes?

examples/legal_contract_metadata_extractor.yaml Outdated

					
				@ -0,0 +1,190 @@

				agents:

brent.edwards commented

2025-10-03 21:48:04 +00:00

Is this the latest version of the code? For example, this code doesn't seem to write its analysis to a file.

brent.edwards marked this conversation as resolved

examples/multi_agent_paper_writer.yaml

					
				@ -0,0 +1,274 @@

				# Multi-Agent Paper Writer - Multiple Specialized Agents Working Together

brent.edwards commented

2025-10-03 22:33:03 +00:00

As part of this header, could you mention that this code needs the --unsafe flag?

As part of this header, could you mention that this code needs the `--unsafe` flag?

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -294,7 +294,14 @@ class ReactiveCleverAgentsApp:

				            # Set up observers for output and errors

				            def on_output(msg: StreamMessage):

brent.edwards commented

2025-10-03 23:24:07 +00:00

Just FYI -- pylint requires a docstring for methods, and a return value.

Just FYI -- `pylint` requires a docstring for methods, and a return value.

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -298,0 +300,4 @@

				                    print(f"\n[DEBUG] Empty output received\n")

				                else:

				                    # Check for tool execution commands

				                    processed_content = self._process_tool_commands(content_str)

brent.edwards commented

2025-10-03 22:48:36 +00:00

This code changes the program's behavior. Could you write a quick test that covers this change?

src/cleveragents/core/application.py Outdated

					
				@ -338,3 +345,3 @@

				                    # Give streams time to process

				                    await asyncio.sleep(0.1)

				                    await asyncio.sleep(2.0)

brent.edwards commented

2025-10-03 22:51:28 +00:00

FYI... I get nervous when code tries to guesstimate how long processing will take. It's always either never enough time or too much time.

Is it possible to use some signal between the stream and this code to notify when the stream is done processing?

FYI... I get nervous when code tries to guesstimate how long processing will take. It's always either never enough time or too much time. Is it possible to use some signal between the stream and this code to notify when the stream is done processing?

src/cleveragents/core/application.py Outdated

					
				@ -668,6 +664,151 @@ class ReactiveCleverAgentsApp:

				            self.langgraph_bridge.create_hybrid_pipeline(config_dict)

				            self.logger.debug(f"Created hybrid pipeline: {pipeline_name}")

				    def _sanitize_json_string(self, json_str: str) -> str:

brent.edwards commented

2025-10-03 23:10:05 +00:00

By the way, thank you very much for the test_json_sanitation.py file!

This method could be static.

By the way, thank you very much for the `test_json_sanitation.py` file! This method could be static.

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +677,4 @@

				        Returns:

				            Sanitized JSON string with control characters properly escaped

				        """

				        import re

brent.edwards commented

2025-10-03 23:45:03 +00:00

src/cleveragents/core/application.py:680:8: C0415: Import outside toplevel (re) (import-outside-toplevel)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +678,4 @@

				            Sanitized JSON string with control characters properly escaped

				        """

				        import re

				        import json

brent.edwards commented

2025-10-03 23:45:15 +00:00

src/cleveragents/core/application.py:681:8: C0415: Import outside toplevel (json) (import-outside-toplevel)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +679,4 @@

				        """

				        import re

				        import json

brent.edwards commented

2025-10-03 23:43:03 +00:00

src/cleveragents/core/application.py:682:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +686,4 @@

				            return json_str  # Already valid, no sanitization needed

				        except json.JSONDecodeError:

				            pass  # Need to sanitize

brent.edwards commented

2025-10-03 23:42:51 +00:00

src/cleveragents/core/application.py:689:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +689,4 @@

				        # Strategy: Find all quoted string values and escape control characters within them

				        # We need to be careful to only escape content inside string values, not the JSON structure

brent.edwards commented

2025-10-03 23:42:38 +00:00

src/cleveragents/core/application.py:692:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +690,4 @@

				        # Strategy: Find all quoted string values and escape control characters within them

				        # We need to be careful to only escape content inside string values, not the JSON structure

				        def escape_string_content(match):

brent.edwards commented

2025-10-03 23:38:30 +00:00

src/cleveragents/core/application.py:693: error: Function is missing a type annotation [no-untyped-def]

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +695,4 @@

				            # match.group(0) is the full match including quotes

				            # match.group(1) is the content inside the quotes

				            content = match.group(1)

brent.edwards commented

2025-10-03 23:42:26 +00:00

src/cleveragents/core/application.py:698:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +705,4 @@

				            content = content.replace('\b', '\\b')

				            content = content.replace('\f', '\\f')

				            content = content.replace('"', '\\"')  # Escape quotes

brent.edwards commented

2025-10-03 23:44:26 +00:00

src/cleveragents/core/application.py:708:0: C0303: Trailing whitespace (trailing-whitespace)

(I'm not going to include more of these. Use pylint to find all of them.)

src/cleveragents/core/application.py:708:0: C0303: Trailing whitespace (trailing-whitespace) (I'm not going to include more of these. Use `pylint` to find all of them.)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +712,4 @@

				        # Pattern to match quoted strings (both keys and values)

				        # This matches: "anything including newlines and special chars"

				        # We use a negative lookbehind to avoid matching escaped quotes

				        pattern = r'"((?:[^"\\]|\\.)*)"|"([^"]*(?:\n[^"]*)*)"'

brent.edwards commented

2025-10-03 22:53:44 +00:00

You write pattern twice; only the second one is used. You should remove this variable.

You write `pattern` twice; only the second one is used. You should remove this variable.

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +721,4 @@

				        return sanitized

				    def _process_tool_commands(self, content: str) -> str:

brent.edwards commented

2025-10-03 23:45:37 +00:00

src/cleveragents/core/application.py:724:4: R0914: Too many local variables (17/15) (too-many-locals)

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +727,4 @@

				        Detects [TOOL_EXECUTE:tool_name] commands and executes actual tools.

				        """

				        import re

brent.edwards commented

2025-10-03 23:46:05 +00:00

src/cleveragents/core/application.py:730:8: C0415: Import outside toplevel (re) (import-outside-toplevel)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +728,4 @@

				        Detects [TOOL_EXECUTE:tool_name] commands and executes actual tools.

				        """

				        import re

				        import json

brent.edwards commented

2025-10-03 23:46:20 +00:00

src/cleveragents/core/application.py:731:8: C0415: Import outside toplevel (json) (import-outside-toplevel)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +729,4 @@

				        """

				        import re

				        import json

				        import asyncio

brent.edwards commented

2025-10-03 23:46:43 +00:00

src/cleveragents/core/application.py:732:8: W0404: Reimport 'asyncio' (imported line 8) (reimported)

src/cleveragents/core/application.py:732:8: W0621: Redefining name 'asyncio' from outer scope (line 8) (redefined-outer-name)

src/cleveragents/core/application.py:732:8: W0404: Reimport 'asyncio' (imported line 8) (reimported) src/cleveragents/core/application.py:732:8: W0621: Redefining name 'asyncio' from outer scope (line 8) (redefined-outer-name)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +734,4 @@

				        # Pattern to match tool execution commands

				        pattern = r'\[TOOL_EXECUTE:(\w+)\]\s*(\{[^}]+\})\s*\[/TOOL_EXECUTE\]'

				        def execute_tool_sync(tool_name, tool_params):

brent.edwards commented

2025-10-03 23:31:56 +00:00

The parameters will need types:

src/cleveragents/core/application.py:737: error: Function is missing a type annotation [no-untyped-def]

The parameters will need types: src/cleveragents/core/application.py:737: error: Function is missing a type annotation [no-untyped-def]

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +755,4 @@

				                # Try to get the current running loop

				                try:

				                    loop = asyncio.get_running_loop()

brent.edwards commented

2025-10-03 23:33:16 +00:00

loop is never used.

`loop` is never used.

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +757,4 @@

				                try:

				                    loop = asyncio.get_running_loop()

				                    # We're already in an async context, create a task

				                    import concurrent.futures

brent.edwards commented

2025-10-03 23:47:19 +00:00

src/cleveragents/core/application.py:760:20: C0415: Import outside toplevel (concurrent.futures) (import-outside-toplevel)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +767,4 @@

				                    # No running loop, we can use asyncio.run

				                    result = asyncio.run(file_manager.process_message(tool_request, context))

				                return f"\n✅ {result}"

brent.edwards commented

2025-10-03 23:14:28 +00:00

It's possible that result was never filled.

It's possible that `result` was never filled.

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +769,4 @@

				                return f"\n✅ {result}"

				            except Exception as e:

brent.edwards commented

2025-10-03 23:47:05 +00:00

src/cleveragents/core/application.py:772:19: W0718: Catching too general exception Exception (broad-exception-caught)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +770,4 @@

				                return f"\n✅ {result}"

				            except Exception as e:

				                self.logger.error(f"Tool execution failed: {e}")

brent.edwards commented

2025-10-03 23:47:37 +00:00

src/cleveragents/core/application.py:773:16: W1203: Use lazy % formatting in logging functions (logging-fstring-interpolation)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +790,4 @@

				                # Sanitize JSON string before parsing to handle LLM-generated malformed JSON

				                sanitized_params_str = self._sanitize_json_string(tool_params_str)

				                tool_params = json.loads(sanitized_params_str)

				                tool_result = execute_tool_sync(tool_name, tool_params)

brent.edwards commented

2025-10-03 23:39:12 +00:00

src/cleveragents/core/application.py:793: error: Call to untyped function "execute_tool_sync" in typed context [no-untyped-call]

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +799,4 @@

				                    result_content[match.end():]

				                )

				            except json.JSONDecodeError as e:

				                self.logger.error(f"Failed to parse tool params: {e}")

brent.edwards commented

2025-10-03 23:48:06 +00:00

src/cleveragents/core/application.py:802:16: W1203: Use lazy % formatting in logging functions (logging-fstring-interpolation)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -671,0 +800,4 @@

				                )

				            except json.JSONDecodeError as e:

				                self.logger.error(f"Failed to parse tool params: {e}")

				                error_msg = f"\n❌ Error: Invalid tool parameters format"

brent.edwards commented

2025-10-03 23:48:21 +00:00

src/cleveragents/core/application.py:803:28: W1309: Using an f-string that does not have any interpolated variables (f-string-without-interpolation)

brent.edwards marked this conversation as resolved

aditya added 4 commits

2025-10-06 12:57:55 +00:00

chore: add legal contract metadata update 0da823b4b9

chore: update legal contract metadata extractor orchestration c62f8bbb39

refactor: Reorganize unit tests into modular structure (tests/unit/core/, agents/, reactive/, etc.), dd comprehensive test suite for tool command processing, Add sample contract and integration test script 0f0a321c75

chore: fixed all pylint and mypy warnings ed59c1cacd

aditya added 1 commit

2025-10-06 13:21:33 +00:00

test: fix the broken bdd test cases 0d9b100263

brent.edwards reviewed

2025-10-06 19:52:52 +00:00

README.md Outdated

					
				@ -0,0 +36,4 @@

				```bash

				git clone https://git.cleverthis.com/cleveragents/cleveragents-core

				cd cleveragents

brent.edwards commented

2025-10-06 19:52:52 +00:00

This line should be

cd cleveragents-core

This line should be ``` cd cleveragents-core ```

brent.edwards reviewed

2025-10-06 23:49:13 +00:00

brent.edwards left a comment

Thank you for your hard work!

src/cleveragents/agents/base.py Outdated

					
				@ -194,0 +201,4 @@

				                # No event loop in current thread, create one

				                loop = asyncio.new_event_loop()

				                asyncio.set_event_loop(loop)

				            self._memory_lock_instance = asyncio.Lock()

brent.edwards commented

2025-10-06 22:58:49 +00:00

The fact that you're creating a memory lock means to me that this code is meant to be multithreaded. If I'm wrong -- if you're getting a memory lock for another reason -- then ignore this problem.

This code appears to be a check then act race condition.

If two threads tried to get the memory lock at the same time, and if the two threads both saw that self._memory_lock_instance is None, then two asyncio.Lock() would be created.

The fact that you're creating a memory lock means to me that this code is meant to be multithreaded. If I'm wrong -- if you're getting a memory lock for another reason -- then ignore this problem. This code appears to be a [check then act](https://www.geeksforgeeks.org/java/race-conditions/) race condition. If two threads tried to get the memory lock at the same time, and if the two threads both saw that `self._memory_lock_instance is None`, then two `asyncio.Lock()` would be created.

src/cleveragents/core/application.py Outdated

					
				@ -338,3 +348,3 @@

				                    # Give streams time to process

				                    await asyncio.sleep(0.1)

				                    await asyncio.sleep(2.0)

brent.edwards commented

2025-10-06 23:45:42 +00:00

FYI -- I still get nervous whenever I see sleep with a specific length of time to "give time to process". What happens when that's not enough time?

tests/scripts/test_legal_contract_analyzer.sh Outdated

					
				@ -0,0 +1,167 @@

				#!/bin/bash

brent.edwards commented

2025-10-06 21:26:58 +00:00

First and foremost, feel free to use any (reasonable) language. This is not a mandate.

In my experience, Python is better for writing tests than Bash. It's easier to understand and expand.

First and foremost, feel free to use any (reasonable) language. This is not a mandate. In my experience, Python is better for writing tests than Bash. It's easier to understand and expand.

brent.edwards commented

2025-10-06 21:48:23 +00:00

A second item -- these tests should be included in the instructions of how to test CleverAgents.

tests/scripts/test_legal_contract_analyzer.sh Outdated

					
				@ -0,0 +30,4 @@

				# Run the legal contract analyzer

				python -m cleveragents run \

				  -c examples/legal_contract_metadata_extractor.yaml \

				  -p "sample_contract.txt" \

brent.edwards commented

2025-10-06 21:30:19 +00:00

Where is sample_contract.txt?

Where is `sample_contract.txt`?

tests/scripts/test_legal_contract_analyzer.sh Outdated

					
				@ -0,0 +53,4 @@

				if grep -q "FILE_LOADER AGENT" /tmp/contract_test_output.log; then

				    echo "✅ FILE_LOADER agent simulation found"

				else

				    echo "❌ FILE_LOADER agent simulation not found"

brent.edwards commented

2025-10-06 21:27:36 +00:00

Failing this should cause the test to fail.

tests/scripts/test_legal_contract_analyzer.sh Outdated

					
				@ -0,0 +59,4 @@

				if grep -q "TEXT_PREPROCESSOR AGENT\|CONTRACT_ANALYZER AGENT\|RISK_ASSESSOR AGENT\|JSON_FORMATTER AGENT" /tmp/contract_test_output.log; then

				    echo "✅ Analysis agent simulation found"

				else

				    echo "❌ Analysis agent simulation not found (may continue in next interaction)"

brent.edwards commented

2025-10-06 21:27:50 +00:00

Failing this should cause the test to fail.

tests/scripts/test_legal_contract_analyzer.sh Outdated

					
				@ -0,0 +70,4 @@

				if grep -q "TOOL_EXECUTE:file_read" /tmp/contract_test_output.log; then

				    echo "✅ File reading tool execution command found"

				else

				    echo "⚠️  File reading tool execution command not found"

brent.edwards commented

2025-10-06 21:28:52 +00:00

Failing this should cause the overall test to fail. (Et cetera for five more locations below.)

tests/scripts/test_legal_contract_analyzer.sh Outdated

					
				@ -0,0 +164,4 @@

				# Clean up

				rm -f /tmp/contract_test_output.log

brent.edwards commented

2025-10-06 21:24:38 +00:00

When I run this test locally, individual tests fail (though the general test passes.)

Here's part of the output:

=== TEST 2: Agent Simulation Verification ===
Checking if orchestrator simulates specialist agents...
✅ FILE_LOADER agent simulation found
❌ Analysis agent simulation not found (may continue in next interaction)

=== TEST 3: Tool Execution Verification ===
Checking if file reading tool executed properly...
⚠️  File reading tool execution command not found

=== TEST 4: Contract Loading Validation ===
Checking if contract was loaded...
❌ Contract content not found in output

=== TEST 5: Metadata Extraction Validation ===
Checking if contract metadata was extracted...
❌ Party extraction not working
❌ Party extraction not working
⚠️  Financial terms extraction may need additional processing

When I run this test locally, individual tests fail (though the general test passes.) Here's part of the output: ``` === TEST 2: Agent Simulation Verification === Checking if orchestrator simulates specialist agents... ✅ FILE_LOADER agent simulation found ❌ Analysis agent simulation not found (may continue in next interaction) === TEST 3: Tool Execution Verification === Checking if file reading tool executed properly... ⚠️ File reading tool execution command not found === TEST 4: Contract Loading Validation === Checking if contract was loaded... ❌ Contract content not found in output === TEST 5: Metadata Extraction Validation === Checking if contract metadata was extracted... ❌ Party extraction not working ❌ Party extraction not working ⚠️ Financial terms extraction may need additional processing ```

tests/scripts/test_multi_agent_interactive.sh Outdated

					
				@ -0,0 +24,4 @@

				echo "Starting test..."

				echo ""

				cat << 'EOF' | timeout 120 python -m cleveragents interactive -c examples/multi_agent_paper_writer.yaml --unsafe 2>&1 | grep -v "^\[" | head -100

brent.edwards commented

2025-10-06 21:46:10 +00:00

With the setup given, my version of timeout isn't working.

You should probably say that any output means that something is probably wrong.

With the setup given, my version of `timeout` isn't working. You should probably say that any output means that something is probably wrong.

tests/scripts/test_multi_agent_interactive.sh Outdated

					
				@ -0,0 +34,4 @@

				echo ""

				echo "=========================================="

				echo "Test Complete!"

brent.edwards commented

2025-10-06 21:32:52 +00:00

Other than timing out, can this test ever fail?

brent.edwards commented

2025-10-06 21:56:55 +00:00

Here is my output of this test. It doesn't look like a successful test to me. What do you think?

❯ sh tests/scripts/test_multi_agent_interactive.sh 
==========================================
Multi-Agent Paper Writer Test
==========================================

This will simulate a conversation with multiple agents:
  1. RESEARCHER gathers requirements
  2. WRITER creates the paper
  3. Natural handoffs between agents

Starting test...


Reactive CleverAgents Interactive Session
Type 'exit' to quit, 'help' for commands

>>> [httpx] HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK"

=== RESEARCHER AGENT ===>
Hello! How can I assist you today? Are you looking to start a research paper? If so, could you please provide me some information about the topic you're interested in?

>>> [openai._base_client] Retrying request to /chat/completions in 0.488640 seconds

=== RESEARCHER AGENT ===>
That's an interesting topic! To help you tailor the paper, I have a few questions:

1. What is the specific aspect of quantum entanglement you want to focus on? 
2. Who is your target audience? (e.g., students, physicists, general public)
3. Are there any specific resources or references you'd like to include?
4. What is the expected length of the paper?
5. Do you have any deadline for this task? 

These details will help us create a comprehensive paper.

>>> [openai._base_client] Retrying request to /chat/completions in 0.449424 seconds

=== RESEARCHER AGENT ===>
Great! Writing for a specialized audience like quantum physics researchers will allow us to delve deep into the topic. 

Could you now answer the remaining questions:
1. What is the specific aspect of quantum entanglement you want to focus on in your paper? 
2. Are there any specific resources or references you'd like to include?
3. What is the expected length of the paper?
4. Do you have any deadline for this task? 

Having this information will help us structure the paper appropriately.

>>> [openai._base_client] Retrying request to /chat/completions in 0.415122 seconds

=== WRITER AGENT ===>
I apologize but I still need some more information to begin writing. Could you please provide:

1. The specific aspect of quantum entanglement you want to focus on.
2. Any specific resources or references to include.
3. The expected length of the paper.
4. The deadline for this task. 

These details will help me create a well-structured and effective paper.

>>> 
Goodbye!

==========================================
Test Complete!
==========================================

Here is my output of this test. It doesn't look like a successful test to me. What do you think? ``` ❯ sh tests/scripts/test_multi_agent_interactive.sh ========================================== Multi-Agent Paper Writer Test ========================================== This will simulate a conversation with multiple agents: 1. RESEARCHER gathers requirements 2. WRITER creates the paper 3. Natural handoffs between agents Starting test... Reactive CleverAgents Interactive Session Type 'exit' to quit, 'help' for commands >>> [httpx] HTTP Request: POST https://api.openai.com/v1/chat/completions "HTTP/1.1 200 OK" === RESEARCHER AGENT ===> Hello! How can I assist you today? Are you looking to start a research paper? If so, could you please provide me some information about the topic you're interested in? >>> [openai._base_client] Retrying request to /chat/completions in 0.488640 seconds === RESEARCHER AGENT ===> That's an interesting topic! To help you tailor the paper, I have a few questions: 1. What is the specific aspect of quantum entanglement you want to focus on? 2. Who is your target audience? (e.g., students, physicists, general public) 3. Are there any specific resources or references you'd like to include? 4. What is the expected length of the paper? 5. Do you have any deadline for this task? These details will help us create a comprehensive paper. >>> [openai._base_client] Retrying request to /chat/completions in 0.449424 seconds === RESEARCHER AGENT ===> Great! Writing for a specialized audience like quantum physics researchers will allow us to delve deep into the topic. Could you now answer the remaining questions: 1. What is the specific aspect of quantum entanglement you want to focus on in your paper? 2. Are there any specific resources or references you'd like to include? 3. What is the expected length of the paper? 4. Do you have any deadline for this task? Having this information will help us structure the paper appropriately. >>> [openai._base_client] Retrying request to /chat/completions in 0.415122 seconds === WRITER AGENT ===> I apologize but I still need some more information to begin writing. Could you please provide: 1. The specific aspect of quantum entanglement you want to focus on. 2. Any specific resources or references to include. 3. The expected length of the paper. 4. The deadline for this task. These details will help me create a well-structured and effective paper. >>> Goodbye! ========================================== Test Complete! ========================================== ```

tests/scripts/test_single_turn_writer.sh Outdated

					
				@ -0,0 +16,4 @@

				echo ""

				timeout 120 python -m cleveragents run \

				  -c examples/scientific_paper_writer_single_turn.yaml \

brent.edwards commented

2025-10-06 21:47:36 +00:00

This filename does not exist.

tests/unit/core/test_tool_command_processing.py

					
				@ -0,0 +20,4 @@

				    """Test suite for tool command processing with nested JSON."""

				    def test_simple_file_read_command(self):

				        """Test that simple file_read commands are detected correctly."""

brent.edwards commented

2025-10-06 22:48:59 +00:00

When I run this test, I get the following warning:

tests/unit/core/test_tool_command_processing.py::TestToolCommandProcessing::test_simple_file_read_command
  /home/brent.edwards/Workspace/cleveragents-core/src/cleveragents/core/application.py:92: DeprecationWarning: There is no current event loop
    self.loop = asyncio.get_event_loop()

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html

When I run this test, I get the following warning: ``` tests/unit/core/test_tool_command_processing.py::TestToolCommandProcessing::test_simple_file_read_command /home/brent.edwards/Workspace/cleveragents-core/src/cleveragents/core/application.py:92: DeprecationWarning: There is no current event loop self.loop = asyncio.get_event_loop() -- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html ```

brent.edwards marked this conversation as resolved

aditya added 3 commits

2025-10-07 15:34:57 +00:00

fix: graph context fix f969995644

chore: add working multi agent langraph yaml files 5190be5781

test: add end to end test scripts for langraph yaml files 758d371b0a

aditya added 2 commits

2025-10-08 12:36:54 +00:00

feat: add text append to file feature to file write tool, now paper writer tool works section by section as well aeb67fc9bd

tests: add behave bdd test cases for new changes 1df5de4087

brent.edwards reviewed

2025-10-08 18:24:01 +00:00

README.md Outdated

					
				@ -0,0 +274,4 @@

				```bash

				# Run all tests

				python -m behave tests/features

brent.edwards commented

2025-10-08 18:24:01 +00:00

Could you include here how to run all of the other tests?

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-08 19:01:29 +00:00

examples/legal_contract_metadata_extractor.yaml Outdated

					
				@ -0,0 +1,384 @@

				# Legal Contract Metadata Extractor - True Multi-Agent Analysis System

brent.edwards commented

2025-10-08 19:01:28 +00:00

When I ran the program against the Business Services Agreement.docx that I listed, it gave the following response:

```json
{
  "document_info": {
    "analysis_date": "2022-10-15",
    "document_type": "service_agreement",
    "total_confidence": 0.91,
    "analysis_version": "1.0"
  },
  "parties": [],
  "dates": {},
  "financial_terms": {},
  "obligations": {},
  "legal_terms": {},
  "risk_assessment": {
    "high_risk_terms": [
      {
        "term": "Liability Limit",
        "description": "The liability limit of 12 months of payments may be insufficient for potential damages, especially in case of data breaches or significant service disruptions.",
        "confidence": 0.90
      },
      {
        "term": "99.9% uptime guarantee",
        "description": "While this is a standard clause, the contract does not specify any penalties or remedies if the provider fails to meet this guarantee.",
        "confidence": 0.85
      }
    ],
    "missing_standard_clauses": [
      {
        "clause": "Data protection and privacy clauses",
        "description": "Given the nature of services provided, the contract should include specific data protection and privacy clauses.",
        "confidence": 0.95
      },
      {
        "clause": "Force Majeure",
        "description": "The contract does not include a force majeure clause, which could leave the parties unprotected in the event of unforeseen circumstances.",
        "confidence": 0.90
      },
      {
        "clause": "Dispute Resolution",
        "description": "The contract lacks a dispute resolution clause, leaving the parties without a clear path to resolve potential disagreements.",
        "confidence": 0.85
      }
    ],
    "ambiguous_language": [
      {
        "term": "24/7 technical support",
        "description": "The contract does not specify what constitutes 'technical support'. This could lead to disagreements about the level of service expected.",
        "confidence": 0.80
      },
      {
        "term": "Network infrastructure management",
        "description": "The contract does not provide a clear definition or scope for 'network infrastructure management'.",
        "confidence": 0.75
      }
    ],
    "compliance_concerns": [
      {
        "concern": "Data Privacy",
        "description": "The contract does not mention compliance with data privacy regulations such as GDPR or CCPA. This could expose both parties to regulatory risks.",
        "confidence": 0.90
      },
      {
        "concern": "Confidentiality",
        "description": "While the contract requires confidentiality, it does not specify the procedures for handling confidential information.",
        "confidence": 0.85
      }
    ],
    "overall_assessment": {
      "risk_score": 0.75,
      "risk_level": "High",
      "recommendation": "Revise the contract to include missing clauses, clarify ambiguous terms, and address compliance concerns."
    }
  },
  "extraction_summary": {}
}

There are several important things wrong:

The analysis date should be today, not three years ago.
parties, dates, financial terms, obligations, and legal terms are not mentioned.
There is no liability limit in the contract.
There is no 99.9% uptime guarantee.
"network infrastructure management" is probably not appropriate.
The contract never mentions confidentiality.
There is no extraction summary.

If I recall correctly, it looks like the same analysis that was given for a different contract.

When I ran the program against the `Business Services Agreement.docx` that I listed, it gave the following response: ``` ```json { "document_info": { "analysis_date": "2022-10-15", "document_type": "service_agreement", "total_confidence": 0.91, "analysis_version": "1.0" }, "parties": [], "dates": {}, "financial_terms": {}, "obligations": {}, "legal_terms": {}, "risk_assessment": { "high_risk_terms": [ { "term": "Liability Limit", "description": "The liability limit of 12 months of payments may be insufficient for potential damages, especially in case of data breaches or significant service disruptions.", "confidence": 0.90 }, { "term": "99.9% uptime guarantee", "description": "While this is a standard clause, the contract does not specify any penalties or remedies if the provider fails to meet this guarantee.", "confidence": 0.85 } ], "missing_standard_clauses": [ { "clause": "Data protection and privacy clauses", "description": "Given the nature of services provided, the contract should include specific data protection and privacy clauses.", "confidence": 0.95 }, { "clause": "Force Majeure", "description": "The contract does not include a force majeure clause, which could leave the parties unprotected in the event of unforeseen circumstances.", "confidence": 0.90 }, { "clause": "Dispute Resolution", "description": "The contract lacks a dispute resolution clause, leaving the parties without a clear path to resolve potential disagreements.", "confidence": 0.85 } ], "ambiguous_language": [ { "term": "24/7 technical support", "description": "The contract does not specify what constitutes 'technical support'. This could lead to disagreements about the level of service expected.", "confidence": 0.80 }, { "term": "Network infrastructure management", "description": "The contract does not provide a clear definition or scope for 'network infrastructure management'.", "confidence": 0.75 } ], "compliance_concerns": [ { "concern": "Data Privacy", "description": "The contract does not mention compliance with data privacy regulations such as GDPR or CCPA. This could expose both parties to regulatory risks.", "confidence": 0.90 }, { "concern": "Confidentiality", "description": "While the contract requires confidentiality, it does not specify the procedures for handling confidential information.", "confidence": 0.85 } ], "overall_assessment": { "risk_score": 0.75, "risk_level": "High", "recommendation": "Revise the contract to include missing clauses, clarify ambiguous terms, and address compliance concerns." } }, "extraction_summary": {} } ``` There are several important things wrong: 1. The analysis date should be today, not three years ago. 2. parties, dates, financial terms, obligations, and legal terms are not mentioned. 3. There is no liability limit in the contract. 4. There is no 99.9% uptime guarantee. 5. "network infrastructure management" is probably not appropriate. 6. The contract never mentions confidentiality. 7. There is no extraction summary. If I recall correctly, it looks like the same analysis that was given for a different contract.

brent.edwards reviewed

2025-10-08 21:45:03 +00:00

examples/legal_contract_metadata_extractor_langgraph.yaml

					
				@ -0,0 +1,495 @@

				# Legal Contract Metadata Extractor - LangGraph with Conditional Routing

brent.edwards commented

2025-10-08 21:45:03 +00:00

Even when I start with /graph contract_analysis_workflow, I still need to regularly prompt to get the machine to move forward.

But here was its final result:

{
  "document_info": {
    "file_path": "/home/brent.edwards/Documents/BusinessServicesAgreement.txt",
    "file_size": 5560,
    "num_lines": 89
  },
  "parties": [
    {
      "name": "University of North East South West Dakota at Springfield",
      "departments": ["College of Computer Science", "Department"]
    },
    {
      "name": "CleverThis",
      "address": "1234 Main Street, Hoboken, NJ 08512",
      "tax_id": "800-555-222"
    }
  ],
  "dates": {
    "effective_date": "2025-10-21",
    "termination_date": "2027-01-01"
  },
  "financial_terms": {
    "total_compensation": "$1,000,000 USD",
    "expense_cap": "$100,000 USD",
    "contract_amount": "$1,100,000 USD"
  },
  "obligations": {
    "contractor_services": "Services as per Exhibit A",
    "compensation": "University to compensate Contractor as per Exhibit B",
    "payment_processing": "University to process payment within 30 days of invoice approval",
    "renewal_option": "University to have option to renew the Agreement for ten additional one-year terms"
  },
  "legal_terms": {
    "governing_law": "The University of North East South West University Service Agreement Terms and Conditions"
  },
  "extraction_summary": {
    "parties": ["University of North East South West Dakota at Springfield", "CleverThis"],
    "dates": ["2025-10-21", "2027-01-01"],
    "financial_terms": ["$1,000,000 USD", "$100,000 USD", "$1,100,000 USD"],
    "key_terms": ["Exhibit A: Statement of Work", "Exhibit B: Payment for Services", "Travel Expenses: Reimbursed without mark-up for reasonable expenses", "Invoicing: Upon completion of a deliverable and acceptance by University"]
  }
}

Even when I start with `/graph contract_analysis_workflow`, I still need to regularly prompt to get the machine to move forward. But here was its final result: ``` { "document_info": { "file_path": "/home/brent.edwards/Documents/BusinessServicesAgreement.txt", "file_size": 5560, "num_lines": 89 }, "parties": [ { "name": "University of North East South West Dakota at Springfield", "departments": ["College of Computer Science", "Department"] }, { "name": "CleverThis", "address": "1234 Main Street, Hoboken, NJ 08512", "tax_id": "800-555-222" } ], "dates": { "effective_date": "2025-10-21", "termination_date": "2027-01-01" }, "financial_terms": { "total_compensation": "$1,000,000 USD", "expense_cap": "$100,000 USD", "contract_amount": "$1,100,000 USD" }, "obligations": { "contractor_services": "Services as per Exhibit A", "compensation": "University to compensate Contractor as per Exhibit B", "payment_processing": "University to process payment within 30 days of invoice approval", "renewal_option": "University to have option to renew the Agreement for ten additional one-year terms" }, "legal_terms": { "governing_law": "The University of North East South West University Service Agreement Terms and Conditions" }, "extraction_summary": { "parties": ["University of North East South West Dakota at Springfield", "CleverThis"], "dates": ["2025-10-21", "2027-01-01"], "financial_terms": ["$1,000,000 USD", "$100,000 USD", "$1,100,000 USD"], "key_terms": ["Exhibit A: Statement of Work", "Exhibit B: Payment for Services", "Travel Expenses: Reimbursed without mark-up for reasonable expenses", "Invoicing: Upon completion of a deliverable and acceptance by University"] } } ```

aditya commented

2025-10-10 14:00:44 +00:00

@brent.edwards
This is EXPECTED BEHAVIOR for the LangGraph version with loop-back edges.

The workflow is designed for interactive review:

Each stage completes and loops back to coordinator
User reviews the output
User prompts to continue to next stage
Provides quality control and user oversight

This is IDEAL for professional legal analysis where lawyers want to:

Review each stage before proceeding
Provide corrections or feedback
Control workflow pace
Ensure accuracy at each step

For AUTOMATIC execution without manual prompts, use the stream-based version
which processes through all stages in one execution.

The final JSON output shows the system works correctly - all data was
accurately extracted from the contract.

@brent.edwards This is EXPECTED BEHAVIOR for the LangGraph version with loop-back edges. The workflow is designed for interactive review: - Each stage completes and loops back to coordinator - User reviews the output - User prompts to continue to next stage - Provides quality control and user oversight This is IDEAL for professional legal analysis where lawyers want to: - Review each stage before proceeding - Provide corrections or feedback - Control workflow pace - Ensure accuracy at each step For AUTOMATIC execution without manual prompts, use the stream-based version which processes through all stages in one execution. The final JSON output shows the system works correctly - all data was accurately extracted from the contract.

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-08 21:48:46 +00:00

src/cleveragents/agents/llm.py

					
				@ -155,1 +158,4 @@

				            elif self.config.get("memory_enabled", False):

				                # Fall back to agent's own memory if no context history

				                history = await self.get_memory("conversation_history", [])

brent.edwards commented

2025-10-08 21:48:46 +00:00

src/cleveragents/agents/llm.py:161:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-09 00:36:19 +00:00

brent.edwards left a comment

I haven't gone through the tests.

src/cleveragents/agents/tool.py Outdated

					
				@ -288,2 +295,3 @@

				            with open(filepath, "r", encoding="utf-8") as f:

				                return f.read()

				                content = f.read()

brent.edwards commented

2025-10-08 21:50:13 +00:00

src/cleveragents/agents/tool.py:297:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -290,0 +300,4 @@

				            # but we add a prefix for clean terminal display

				            line_count = content.count('\n') + 1

				            char_count = len(content)

brent.edwards commented

2025-10-08 21:50:32 +00:00

src/cleveragents/agents/tool.py:303:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -290,0 +303,4 @@

				            # Format: Special marker + metadata + full content

				            # The special marker helps identify this for clean display

				            return f"[FILE_READ_SUCCESS]📄 File: {filepath} | Lines: {line_count} | Size: {char_count} chars\n[FILE_CONTENT_START]\n{content}\n[FILE_CONTENT_END]"

brent.edwards commented

2025-10-08 21:50:49 +00:00

src/cleveragents/agents/tool.py:306:0: C0301: Line too long (161/100) (line-too-long)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -294,3 +311,3 @@

				        self, args: Dict[str, Any], context: Optional[Dict[str, Any]]

				    ) -> str:

				        """File writing tool."""

				        """File writing tool with support for write, append, and insert modes."""

brent.edwards commented

2025-10-08 21:54:52 +00:00

src/cleveragents/agents/tool.py:310:4: R0912: Too many branches (15/12) (too-many-branches)
src/cleveragents/agents/tool.py:310:4: R0915: Too many statements (51/50) (too-many-statements)

src/cleveragents/agents/tool.py:310:4: R0912: Too many branches (15/12) (too-many-branches) src/cleveragents/agents/tool.py:310:4: R0915: Too many statements (51/50) (too-many-statements)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -297,2 +313,4 @@

				        """File writing tool with support for write, append, and insert modes."""

				        filepath = args.get("file", "")

				        content = args.get("content", "")

				        mode = args.get("mode", "w")  # "w" (write/overwrite), "a" (append), "insert" (insert at position)

brent.edwards commented

2025-10-08 21:51:05 +00:00

src/cleveragents/agents/tool.py:316:0: C0301: Line too long (106/100) (line-too-long)

If you're using mode as a set of possible values, I recommend using an Enum instead of a string. An Enum automatically makes sure that the program is one of the correct values. For example, it makes sure that the caller didn't use i instead of insert.

src/cleveragents/agents/tool.py:316:0: C0301: Line too long (106/100) (line-too-long) If you're using `mode` as a set of possible values, I recommend using an [`Enum`](https://docs.python.org/3/library/enum.html) instead of a string. An `Enum` automatically makes sure that the program is one of the correct values. For example, it makes sure that the caller didn't use `i` instead of `insert`.

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -312,3 +336,1 @@

				            if filepath.startswith("/") and not (

				                context and context.get("_unsafe_mode", False)

				            ):

				            if filepath.startswith("/") and not unsafe_mode:

brent.edwards commented

2025-10-08 22:09:25 +00:00

This condition can never happen. From line 326-327, we know that unsafe_mode is true. So the condition always becomes

if filepath.startswith("/") and false:

This line probably should just be

if filepath.startswith("/")

This condition can never happen. From line 326-327, we know that `unsafe_mode` is true. So the condition always becomes ``` if filepath.startswith("/") and false: ``` This line probably should just be ``` if filepath.startswith("/") ```

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py

					
				@ -314,2 +336,3 @@

				            ):

				            if filepath.startswith("/") and not unsafe_mode:

				                logger.error("Absolute path blocked for %s", filepath)

				                raise ExecutionError("Unsafe file path blocked in safe mode")

brent.edwards commented

2025-10-08 22:10:15 +00:00

There is one other thing that is usually restricted from file paths:

What is your opinion of soft links? Should they be allowed?

There is one other thing that is usually restricted from file paths: What is your opinion of soft links? Should they be allowed?

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +343,4 @@

				                with open(filepath, "w", encoding="utf-8") as f:

				                    f.write(content)

				                return f"Successfully wrote {len(content)} characters to {filepath}"

brent.edwards commented

2025-10-08 21:51:21 +00:00

src/cleveragents/agents/tool.py:346:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +344,4 @@

				                    f.write(content)

				                return f"Successfully wrote {len(content)} characters to {filepath}"

				            elif mode == "a":

brent.edwards commented

2025-10-08 21:54:25 +00:00

src/cleveragents/agents/tool.py:341:12: R1705: Unnecessary "elif" after "return", remove the leading "el" from "elif" (no-else-return)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py

					
				@ -321,0 +349,4 @@

				                with open(filepath, "a", encoding="utf-8") as f:

				                    f.write(content)

				                return f"Successfully appended {len(content)} characters to {filepath}"

brent.edwards commented

2025-10-08 21:51:34 +00:00

src/cleveragents/agents/tool.py:352:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py

					
				@ -321,0 +352,4 @@

				            elif mode == "insert":

				                # Insert mode - read existing content, insert at position, write back

				                try:

brent.edwards commented

2025-10-08 23:15:48 +00:00

It would make sense to separate the "add lines to a file" to its own method, to make testing easier. (Not mandatory.)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py

					
				@ -321,0 +354,4 @@

				                # Insert mode - read existing content, insert at position, write back

				                try:

				                    with open(filepath, "r", encoding="utf-8") as f:

				                        existing_lines = f.readlines()

brent.edwards commented

2025-10-08 22:15:43 +00:00

Is it ever possible that this file will be too large to fit in memory?

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +358,4 @@

				                except FileNotFoundError:

				                    # If file doesn't exist, create it with the content

				                    existing_lines = []

brent.edwards commented

2025-10-08 21:51:47 +00:00

src/cleveragents/agents/tool.py:361:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +360,4 @@

				                    existing_lines = []

				                # Determine insertion position

				                if position is None or position == "end":

brent.edwards commented

2025-10-08 23:12:01 +00:00

That position could be:

an int
a string
None

makes me a tiny bit nervous. I come from a strongly-typed background, and I still prefer when it's impossible to give a variable an impossible value, so that it's harder to get things wrong.

I'm not going to push this problem.

That `position` could be: - an int - a string - None makes me a tiny bit nervous. I come from a strongly-typed background, and I still prefer when it's impossible to give a variable an impossible value, so that it's harder to get things wrong. I'm not going to push this problem.

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +362,4 @@

				                # Determine insertion position

				                if position is None or position == "end":

				                    # Append at end

				                    existing_lines.append(content if content.endswith('\n') else content + '\n')

brent.edwards commented

2025-10-08 23:01:40 +00:00

On lines 365, 369, and 374, the code always adds an \n to the end of content if it doesn't already exist. It would be better to just do that once.

On lines 365, 369, and 374, the code always adds an `\n` to the end of content if it doesn't already exist. It would be better to just do that once.

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +371,4 @@

				                elif isinstance(position, int):

				                    # Insert at specific line number (1-indexed)

				                    line_idx = max(0, min(position - 1, len(existing_lines)))

				                    existing_lines.insert(line_idx, content if content.endswith('\n') else content + '\n')

brent.edwards commented

2025-10-08 21:52:00 +00:00

src/cleveragents/agents/tool.py:361:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +374,4 @@

				                    existing_lines.insert(line_idx, content if content.endswith('\n') else content + '\n')

				                    insert_location = line_idx + 1

				                else:

				                    raise ExecutionError(f"Invalid position '{position}'. Use 'start', 'end', or line number.")

brent.edwards commented

2025-10-08 21:52:14 +00:00

src/cleveragents/agents/tool.py:377:0: C0301: Line too long (111/100) (line-too-long)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +375,4 @@

				                    insert_location = line_idx + 1

				                else:

				                    raise ExecutionError(f"Invalid position '{position}'. Use 'start', 'end', or line number.")

brent.edwards commented

2025-10-08 21:52:28 +00:00

src/cleveragents/agents/tool.py:378:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +379,4 @@

				                # Write back

				                with open(filepath, "w", encoding="utf-8") as f:

				                    f.writelines(existing_lines)

brent.edwards commented

2025-10-08 21:52:43 +00:00

src/cleveragents/agents/tool.py:382:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +380,4 @@

				                with open(filepath, "w", encoding="utf-8") as f:

				                    f.writelines(existing_lines)

				                return f"Successfully inserted {len(content)} characters at line {insert_location} in {filepath}"

brent.edwards commented

2025-10-08 21:52:56 +00:00

src/cleveragents/agents/tool.py:383:8: W0621: Redefining name 're' from outer scope (line 11) (redefined-outer-name)
src/cleveragents/agents/tool.py:383:8: W0404: Reimport 're' (imported line 11) (reimported)
src/cleveragents/agents/tool.py:383:8: C0415: Import outside toplevel (re) (import-outside-toplevel)

src/cleveragents/agents/tool.py:383:8: W0621: Redefining name 're' from outer scope (line 11) (redefined-outer-name) src/cleveragents/agents/tool.py:383:8: W0404: Reimport 're' (imported line 11) (reimported) src/cleveragents/agents/tool.py:383:8: C0415: Import outside toplevel (re) (import-outside-toplevel)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +381,4 @@

				                    f.writelines(existing_lines)

				                return f"Successfully inserted {len(content)} characters at line {insert_location} in {filepath}"

brent.edwards commented

2025-10-08 21:53:09 +00:00

src/cleveragents/agents/tool.py:384:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +383,4 @@

				                return f"Successfully inserted {len(content)} characters at line {insert_location} in {filepath}"

				            else:

				                raise ExecutionError(f"Invalid mode '{mode}'. Use 'w' (write), 'a' (append), or 'insert'.")

brent.edwards commented

2025-10-08 21:53:28 +00:00

src/cleveragents/agents/tool.py:386:0: C0301: Line too long (107/100) (line-too-long)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +384,4 @@

				            else:

				                raise ExecutionError(f"Invalid mode '{mode}'. Use 'w' (write), 'a' (append), or 'insert'.")

brent.edwards commented

2025-10-08 21:54:00 +00:00

src/cleveragents/agents/tool.py:387:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py

					
				@ -670,0 +714,4 @@

				        # Pattern to match any content between quotes, including newlines

				        pattern = r'"([^"\\]*(?:\\.[^"\\]*)*|[^"]*(?:\n[^"]*)*)"'

				        sanitized = re.sub(pattern, escape_string_content, json_str, flags=re.DOTALL)

brent.edwards commented

2025-10-08 23:43:14 +00:00

In lines 678-682, you checked that json_str was a good JSON file.

Could you do two things:

Turn that into a function (local to _sanitize_json_string is fine.)
Call the check at the end.
If the final string is not good JSON, post a warning message.

This will guarantee that it outputs good JSON, in case you missed any necessary transformations.

In lines 678-682, you checked that `json_str` was a good JSON file. Could you do two things: 1. Turn that into a function (local to `_sanitize_json_string` is fine.) 2. Call the check at the end. 3. If the final string is not good JSON, post a warning message. This will guarantee that it outputs good JSON, in case you missed any necessary transformations.

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -670,0 +732,4 @@

				        try:

				            # Find the appropriate agent that can execute this tool

				            target_agent = None

				            for _, agent in self.agents.items():

brent.edwards commented

2025-10-09 00:00:58 +00:00

I think this line would be a touch clearer as

for agent in self.agents.values():

I think this line would be a touch clearer as ``` for agent in self.agents.values(): ```

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -730,8 +869,16 @@ class ReactiveCleverAgentsApp:

				        # Execute graph

				        try:

				            print(f"Executing graph '{graph_name}'...")

brent.edwards commented

2025-10-08 23:36:53 +00:00

src/cleveragents/core/application.py:872:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -732,1 +871,4 @@

				            print(f"Executing graph '{graph_name}'...")

				            # Prepare metadata with unsafe mode and context

				            metadata: Dict[str, Any] = {"context": self.config.global_context}

brent.edwards commented

2025-10-08 23:34:23 +00:00

src/cleveragents/core/application.py:874: error: Item "None" of "ReactiveConfig | None" has no attribute "global_context" [union-attr]

brent.edwards marked this conversation as resolved

src/cleveragents/core/application.py Outdated

					
				@ -733,0 +873,4 @@

				            # Prepare metadata with unsafe mode and context

				            metadata: Dict[str, Any] = {"context": self.config.global_context}

				            metadata["_unsafe_mode"] = self.unsafe

brent.edwards commented

2025-10-08 23:37:07 +00:00

src/cleveragents/core/application.py:876:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/langgraph/graph.py Outdated

					
				@ -393,12 +381,20 @@ class LangGraph:

				                        {"messages": [{"role": "user", "content": input_data}]},

				                        mode=StateUpdateMode.APPEND,

				                    )

brent.edwards commented

2025-10-09 00:08:14 +00:00

src/cleveragents/langgraph/graph.py:384:0: C0303: Trailing whitespace (trailing-whitespace)

aditya commented

2025-10-17 20:27:55 +00:00

@brent.edwards it has been fixed

brent.edwards marked this conversation as resolved

src/cleveragents/langgraph/nodes.py

					
				@ -151,1 +150,3 @@

				            agent_input = last_message.get("content", "")

				            # Check if this is a tool agent that should receive the last message

				            # (regardless of role) to process commands from other agents

				            from cleveragents.agents.tool import ToolAgent

brent.edwards commented

2025-10-09 00:35:33 +00:00

src/cleveragents/langgraph/nodes.py:152:12: C0415: Import outside toplevel (cleveragents.agents.tool.ToolAgent) (import-outside-toplevel)

brent.edwards marked this conversation as resolved

src/cleveragents/langgraph/nodes.py Outdated

					
				@ -152,0 +160,4 @@

				                    if msg.get("role") == "user":

				                        last_user_message = msg.get("content", "")

				                        break

brent.edwards commented

2025-10-09 00:34:57 +00:00

src/cleveragents/langgraph/nodes.py:163:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

src/cleveragents/langgraph/nodes.py Outdated

					
				@ -156,0 +171,4 @@

				            "conversation_history": state.messages,

				            "full_context": True

				        }

brent.edwards commented

2025-10-09 00:35:10 +00:00

src/cleveragents/langgraph/nodes.py:174:0: C0303: Trailing whitespace (trailing-whitespace)

brent.edwards marked this conversation as resolved

aditya added 1 commit

2025-10-09 15:02:36 +00:00

fix: resolve linting issues - trailing whitespace and line length bc8cec1d89

brent.edwards reviewed

2025-10-09 20:05:45 +00:00

brent.edwards reviewed

2025-10-09 20:49:51 +00:00

README.md Outdated

					
				@ -0,0 +272,4 @@

				### Running Tests

				```bash

				# Run all tests

brent.edwards commented

2025-10-09 20:49:51 +00:00

One very small change: Could you change this from # Run all tests to # Run all BDD tests or # Run all feature tests?

One very small change: Could you change this from `# Run all tests` to `# Run all BDD tests` or `# Run all feature tests`?

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-09 22:00:28 +00:00

README.md Outdated

					
				@ -0,0 +294,4 @@

				tox -e py312-nocov

				# Individual test scripts:

				bash tests/scripts/test_single_turn_writer.sh

brent.edwards commented

2025-10-09 22:00:28 +00:00

This test fails with

❯ bash tests/scripts/test_single_turn_writer.sh 
Testing single-turn paper writer...

Usage: python -m cleveragents run [OPTIONS]
Try 'python -m cleveragents run --help' for help.

Error: Invalid value for '--config' / '-c': File 'examples/scientific_paper_writer_single_turn.yaml' does not exist.

This test fails with ``` ❯ bash tests/scripts/test_single_turn_writer.sh Testing single-turn paper writer... Usage: python -m cleveragents run [OPTIONS] Try 'python -m cleveragents run --help' for help. Error: Invalid value for '--config' / '-c': File 'examples/scientific_paper_writer_single_turn.yaml' does not exist. ```

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-09 22:02:07 +00:00

README.md Outdated

					
				@ -0,0 +295,4 @@

				# Individual test scripts:

				bash tests/scripts/test_single_turn_writer.sh

				bash tests/scripts/test_multi_agent_paper_writer_langgraph.sh

brent.edwards commented

2025-10-09 22:02:07 +00:00

Passes!

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-09 22:02:59 +00:00

README.md Outdated

					
				@ -0,0 +296,4 @@

				# Individual test scripts:

				bash tests/scripts/test_single_turn_writer.sh

				bash tests/scripts/test_multi_agent_paper_writer_langgraph.sh

				bash tests/scripts/test_legal_contract_analyzer_langgraph.sh

brent.edwards commented

2025-10-09 22:02:59 +00:00

(deleted; no longer true)

brent.edwards commented

2025-10-10 20:46:52 +00:00

bash tests/scripts/test_paper_writer_section_by_section.sh fails because test_paper_writer_section_by_section.sh no longer exists.

`bash tests/scripts/test_paper_writer_section_by_section.sh` fails because `test_paper_writer_section_by_section.sh` no longer exists.

aditya commented

2025-10-17 20:22:38 +00:00

Updated the same.

All 3 Test Scripts PASSING:

test_multi_agent_paper_writer_langgraph.sh
Paper generated successfully
File saved correctly
Multi-agent workflow validated
All 4 agents active
test_legal_contract_analyzer_langgraph.sh
Data extraction accurate
Risk assessment completed
JSON formatting working
File saving successful
test_paper_writer_section_by_section.sh
Section-by-section writing works
File append operations successful
Content preservation validated

Current scripts in tests/scripts/:

test_legal_contract_analyzer_langgraph.sh
test_multi_agent_paper_writer_langgraph.sh
test_paper_writer_section_by_section.sh

README.md has been updated to remove reference to test_single_turn_writer.sh

Updated the same. All 3 Test Scripts PASSING: 1. test_multi_agent_paper_writer_langgraph.sh Paper generated successfully File saved correctly Multi-agent workflow validated All 4 agents active 2. test_legal_contract_analyzer_langgraph.sh Data extraction accurate Risk assessment completed JSON formatting working File saving successful 3. test_paper_writer_section_by_section.sh Section-by-section writing works File append operations successful Content preservation validated Current scripts in tests/scripts/: - test_legal_contract_analyzer_langgraph.sh - test_multi_agent_paper_writer_langgraph.sh - test_paper_writer_section_by_section.sh README.md has been updated to remove reference to test_single_turn_writer.sh

brent.edwards commented

2025-10-20 20:31:39 +00:00

Yup! They pass!

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-09 22:09:07 +00:00

README.md Outdated

					
				@ -0,0 +297,4 @@

				bash tests/scripts/test_single_turn_writer.sh

				bash tests/scripts/test_multi_agent_paper_writer_langgraph.sh

				bash tests/scripts/test_legal_contract_analyzer_langgraph.sh

				bash tests/scripts/test_paper_writer_section_by_section.sh

brent.edwards commented

2025-10-09 22:09:07 +00:00

When I ran this test, every section had an extra [SECTION_COMPLETE] at the bottom. Here are some portions of the paper illustrating it:

...
finally proposes recommendations for establishing a robust ethical framework for AI, incorporating stakeholder engagement, human rights protections, and regulatory oversight.

[SECTION_COMPLETE]
## Introduction
Artificial Intelligence (AI) has rapidly transformed various sectors, from healthcare to finance, education to transportation, yielding unprecedented efficiencies and possibilities. However, as AI technologies advance, so too do the ethical dilemmas
...
propose recommendations for a comprehensive ethical framework for AI. Our goal is not to hinder AI development but to ensure it aligns with human values and societal norms while offering its many benefits.

[SECTION_COMPLETE]
## Methodology
The study of AI ethics in this paper is primarily conducted through a comprehensive literature review and critical analysis.

When I ran this test, every section had an extra `[SECTION_COMPLETE]` at the bottom. Here are some portions of the paper illustrating it: ``` ... finally proposes recommendations for establishing a robust ethical framework for AI, incorporating stakeholder engagement, human rights protections, and regulatory oversight. [SECTION_COMPLETE] ## Introduction Artificial Intelligence (AI) has rapidly transformed various sectors, from healthcare to finance, education to transportation, yielding unprecedented efficiencies and possibilities. However, as AI technologies advance, so too do the ethical dilemmas ... propose recommendations for a comprehensive ethical framework for AI. Our goal is not to hinder AI development but to ensure it aligns with human values and societal norms while offering its many benefits. [SECTION_COMPLETE] ## Methodology The study of AI ethics in this paper is primarily conducted through a comprehensive literature review and critical analysis. ```

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-09 22:21:55 +00:00

README.md Outdated

					
				@ -0,0 +299,4 @@

				bash tests/scripts/test_legal_contract_analyzer_langgraph.sh

				bash tests/scripts/test_paper_writer_section_by_section.sh

				bash tests/scripts/test_multi_agent_interactive.sh

				bash tests/scripts/test_legal_contract_analyzer.sh

brent.edwards commented

2025-10-09 22:21:55 +00:00

Although this reported that the test ran successfully, the summary shows other problems:

Analysis complete. To save results, ask user for filename.

✅ TEST 1 PASSED: System executed successfully

=== TEST 2: Agent Simulation Verification ===
Checking if orchestrator simulates specialist agents...
❌ FILE_LOADER agent simulation not found
❌ Analysis agent simulation not found (may continue in next interaction)

=== TEST 3: Tool Execution Verification ===
Checking if file reading tool executed properly...
⚠  File reading tool execution command not found

=== TEST 4: Contract Loading Validation ===
Checking if contract was loaded...
❌ Contract content not found in output

=== TEST 5: Metadata Extraction Validation ===
Checking if contract metadata was extracted...
❌ Party extraction not working
❌ Party extraction not working
⚠  Financial terms extraction may need additional processing

=== TEST 6: Performance Check ===
Checking if processing completed within reasonable time...
✅ Processing completed successfully

==========================================
Test Summary:
==========================================
Agent simulations found: 0

That kind of difference between obvious problems and the summary that everything worked will make users VERY nervous.

Although this reported that the test ran successfully, the summary shows other problems: ``` Analysis complete. To save results, ask user for filename. ✅ TEST 1 PASSED: System executed successfully === TEST 2: Agent Simulation Verification === Checking if orchestrator simulates specialist agents... ❌ FILE_LOADER agent simulation not found ❌ Analysis agent simulation not found (may continue in next interaction) === TEST 3: Tool Execution Verification === Checking if file reading tool executed properly... ⚠ File reading tool execution command not found === TEST 4: Contract Loading Validation === Checking if contract was loaded... ❌ Contract content not found in output === TEST 5: Metadata Extraction Validation === Checking if contract metadata was extracted... ❌ Party extraction not working ❌ Party extraction not working ⚠ Financial terms extraction may need additional processing === TEST 6: Performance Check === Checking if processing completed within reasonable time... ✅ Processing completed successfully ========================================== Test Summary: ========================================== Agent simulations found: 0 ``` That kind of difference between obvious problems and the summary that everything worked will make users VERY nervous.

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-09 22:35:57 +00:00

src/cleveragents/agents/tool.py Outdated

					
				@ -297,2 +315,4 @@

				        """File writing tool with support for write, append, and insert modes."""

				        filepath = args.get("file", "")

				        content = args.get("content", "")

				        mode = args.get("mode", "w")  # "w" (write/overwrite), "a" (append), "insert" (insert at position)

brent.edwards commented

2025-10-09 22:35:57 +00:00

src/cleveragents/agents/tool.py:318:0: C0301: Line too long (106/100) (line-too-long)

aditya commented

2025-10-17 20:07:21 +00:00

@brent.edwards FIXED in commit 3aa3d62

The line was shortened from 106 to 76 characters by simplifying the comment:

@brent.edwards FIXED in commit 3aa3d62 The line was shortened from 106 to 76 characters by simplifying the comment:

aditya added 1 commit

2025-10-10 14:09:11 +00:00

fix: thread safety AgentWithMemory and improve ToolAgent JSON parsing and mypy issues 3aa3d62389

brent.edwards reviewed

2025-10-10 21:09:42 +00:00

brent.edwards left a comment

A few more comments...

src/cleveragents/agents/base.py

					
				@ -71,3 +72,2 @@

				        self.input_stream.pipe(

				            ops.map(self._process_wrapper),

				            ops.flat_map(lambda future: rx.from_future(future)),

				            ops.map(self._process_wrapper),  # type: ignore[arg-type]

brent.edwards commented

2025-10-10 20:57:42 +00:00

src/cleveragents/agents/base.py:73: error: Unused "type: ignore" comment [unused-ignore]

brent.edwards marked this conversation as resolved

src/cleveragents/agents/base.py Outdated

					
				@ -72,2 +73,2 @@

				            ops.map(self._process_wrapper),

				            ops.flat_map(lambda future: rx.from_future(future)),

				            ops.map(self._process_wrapper),  # type: ignore[arg-type]

				            ops.flat_map(rx.from_future),  # type: ignore[arg-type]

brent.edwards commented

2025-10-10 20:57:58 +00:00

src/cleveragents/agents/base.py:74: error: Unused "type: ignore" comment [unused-ignore]

brent.edwards marked this conversation as resolved

src/cleveragents/agents/base.py Outdated

					
				@ -285,3 +307,3 @@

				                )

				            return rx.create(subscribe)  # type: ignore

				            return rx.create(subscribe)  # type: ignore[no-untyped-call]

brent.edwards commented

2025-10-10 20:58:28 +00:00

src/cleveragents/agents/base.py:309: error: Unused "type: ignore" comment [unused-ignore]

brent.edwards marked this conversation as resolved

src/cleveragents/agents/base.py Outdated

					
				@ -310,3 +332,3 @@

				            return await result_future

				        return ops.map(process_value)

				        return ops.map(process_value)  # type: ignore[arg-type]

brent.edwards commented

2025-10-10 20:58:42 +00:00

src/cleveragents/agents/base.py:334: error: Unused "type: ignore" comment [unused-ignore]

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -81,6 +83,58 @@ class ToolAgent(Agent):

				            else:

				                raise AgentCreationError(f"Invalid tool configuration: {tool}")

				    def _extract_json_from_message(self, message: str) -> Optional[Dict[str, Any]]:

brent.edwards commented

2025-10-10 21:01:44 +00:00

src/cleveragents/agents/tool.py:86:4: R0911: Too many return statements (7/6) (too-many-return-statements)

aditya commented

2025-10-17 19:58:57 +00:00

@brent.edwards

This method has 7 return statements due to three distinct JSON extraction strategies,
each requiring success/failure returns:

Direct JSON parsing (2 returns: success + type check)
Markdown code block extraction (2 returns: success + type check)
Regex pattern matching (2 returns: success + type check)
Final fallback (1 return: None)

The pylint suppression is appropriate because:

The method does ONE clear thing: extract JSON from various formats
Each extraction strategy needs its own success/failure paths
Alternative approaches (nested functions or complex conditionals) would reduce readability
Current pattern seems appropriate for this

@brent.edwards This method has 7 return statements due to three distinct JSON extraction strategies, each requiring success/failure returns: 1. Direct JSON parsing (2 returns: success + type check) 2. Markdown code block extraction (2 returns: success + type check) 3. Regex pattern matching (2 returns: success + type check) 4. Final fallback (1 return: None) The pylint suppression is appropriate because: - The method does ONE clear thing: extract JSON from various formats - Each extraction strategy needs its own success/failure paths - Alternative approaches (nested functions or complex conditionals) would reduce readability - Current pattern seems appropriate for this

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -84,0 +118,4 @@

				                return None  # Not a dict

				            except json.JSONDecodeError:

				                # Code block had invalid JSON, raise error

				                raise

brent.edwards commented

2025-10-10 21:01:27 +00:00

src/cleveragents/agents/tool.py:119:12: W0706: The except handler raises immediately (try-except-raise)

(What this means: there is no need for the try...except statement. It acts exactly as if there were none.)

src/cleveragents/agents/tool.py:119:12: W0706: The except handler raises immediately (try-except-raise) (What this means: there is no need for the `try`...`except` statement. It acts exactly as if there were none.)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -323,1 +346,4 @@

				                    f.write(content)

				                return f"Successfully wrote {len(content)} characters to {filepath}"

				            elif mode == "a":

brent.edwards commented

2025-10-09 22:38:32 +00:00

src/cleveragents/agents/tool.py:343:12: R1705: Unnecessary "elif" after "return", remove the leading "el" from "elif" (no-else-return)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -324,0 +393,4 @@

				                    f"at line {insert_location} in {filepath}"

				                )

				            else:

				                raise ExecutionError(f"Invalid mode '{mode}'. Use 'w' (write), 'a' (append), or 'insert'.")

brent.edwards commented

2025-10-09 22:38:09 +00:00

src/cleveragents/agents/tool.py:396:0: C0301: Line too long (107/100) (line-too-long)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py

					
				@ -292,0 +464,4 @@

				        return prefix + cleaned_content

				    def _handle_insert_position(

				        self, filepath: str, cleaned_content: str, position: Union[None, int, Literal["start", "end"]]

brent.edwards commented

2025-10-10 21:00:17 +00:00

src/cleveragents/agents/tool.py:467:0: C0301: Line too long (102/100) (line-too-long)

brent.edwards marked this conversation as resolved

src/cleveragents/agents/tool.py Outdated

					
				@ -321,0 +529,4 @@

				                    f.write(cleaned_content)

				                return f"Successfully wrote {len(cleaned_content)} characters to {filepath}"

				            elif mode == "a":

brent.edwards commented

2025-10-10 21:02:55 +00:00

src/cleveragents/agents/tool.py:526:12: R1705: Unnecessary "elif" after "return", remove the leading "el" from "elif" (no-else-return)

aditya commented

2025-10-17 20:02:31 +00:00

Agreed. The code now uses:

Line 481: if mode == "w": ...
Line 487: if mode == "a": ... (changed from elif)
Line 494: if mode == "insert": ... (changed from elif)

All elif statements after returns have been corrected to if statements.

Agreed. The code now uses: - Line 481: if mode == "w": ... - Line 487: if mode == "a": ... (changed from elif) - Line 494: if mode == "insert": ... (changed from elif) All elif statements after returns have been corrected to if statements.

src/cleveragents/core/application.py

					
				@ -670,0 +705,4 @@

				        # ✅ NEW: Validate the sanitized result

				        if not is_valid_json(sanitized):

				            logger.warning(

brent.edwards commented

2025-10-10 21:06:59 +00:00

src/cleveragents/core/application.py:708:12: E0602: Undefined variable 'logger' (undefined-variable)

(You need to construct the logger variable around line 46ish.)

src/cleveragents/core/application.py:708:12: E0602: Undefined variable 'logger' (undefined-variable) (You need to construct the `logger` variable around line 46ish.)

brent.edwards marked this conversation as resolved

aditya added 1 commit

2025-10-13 13:06:12 +00:00

fix: fix multi agent paper writer langgraph script, mypy and pylint issues fe7cb7ac47

brent.edwards reviewed

2025-10-15 21:56:44 +00:00

src/cleveragents/core/application.py

					
				@ -20,3 +22,1 @@

				from rx import operators as ops

				from rx.core import Observer

				from rx.scheduler.eventloop import AsyncIOScheduler

				from rx.core import Observer  # type: ignore[attr-defined]

brent.edwards commented

2025-10-15 21:56:44 +00:00

My mypy --strict reports:

src/cleveragents/core/application.py:22: error: Unused "type: ignore" comment  [unused-ignore]

My `mypy --strict` reports: ``` src/cleveragents/core/application.py:22: error: Unused "type: ignore" comment [unused-ignore] ```

aditya commented

2025-10-16 11:50:31 +00:00

Hi Brent,

Thank you for pointing that out. The issue related to the mypy errors has been resolved in another branch named mypy-pylint-fixed.

Hi Brent, Thank you for pointing that out. The issue related to the mypy errors has been resolved in another branch named mypy-pylint-fixed.

brent.edwards marked this conversation as resolved

brent.edwards reviewed

2025-10-15 21:57:14 +00:00

src/cleveragents/core/application.py

					
				@ -21,2 +22,2 @@

				from rx.core import Observer

				from rx.scheduler.eventloop import AsyncIOScheduler

				from rx.core import Observer  # type: ignore[attr-defined]

				from rx.scheduler.eventloop import AsyncIOScheduler  # type: ignore[attr-defined]

brent.edwards commented

2025-10-15 21:57:14 +00:00

My mypy --strict reports:

src/cleveragents/core/application.py:23: error: Unused "type: ignore" comment  [unused-ignore]

My `mypy --strict` reports: ``` src/cleveragents/core/application.py:23: error: Unused "type: ignore" comment [unused-ignore] ```

aditya commented

2025-10-16 11:50:41 +00:00

Hi Brent,

Thank you for pointing that out. The issue related to the mypy errors has been resolved in another branch named mypy-pylint-fixed.

Hi Brent, Thank you for pointing that out. The issue related to the mypy errors has been resolved in another branch named mypy-pylint-fixed.

brent.edwards marked this conversation as resolved

aditya added 1 commit

2025-10-17 20:10:58 +00:00

test; fix the failing behave tests e60de39c6d

aditya added 1 commit

2025-10-17 20:27:32 +00:00

docs: update docs/YAML_SYNTAX.md and README.md 904decff02

aditya added 1 commit

2025-10-17 20:53:26 +00:00

test: fix test scripts and update readme fb92bf6790

brent.edwards reviewed

2025-10-20 19:14:37 +00:00

README.md

					
				@ -0,0 +40,4 @@

				cd cleveragents-core

				pip install -e .

				```

brent.edwards commented

2025-10-20 19:14:37 +00:00

You are now duplicating the installation instructions. These two sets of instructions are identical. Please remove one.

brent.edwards marked this conversation as resolved

aditya added 1 commit

2025-10-23 15:10:00 +00:00

test: fix tool_agent shell command mocking and cleanup and update README.md 3e6e984567

brent.edwards reviewed

2025-10-23 19:49:40 +00:00

README.md

					
				@ -0,0 +191,4 @@

				# Google Gemini

				export GOOGLE_API_KEY="your-google-key"

				```

				```

brent.edwards commented

2025-10-23 19:48:09 +00:00

You still need to get rid of the doubled set of ``` here. I have examples below of the difference when you remove the second set of ```.

You still need to get rid of the doubled set of \`\`\` here. I have examples below of the difference when you remove the second set of ```.

Before-removal.png

523 KiB

After-removal.png

474 KiB

aditya added 1 commit

2025-10-24 17:32:35 +00:00

update README.md to fix linting issues 237ecd1682

aditya referenced this pull request

2025-10-27 10:04:02 +00:00

mypy-pylint-fixed #21

khird reviewed

2025-10-28 14:03:12 +00:00

src/cleveragents/agents/base.py

					
				@ -196,0 +207,4 @@

				                if self._memory_lock_instance is None:

				                    try:

				                        loop = asyncio.get_event_loop()

				                    except RuntimeError:

khird commented

2025-10-27 20:05:58 +00:00

Your README claims compatibility with Python 3.9+, but getting a RuntimeError when no event loop exists is new behaviour in 3.14.

src/cleveragents/agents/tool.py

					
				@ -84,0 +100,4 @@

				        message_stripped = message.strip()

				        # If message looks like JSON (starts with { and ends with }), it must be valid JSON

				        if message_stripped.startswith("{") and message_stripped.endswith("}"):

khird commented

2025-10-27 20:40:37 +00:00

The rules here for what you'll accept are inconsistent.

Suppose you are given {"number": 1} - this goes into the first case and it is valid JSON so you get the answer.
Suppose you are given {"number": 1}, {"letter": "a"} - this goes into the first case but is invalid JSON so you get None.
Suppose you are given {"number": 1}, {"letter": "a"}, "garbage" - this goes into the third case where you get the contents of the first dict (i.e. the same answer as in the first example).

Why not fall through the first case instead of returning None, so that the second example would match the third case and you'd get the same behaviour as in the third example?

The rules here for what you'll accept are inconsistent. - Suppose you are given `{"number": 1}` - this goes into the first case and it is valid JSON so you get the answer. - Suppose you are given `{"number": 1}, {"letter": "a"}` - this goes into the first case but is invalid JSON so you get None. - Suppose you are given `{"number": 1}, {"letter": "a"}, "garbage"` - this goes into the third case where you get the contents of the first dict (i.e. the same answer as in the first example). Why not fall through the first case instead of returning None, so that the second example would match the third case and you'd get the same behaviour as in the third example?

src/cleveragents/agents/tool.py

					
				@ -292,0 +377,4 @@

				        self, filepath: str, unsafe_mode: bool

				    ) -> None:

				        """Validate file path for safety in safe mode."""

				        if not self.safe_mode:

khird commented

2025-10-27 20:55:41 +00:00

The interaction of the parameter unsafe_mode and the member variable self.safe_mode looks suspicious here.

If safe mode is off, then even if the caller tries to turn off unsafe mode, zero validation actually occurs.

If safe mode is on, then parent directories can't be referenced, but absolute filepaths can, unless the caller turns off unsafe mode as well.

Are you sure this is the desired behaviour?

The interaction of the parameter `unsafe_mode` and the member variable `self.safe_mode` looks suspicious here. If safe mode is off, then even if the caller tries to turn off unsafe mode, zero validation actually occurs. If safe mode is on, then parent directories can't be referenced, but absolute filepaths can, unless the caller turns off unsafe mode as well. Are you sure this is the desired behaviour?

src/cleveragents/core/application.py

					
				@ -670,0 +724,4 @@

				            return f'"{content}"'

				        # Pattern to match any content between quotes, including newlines

				        pattern = r'"([^"\\]*(?:\\.[^"\\]*)*|[^"]*(?:\n[^"]*)*)"'

khird commented

2025-10-28 13:23:22 +00:00

Please double-check this regex - you're clearly doing something with the backslashes, but the explanatory comment doesn't describe what. To me, it looks like anything matched by the expression to the left of the pipe should also be matched by the expression to its right, so it'd be redundant. But I'm not very confident I've mentally parsed it correctly.

tests/unit/core/test_tool_command_processing.py

					
				@ -0,0 +154,4 @@

				        print("✅ TEST PASSED: Complex nested JSON from actual usage")

				    def test_old_pattern_would_fail(self):

				        """Verify that the old regex pattern would fail with nested JSON."""

khird commented

2025-10-28 14:01:53 +00:00

What is the point of testing this? Writing tests against our code makes sense; writing tests against code that is no longer in the project? Not so much.

This pull request can be merged automatically.

You are not authorized to merge this pull request.

View command line instructions

Checkout

From your project repository, check out a new branch and test the changes.

git fetch -u origin feat/multi-agent:feat/multi-agent

git switch feat/multi-agent

Merge

Merge the changes and update on Forgejo.

Warning: The "Autodetect manual merge" setting is not enabled for this repository, you will have to mark this pull request as manually merged afterwards.

git switch master

git merge --no-ff feat/multi-agent

git switch feat/multi-agent

git rebase master

git switch master

git merge --ff-only feat/multi-agent

git switch feat/multi-agent

git rebase master

git switch master

git merge --no-ff feat/multi-agent

git switch master

git merge --squash feat/multi-agent