test: add TDD bug-capture test for #989 — JSON decode crash in persistence #1166
Labels
No labels
auto/needs-reevaluation
controller-managed
auto/blocked-by-deps
auto/ci-timeout
auto/claimed-implementer
auto/claimed-merge
auto/claimed-reviewer
auto/driver-down
auto/invariant-violation
auto/last-attempt-tier-0
auto/last-attempt-tier-1
auto/last-attempt-tier-2
auto/last-attempt-tier-min
Automation Tracking
auto/needs-conflict-resolution
auto/needs-implementer
auto/postmortem
auto/ready-to-merge
auto/restart-throttled
auto/revert
auto/sentinel
auto/stale-inactivity
auto/unstable
Blocked
Bounty
$100
Bounty
$1000
Bounty
$10000
Bounty
$20
Bounty
$2000
Bounty
$250
Bounty
$50
Bounty
$500
Bounty
$5000
Bounty
$750
MoSCoW
Could have
MoSCoW
Must have
MoSCoW
Should have
Needs Feedback
Points
1
Points
13
Points
2
Points
21
Points
3
Points
34
Points
5
Points
55
Points
8
Points
88
Priority
Backlog
Priority
CI Blocker
Priority
Critical
Priority
High
Priority
Low
Priority
Medium
Signed-off: Owner
Signed-off: Scrum Master
Signed-off: Tech Lead
Spike
State
Completed
State
Duplicate
State
In Progress
State
In Review
State
Paused
State
Unverified
State
Verified
State
Wont Do
Type
Automation
Type
Bug
Type
Discussion
Type
Documentation
Type
Epic
Type
Feature
Type
Legendary
Type
Refactor
Type
Support
Type
Task
Type
Testing
No project
No assignees
2 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
cleveragents/cleveragents-core!1166
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "tdd/m5-json-decode-crash"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
features/tdd_json_decode_crash_persistence.featurewith required tags:@tdd_bug @tdd_bug_989 @tdd_expected_fail.features/steps/tdd_json_decode_crash_persistence_steps.pythat persist malformedsafety_jsonand executeAutomationProfileRepository.get_by_name(...)to reproduce bug #989.JSONDecodeError), which currently fails for the correct reason and is inverted by@tdd_expected_fail.Why
Bug #989 reports that persistence-domain conversion leaks raw
JSONDecodeErrorwhen corrupted JSON is stored in automation profile rows. This PR adds the required pre-fix regression capture so the subsequent bugfix can remove@tdd_expected_failand prove the defect is fixed.Notes
robot/resource_dag.robotwas stabilized by sharing a single SQLAlchemy session inside inline scripts. This was required to keep mandatory nox integration quality gates passing for this branch.Quality Gates
nox -s lint✅nox -s typecheck✅nox -s unit_tests✅nox -s integration_tests✅nox -s e2e_tests✅nox -s coverage_report✅ (97.67%)nox(full default suite) ✅Closes #1094
Code Review Note
Unable to review — the branch
tdd/m5-json-decode-crashwas not found on the remote. Please verify the branch exists and has been pushed.Day 48 Planning Review — TDD PR for Bug #989
All 8 TDD review criteria pass:
tdd/m5-json-decode-crash✓@tdd_bug @tdd_bug_989 @tdd_expected_fail— all three present and correctly named ✓test:prefix ✓_to_domaindecode, asserts corruption-specific error (not rawJSONDecodeError) ✓Test quality is excellent — the three-stage assertion structure (error raised, not raw JSONDecodeError, contains "corrupt") is well-designed.
Minor note: The diff includes changes to
robot/resource_dag.robot(SQLAlchemy session sharing for 3 Robot test cases). The PR description documents this as required for CI stability. While ideally TDD PRs contain only the test, this is documented and defensible. Not blocking.APPROVED. @brent.edwards — please reply to @freemo's comment about branch availability.
Review: APPROVED with Comments
Well-written TDD test. The Robot
resource_dag.robotfix (repositories accepting session callables) is tangential to the TDD test — per CONTRIBUTING.md §Atomic Commits, it should be a separate commit. The TDD test itself is clean with proper type annotations and docstrings.Review claimed by reviewer pool instance reviewer-pool-2. Dispatching independent code review.
Review claimed by reviewer pool instance reviewer-pool-1. Dispatching independent code review.
Independent Code Review — APPROVED (with conflict note)
TDD Test Quality: ✅ Excellent
The Behave TDD bug-capture test for #989 is well-designed:
Feature file (
features/tdd_json_decode_crash_persistence.feature):@tdd_bug @tdd_bug_989 @tdd_expected_fail✅Step definitions (
features/steps/tdd_json_decode_crash_persistence_steps.py):from __future__ import annotations✅# type: ignoresuppressions ✅Test logic correctly captures bug #989:
'{"require_sandbox": true'— missing closing brace)get_by_name()which triggers_to_domain()→json.loads()on corrupt dataBug capture verification: Confirmed that
_to_domain()at line 4512 ofrepositories.pydoes_json.loads(safety_json_str)without try/except, so the rawJSONDecodeErrorWILL leak through, confirming the bug exists and the test captures it correctly.Commit Quality: ✅
test: add TDD bug-capture test for #989 — JSON decode crash in persistenceISSUES CLOSED: #1094✅PR Metadata: ✅
Type/Testinglabel ✅Closes #1094in body ✅Minor Observation (non-blocking)
The test step uses column names (
auto_strategize,auto_execute,auto_apply, etc.) that don't exist onAutomationProfileModel. The actual columns aredecompose_task,create_tool,select_tool, etc. This works silently due to__allow_unmapped__ = Trueon the model (the unmapped kwargs become Python attributes, while the actual DB columns get their defaults). The test still correctly captures the bug since the focus is onsafety_jsoncorruption. When the bugfix PR removes@tdd_expected_fail, consider updating these to match the actual model columns for clarity.⚠️ Merge Conflict
The PR is marked
mergeable: false. Therobot/resource_dag.robotchanges (session sharing) were already merged to master via commitb6c31696(PR for #762) with improvements (variable renamed toshared_session, explicitclose()calls,timeout=60s). The branch needs a rebase onto master, dropping the Robot file changes since they're already superseded.Code quality: APPROVED. Merge blocked only by conflict — needs rebase.
@ -0,0 +39,4 @@session.add(AutomationProfileModel(name="bug-989-corrupt-json",description="Corrupt JSON regression fixture",Minor: These column names (
auto_strategize,auto_execute,auto_apply, etc.) don't exist onAutomationProfileModel. The actual float threshold columns aredecompose_task,create_tool,select_tool,edit_code,execute_command,create_file,delete_content,access_network,install_dependency,modify_config,approve_plan.This works silently because the model has
__allow_unmapped__ = True, so SQLAlchemy accepts arbitrary kwargs as Python attributes while the actual DB columns get theirdefault=0.0values. The test still correctly captures bug #989 since the focus is onsafety_jsoncorruption, not these threshold values.Non-blocking, but consider updating when the bugfix PR removes
@tdd_expected_fail.Review claimed by reviewer pool instance pr-reviewer-pool-2813550-1775153400. Dispatching independent code review.
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer
Independent Code Review — APPROVED (merge blocked by conflict)
TDD Test Quality: ✅ Excellent
The Behave TDD bug-capture test for #989 is well-structured and correctly captures the defect:
Feature file (
features/tdd_json_decode_crash_persistence.feature):@tdd_bug @tdd_bug_989 @tdd_expected_fail✅Step definitions (
features/steps/tdd_json_decode_crash_persistence_steps.py):from __future__ import annotations✅# type: ignoresuppressions ✅Test logic correctly captures bug #989:
'{"require_sandbox": true'— missing closing brace)get_by_name()which triggers_to_domain()→json.loads()on corrupt dataCommit Quality: ✅
ISSUES CLOSED: #1094✅PR Metadata: ✅
Type/Testinglabel ✅Closes #1094in body ✅⚠️ Merge Conflict — Rebase Required
The PR is marked
mergeable: false. Therobot/resource_dag.robotsession-sharing changes in this branch conflict with master because equivalent (and improved) changes were already merged. The branch needs a rebase onto master, dropping the Robot file changes since they are already superseded on master.Code quality: APPROVED. Merge is blocked only by the conflict — a rebase is needed.
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer
⚠️ Merge Conflict — Rebase Required
@brent.edwards — This PR has been approved (code quality is excellent), but it cannot be merged due to a conflict in
robot/resource_dag.robot.The session-sharing changes in that file have already been merged to master via a separate PR. To resolve this:
master:git rebase masterrobot/resource_dag.robotchanges (accept master's version) since they are already supersededOnce the conflict is resolved and CI passes, this PR can be merged immediately.
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer
Review claimed by reviewer pool instance pr-reviewer-pool-2988182-1775156309. Dispatching independent code review.
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer
Independent Code Review — APPROVED (merge blocked by conflict)
TDD Test Quality: ✅ Excellent
Feature file (
features/tdd_json_decode_crash_persistence.feature):@tdd_bug @tdd_bug_989 @tdd_expected_fail✅Step definitions (
features/steps/tdd_json_decode_crash_persistence_steps.py):from __future__ import annotations✅# type: ignoresuppressions ✅Bug Capture Verification: ✅ Correct
I independently verified the bug path in
repositories.pyline 4512:This line has no try/except, so a malformed
safety_jsonstring will leak a rawJSONDecodeErrorto callers. The test correctly:'{"require_sandbox": true'— missing closing brace) in thesafety_jsoncolumnget_by_name()which triggers_to_domain()→json.loads()on the corrupt datajson.JSONDecodeError(domain should wrap it)This correctly fails today (proving the bug exists) and will pass only when a proper domain-specific error wrapper is added.
Cosmetic Observation (non-blocking)
The test step uses column names (
auto_strategize,auto_execute,auto_apply, etc.) that don't exist as mapped columns onAutomationProfileModel. The actual columns aredecompose_task,create_tool,select_tool, etc. This works silently because__allow_unmapped__ = Trueon the model — the unmapped kwargs become Python attributes while actual DB columns get their defaults. The test still correctly captures the bug since the focus is onsafety_jsoncorruption. Consider updating these to match actual model columns when the bugfix PR removes@tdd_expected_fail.Commit Quality: ✅
test: add TDD bug-capture test for #989 — JSON decode crash in persistence✅ISSUES CLOSED: #1094✅PR Metadata: ✅
Type/Testinglabel ✅Closes #1094in body ✅tdd/m5-json-decode-crashmatches issue metadata ✅Robot File Changes (tangential)
The
robot/resource_dag.robotchanges (session sharing vialambda: session) are documented as required for CI stability. However, master already has improved versions of these changes (usingshared_sessionvariable name and explicitclose()calls). This is the source of the merge conflict.⚠️ Merge Conflict — Rebase Required
The PR is marked
mergeable: false. Therobot/resource_dag.robotsession-sharing changes in this branch conflict with master because equivalent (and improved) changes were already merged. The branch needs a rebase onto master, dropping the Robot file changes since they are already superseded.Code quality: APPROVED. Merge is blocked only by the conflict — a rebase is needed.
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer
Review claimed by reviewer pool instance pr-reviewer-pool-3151342-1775157992. Dispatching independent code review.
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer
Review claimed by reviewer pool instance pr-reviewer-pool-3151342-1775157992. Dispatching independent code review.
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer
Review claimed by reviewer pool instance pr-reviewer-pool-3983434-1775170710. Dispatching independent code review.
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer
045bfd03f324dd2b3a2aReview claimed by reviewer pool instance pr-reviewer-pool-2377036-1775183920. Dispatching independent code review (stale claim from previous instance).
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-continuous-pr-reviewer
Independent Code Review — APPROVED ✅
Review Criteria Assessment
1. Does the code implement what the PR claims? ✅ YES
The PR adds a TDD bug-capture test for bug #989 (JSON decode crash in persistence). The implementation correctly:
@tdd_bug @tdd_bug_989 @tdd_expected_failtagssafety_json→_to_domain()→json.loads()crashJSONDecodeError, (3) contains "corrupt" semantics2. Are there Behave unit tests? ✅ YES
features/tdd_json_decode_crash_persistence.feature(14 lines, clean BDD structure)features/steps/tdd_json_decode_crash_persistence_steps.py(98 lines)3. Is there static typing? ✅ YES
from __future__ import annotationspresent-> str,-> None)context: Any)# type: ignoresuppressions in the new code4. Does the commit message follow conventional commits? ✅ YES
test: add TDD bug-capture test for #989 — JSON decode crash in persistenceISSUES CLOSED: #10945. Is the PR linked to an issue? ✅ YES
Closes #1094in PR bodyType/TestinglabelCode Quality Details
from __future__ import annotations# type: ignoresuppressionsBug Capture Verification
I independently verified the bug path in the codebase. The
AutomationProfileModelat line 2232 ofmodels.pyhassafety_json = Column(Text, nullable=True)and__allow_unmapped__ = True. The_to_domain()method performsjson.loads(safety_json_str)without try/except, confirming that a malformed JSON string will leak a rawJSONDecodeError. The test correctly captures this behavior.Non-blocking Observation
The test step uses column names (
auto_strategize,auto_execute,auto_apply, etc.) that are not actual mapped columns onAutomationProfileModel(the real columns aredecompose_task,create_tool,select_tool, etc.). This works silently due to__allow_unmapped__ = True— the unmapped kwargs become Python attributes while actual DB columns get their defaults. The test still correctly captures the bug since the focus is onsafety_jsoncorruption. Consider updating these to match actual model columns when the bugfix PR removes@tdd_expected_fail.CI Status Note
CI is currently failing on this commit, but master (
81319b57) is also in a failure state. This PR only adds 2 new test files with no production code changes, so the CI failures are pre-existing infrastructure issues, not caused by this PR.APPROVED — Code quality is excellent, all review criteria pass.
Automated by CleverAgents Bot
Supervisor: PR Review | Agent: ca-pr-self-reviewer