fix(agents): correct prompt tool names and recover from tool errors (#175, #176) by w7-mgfcode · Pull Request #177 · w7-mgfcode/ForecastLabAI

w7-mgfcode · 2026-05-18T13:18:36Z

Summary

Closes #175 and #176. Both were found by a capture_run_messages diagnostic of a failing experiment-agent run, and both touch base.py + experiment.py, so they land together.

#175 — prompt tool names didn't match registered tools

TOOL_USAGE_INSTRUCTIONS and the EXPERIMENT_SYSTEM_PROMPT workflow named tools as run_backtest, list_runs, compare_backtest_results, … but the registered tools are tool_-prefixed. Weaker/local models trusted the prompt text and called unknown tool names (Unknown tool name: 'run_backtest'). The prompts now use the exact registered tool_* names.

#176 — a tool exception crashed the whole run

A tool raising a plain exception aborted the run (observed: ValueError: No data found for store=1, product=101 … when the model picked a store/product with no data). New recoverable() decorator wraps every async DB-touching tool so an expected ValueError becomes a ModelRetry — the model gets the message and can correct its arguments. Other exception types still propagate as genuine bugs.

Changes

agents/base.py — new recoverable() decorator; corrected TOOL_USAGE_INSTRUCTIONS.
agents/experiment.py — corrected workflow tool names; @recoverable on the 6 DB tools.
agents/rag_assistant.py — @recoverable on the 2 DB tools (tool_plain pure tools untouched).
tests/test_base.py — prompt-name regression test; recoverable behaviour tests.

Verification

✅ ruff · ✅ mypy --strict · ✅ pyright (0 errors) · ✅ 113 agent unit tests

Summary by Sourcery

Align agent prompts with registered tool names and make database-driven agent tools resilient to expected data errors so runs can recover instead of crashing.

Bug Fixes:

Update agent prompt instructions and experiment workflow text to reference the correct tool_*-prefixed names used for registered tools.
Convert ValueError from async agent tools into ModelRetry errors so input-driven data issues no longer abort an entire agent run.

Enhancements:

Introduce a generic recoverable decorator and apply it to database-backed tools in experiment and RAG assistant agents to standardize error handling behavior.

Tests:

Add regression tests to ensure prompt tool usage instructions reference the registered tool names and to verify the recoverable decorator’s behavior for success, ValueError, and other exceptions.

…175) Two coupled robustness fixes for the agent layer, both surfaced by a capture_run_messages diagnostic. The changes share base.py and experiment.py, so they land in one commit. #175 — the experiment prompt named tools as run_backtest / list_runs / compare_backtest_results, but the registered tools are tool_-prefixed (tool_run_backtest, ...). Weaker models trusted the prompt and called unknown tool names. TOOL_USAGE_INSTRUCTIONS and the EXPERIMENT_SYSTEM_PROMPT workflow now use the exact registered names. #176 — a tool raising a plain exception aborted the whole run (observed: ValueError "No data found for store=..."). New recoverable() decorator wraps every async DB-touching tool so an expected ValueError becomes a ModelRetry the model can correct from; other exceptions still propagate. - Add recoverable() to agents/base.py; decorate the 6 experiment tools and the 2 rag_assistant tools (tool_plain pure tools left alone). - Tests: prompt names use tool_* ; recoverable converts ValueError to ModelRetry, passes other exceptions through, is transparent on success.

coderabbitai · 2026-05-18T13:18:45Z

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: c6eea962-e296-467d-80d7-a7b096522e5b

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch fix/agents-prompt-and-tool-errors

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

sourcery-ai · 2026-05-18T13:18:45Z

Reviewer's Guide

Aligns experiment agent prompt/tool names with the actual registered tool_* functions and introduces a recoverable() decorator so ValueError from async DB-touching tools becomes a ModelRetry instead of crashing the agent run, with tests covering both behaviors.

Sequence diagram for agent tool call with recoverable wrapper

sequenceDiagram
    participant Model
    participant Agent
    participant tool_run_backtest

    Model->>Agent: Call tool_run_backtest
    Agent->>tool_run_backtest: Await wrapped function
    alt [ValueError raised]
        tool_run_backtest-->>Agent: ValueError
        Agent-->>Model: ModelRetry(str(exc))
    else [No error]
        tool_run_backtest-->>Agent: Result
        Agent-->>Model: Result
    end

Flow diagram for recoverable decorator behavior

flowchart TD
    A[Call wrapped async tool] --> B[Execute func]
    B --> C{ValueError raised?}
    C -- No --> D[Return awaited result]
    C -- Yes --> E["Raise ModelRetry with str(exc)"]

File-Level Changes

Change	Details	Files
Introduce recoverable() decorator for async tools so input-driven ValueError becomes ModelRetry while preserving successful and non-ValueError behavior.	Add generic async recoverable() decorator that wraps a tool, catching ValueError and re-raising it as ModelRetry while letting other exceptions propagate. Document intent of recoverable() in docstring as handling expected, input-driven failures without aborting the agent run. Add unit tests verifying ValueError is converted to ModelRetry, non-ValueError exceptions pass through, and successful execution is unchanged.	`app/features/agents/agents/base.py` `app/features/agents/tests/test_base.py`
Align agent prompt instructions and workflow text with the actual registered tool_* names to prevent the model from calling unknown tools.	Update TOOL_USAGE_INSTRUCTIONS to reference the concrete registered tool_* names and refine the descriptive text. Update experiment agent WORKFLOW prompt steps to mention tool_list_runs, tool_run_backtest, and tool_compare_backtest_results explicitly. Add a regression test ensuring TOOL_USAGE_INSTRUCTIONS contains all expected tool_* names.	`app/features/agents/agents/base.py` `app/features/agents/agents/experiment.py` `app/features/agents/tests/test_base.py`
Apply recoverable() to all async DB-touching tools in experiment and RAG assistant agents so recoverable tool failures no longer crash the run.	Decorate experiment agent tools that hit the DB (tool_list_runs, tool_get_run, tool_run_backtest, tool_compare_runs, tool_create_alias, tool_archive_run) with @recoverable in addition to @agent.tool. Decorate RAG assistant agent DB-backed tools (tool_retrieve_context, tool_list_sources) with @recoverable while leaving pure tool_plain utilities unchanged.	`app/features/agents/agents/experiment.py` `app/features/agents/agents/rag_assistant.py`

Possibly linked issues

Agent system prompts reference bare tool names that do not match the registered tools #175: PR updates TOOL_USAGE_INSTRUCTIONS and EXPERIMENT_SYSTEM_PROMPT to use the correct tool_* names, fixing the issue.
A tool raising a plain exception crashes the whole agent run #176: Yes. PR’s recoverable decorator converts tool ValueErrors to ModelRetry, exactly fixing the crash described in the issue.

Tips and commands

Interacting with Sourcery

Trigger a new review: Comment @sourcery-ai review on the pull request.
Continue discussions: Reply directly to Sourcery's review comments.
Generate a GitHub issue from a review comment: Ask Sourcery to create an
issue from a review comment by replying to it. You can also reply to a
review comment with @sourcery-ai issue to create an issue from it.
Generate a pull request title: Write @sourcery-ai anywhere in the pull
request title to generate a title at any time. You can also comment
@sourcery-ai title on the pull request to (re-)generate the title at any time.
Generate a pull request summary: Write @sourcery-ai summary anywhere in
the pull request body to generate a PR summary at any time exactly where you
want it. You can also comment @sourcery-ai summary on the pull request to
(re-)generate the summary at any time.
Generate reviewer's guide: Comment @sourcery-ai guide on the pull
request to (re-)generate the reviewer's guide at any time.
Resolve all Sourcery comments: Comment @sourcery-ai resolve on the
pull request to resolve all Sourcery comments. Useful if you've already
addressed all the comments and don't want to see them anymore.
Dismiss all Sourcery reviews: Comment @sourcery-ai dismiss on the pull
request to dismiss all existing Sourcery reviews. Especially useful if you
want to start fresh with a new review - don't forget to comment
@sourcery-ai review to trigger a new review!

Customizing Your Experience

Access your dashboard to:

Enable or disable review features such as the Sourcery-generated pull request
summary, the reviewer's guide, and others.
Change the review language.
Add, remove or edit custom review instructions.
Adjust other review settings.

Getting Help

Contact our support team for questions or feedback.
Visit our documentation for detailed guides and information.
Keep in touch with the Sourcery team by following us on X/Twitter, LinkedIn or GitHub.

sourcery-ai

Hey - I've found 1 issue, and left some high level feedback:

The recoverable decorator assumes the wrapped callable is always async and await-able; consider either enforcing this via a runtime assertion/check or making the decorator robust to accidental use on sync functions to avoid confusing TypeError: object is not awaitable failures.
Tool names are now duplicated across TOOL_USAGE_INSTRUCTIONS, the experiment workflow prompt, and the actual @agent.tool functions; consider centralizing these names (e.g., as constants or deriving the prompt snippets from the registered tools) to reduce the risk of future drift like in #175.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- The `recoverable` decorator assumes the wrapped callable is always async and `await`-able; consider either enforcing this via a runtime assertion/check or making the decorator robust to accidental use on sync functions to avoid confusing `TypeError: object is not awaitable` failures.
- Tool names are now duplicated across `TOOL_USAGE_INSTRUCTIONS`, the experiment workflow prompt, and the actual `@agent.tool` functions; consider centralizing these names (e.g., as constants or deriving the prompt snippets from the registered tools) to reduce the risk of future drift like in #175.

## Individual Comments

### Comment 1
<location path="app/features/agents/agents/base.py" line_range="24-26" />
<code_context>
 logger = structlog.get_logger()


+def recoverable[**P, ToolReturnT](
+    func: Callable[P, Awaitable[ToolReturnT]],
+) -> Callable[P, Awaitable[ToolReturnT]]:
+    """Wrap an async agent tool so an expected ``ValueError`` becomes a ``ModelRetry``.
+
</code_context>
<issue_to_address>
**issue (bug_risk):** The generic decorator signature uses invalid syntax and will not parse.

`def recoverable[**P, ToolReturnT](` is not valid Python syntax, even with PEP 695. To keep this decorator generic and compatible with current Python and type checkers, use the ParamSpec/TypeVar pattern instead:

```python
P = ParamSpec("P")
ToolReturnT = TypeVar("ToolReturnT")

def recoverable(
    func: Callable[P, Awaitable[ToolReturnT]],
) -> Callable[P, Awaitable[ToolReturnT]]:
    ...
```
</issue_to_address>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

sourcery-ai · 2026-05-18T13:20:15Z

+def recoverable[**P, ToolReturnT](
+    func: Callable[P, Awaitable[ToolReturnT]],
+) -> Callable[P, Awaitable[ToolReturnT]]:


issue (bug_risk): The generic decorator signature uses invalid syntax and will not parse.

def recoverable[**P, ToolReturnT]( is not valid Python syntax, even with PEP 695. To keep this decorator generic and compatible with current Python and type checkers, use the ParamSpec/TypeVar pattern instead:

P = ParamSpec("P") ToolReturnT = TypeVar("ToolReturnT") def recoverable( func: Callable[P, Awaitable[ToolReturnT]], ) -> Callable[P, Awaitable[ToolReturnT]]: ...

Resolves the test_base.py conflict against dev (which now carries the #170/#172/#173 agent work) and folds in PR #177 code-review fixes: - base.py: recoverable() now rejects non-coroutine functions at decoration time with a clear TypeError, instead of failing opaquely with 'not awaitable' on first tool call (review: overall comment 1). - test_base.py: replace the hardcoded prompt-tool-name list with test_prompts_only_reference_registered_tool_names, which reads the registered tool set off the built agent and asserts every tool_* name in the prompts is real — drift in either direction now fails CI, the actual guard #175 needed (review: overall comment 2). - Kept both sides of the merge: the recoverable + prompt-name tests from this branch and the retry-budget + PromptedOutput tests from dev.

sourcery-ai Bot reviewed May 18, 2026

View reviewed changes

w7-mgfcode merged commit 5db34e9 into dev May 18, 2026
4 checks passed

w7-mgfcode mentioned this pull request May 18, 2026

feat: cut v0.2.12 — agent hardening, AI model console, demo showcase #178

Merged

w7-mgfcode deleted the fix/agents-prompt-and-tool-errors branch May 18, 2026 14:20

This was referenced May 18, 2026

A tool raising a plain exception crashes the whole agent run #176

Closed

Agent system prompts reference bare tool names that do not match the registered tools #175

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agents): correct prompt tool names and recover from tool errors (#175, #176)#177

fix(agents): correct prompt tool names and recover from tool errors (#175, #176)#177
w7-mgfcode merged 2 commits into
devfrom
fix/agents-prompt-and-tool-errors

w7-mgfcode commented May 18, 2026 •

edited by sourcery-ai Bot

Loading

Uh oh!

coderabbitai Bot commented May 18, 2026 •

edited

Loading

Review skipped

Uh oh!

sourcery-ai Bot commented May 18, 2026 •

edited

Loading

Interacting with Sourcery

Customizing Your Experience

Getting Help

Uh oh!

sourcery-ai Bot left a comment

Uh oh!

sourcery-ai Bot May 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

w7-mgfcode commented May 18, 2026 • edited by sourcery-ai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

#175 — prompt tool names didn't match registered tools

#176 — a tool exception crashed the whole run

Changes

Verification

Summary by Sourcery

Uh oh!

coderabbitai Bot commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

sourcery-ai Bot commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviewer's Guide

Sequence diagram for agent tool call with recoverable wrapper

Flow diagram for recoverable decorator behavior

File-Level Changes

Possibly linked issues

Interacting with Sourcery

Customizing Your Experience

Getting Help

Uh oh!

sourcery-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

sourcery-ai Bot May 18, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

w7-mgfcode commented May 18, 2026 •

edited by sourcery-ai Bot

Loading

coderabbitai Bot commented May 18, 2026 •

edited

Loading

sourcery-ai Bot commented May 18, 2026 •

edited

Loading