feat(autoresearch): scorer interface + 3 implementations (#3198) by mrveiss · Pull Request #3202 · mrveiss/AutoBot-AI

mrveiss · 2026-04-01T19:13:58Z

Summary

Pluggable PromptScorer ABC with ScorerResult dataclass (0.0-1.0 clamped)
ValBpbScorer: experiment-based scoring via ExperimentRunner
LLMJudgeScorer: automated 0-10 rating via LLMService with JSON/regex fallback
HumanReviewScorer: Redis-backed human review queue with polling + timeout

Closes #3198
Part of #2600 (AutoResearch M3)

Test plan

All 11 scorer unit tests pass
Existing autoresearch tests unaffected

🤖 Generated with Claude Code

…umanReview scorers (#3198) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ard, data leakage (#3198) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

feat(autoresearch): add scorer interface with ValBpb, LLMJudge, and H…

f2b58a1

…umanReview scorers (#3198) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

This was referenced Apr 1, 2026

feat(autoresearch): prompt optimizer + API endpoints + M3 integration (#3200) #3206

Merged

feat(frontend): AutoResearch experiment dashboard (#3201) #3207

Merged

fix(autoresearch): address review — Redis key validation, baseline gu…

b65ccde

…ard, data leakage (#3198) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

mrveiss merged commit 6240155 into Dev_new_gui Apr 1, 2026
1 of 3 checks passed

mrveiss deleted the issue-3198-scorers branch April 1, 2026 19:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(autoresearch): scorer interface + 3 implementations (#3198)#3202

feat(autoresearch): scorer interface + 3 implementations (#3198)#3202
mrveiss merged 2 commits intoDev_new_guifrom
issue-3198-scorers

mrveiss commented Apr 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

mrveiss commented Apr 1, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant