Pull requests: tinybirdco/llm-benchmark

Pull requests list

Add benchmark results for openai/gpt-5.4-mini
#280 opened Mar 18, 2026 by github-actions bot
Add benchmark results for openai/gpt-5.4-nano
#279 opened Mar 18, 2026 by github-actions bot
Add benchmark results for z-ai/glm-5-turbo
#277 opened Mar 15, 2026 by github-actions bot
Add benchmark results for x-ai/grok-4.20-beta
#274 opened Mar 12, 2026 by github-actions bot
Add benchmark results for openrouter/hunter-alpha
#273 opened Mar 12, 2026 by github-actions bot
Add benchmark results for openrouter/healer-alpha
#272 opened Mar 12, 2026 by github-actions bot
Add benchmark results for qwen/qwen3.5-9b
#270 opened Mar 10, 2026 by github-actions bot
Add benchmark results for openai/gpt-5.4-pro
#268 opened Mar 5, 2026 by github-actions bot
Add benchmark results for inception/mercury-2
#267 opened Mar 5, 2026 by github-actions bot
Add benchmark results for openai/gpt-5.4
#266 opened Mar 5, 2026 by github-actions bot
Add benchmark results for openai/gpt-5.3-chat
#263 opened Mar 4, 2026 by github-actions bot
Add benchmark results for qwen/qwen3.5-27b
#259 opened Feb 26, 2026 by github-actions bot
Add benchmark results for qwen/qwen3.5-122b-a10b
#258 opened Feb 26, 2026 by github-actions bot
Add benchmark results for qwen/qwen3.5-flash-02-23
#256 opened Feb 26, 2026 by github-actions bot