tinybirdco / llm-benchmark Public

Notifications You must be signed in to change notification settings
Fork 8
Star 73

Code
Issues 1
Pull requests 157
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security
Insights

Pull requests: tinybirdco/llm-benchmark

Labels 13 Milestones 0

New pull request New

157 Open 111 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Add benchmark results for openai/gpt-5.4-mini

#280 opened Mar 18, 2026 by github-actions bot

Loading…

Add benchmark results for openai/gpt-5.4-nano

#279 opened Mar 18, 2026 by github-actions bot

Loading…

Add benchmark results for mistralai/mistral-small-2603

#278 opened Mar 17, 2026 by github-actions bot

Loading…

Add benchmark results for z-ai/glm-5-turbo

#277 opened Mar 15, 2026 by github-actions bot

Loading…

Add benchmark results for minimax/minimax-m2.5:free

#276 opened Mar 15, 2026 by github-actions bot

Loading…

Add benchmark results for x-ai/grok-4.20-multi-agent-beta

#275 opened Mar 12, 2026 by github-actions bot

Loading…

Add benchmark results for x-ai/grok-4.20-beta

#274 opened Mar 12, 2026 by github-actions bot

Loading…

Add benchmark results for openrouter/hunter-alpha

#273 opened Mar 12, 2026 by github-actions bot

Loading…

Add benchmark results for openrouter/healer-alpha

#272 opened Mar 12, 2026 by github-actions bot

Loading…

Add benchmark results for nvidia/nemotron-3-super-120b-a12b:free

#271 opened Mar 11, 2026 by github-actions bot

Loading…

Add benchmark results for qwen/qwen3.5-9b

#270 opened Mar 10, 2026 by github-actions bot

Loading…

Add benchmark results for bytedance-seed/seed-2.0-lite

#269 opened Mar 10, 2026 by github-actions bot

Loading…

Add benchmark results for openai/gpt-5.4-pro

#268 opened Mar 5, 2026 by github-actions bot

Loading…

Add benchmark results for inception/mercury-2

#267 opened Mar 5, 2026 by github-actions bot

Loading…

Add benchmark results for openai/gpt-5.4

#266 opened Mar 5, 2026 by github-actions bot

Loading…

Add benchmark results for bytedance-seed/seed-1.6-flash

#265 opened Mar 4, 2026 by github-actions bot

Loading…

Add benchmark results for liquid/lfm-2.5-1.2b-thinking:free

#264 opened Mar 4, 2026 by github-actions bot

Loading…

Add benchmark results for openai/gpt-5.3-chat

#263 opened Mar 4, 2026 by github-actions bot

Loading…

Add benchmark results for google/gemini-3.1-flash-lite-preview

#262 opened Mar 3, 2026 by github-actions bot

Loading…

Add benchmark results for bytedance-seed/seed-2.0-mini

#261 opened Feb 27, 2026 by github-actions bot

Loading…

Add benchmark results for google/gemini-3.1-flash-image-preview

#260 opened Feb 26, 2026 by github-actions bot

Loading…

Add benchmark results for qwen/qwen3.5-27b

#259 opened Feb 26, 2026 by github-actions bot

Loading…

Add benchmark results for qwen/qwen3.5-122b-a10b

#258 opened Feb 26, 2026 by github-actions bot

Loading…

Add benchmark results for google/gemini-3.1-pro-preview-customtools

#257 opened Feb 26, 2026 by github-actions bot

Loading…

Add benchmark results for qwen/qwen3.5-flash-02-23

#256 opened Feb 26, 2026 by github-actions bot

Loading…

Previous 1 2 3 4 5 6 7 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!