Releases: symflower/eval-dev-quality
Releases · symflower/eval-dev-quality
v1.1.0
Rust Support
v1.0.10
v1.0.9
What's Changed
- Integrate v1.0 results by @zimmski in #431
- fix, Collect assessments if a model responds with an empty message by @ahumenberger in #428
Full Changelog: v1.0.8...v1.0.9
v1.0.8
What's Changed
- Fetch total costs from OpenRouter after query by @ahumenberger in #425
Full Changelog: v1.0.7...v1.0.8
v1.0.7
What's Changed
- Collect usage metrics of each query to be able to calculate costs by @ahumenberger in #424
Full Changelog: v1.0.6...v1.0.7
v1.0.6
What's Changed
- fix, Allow selecting models with attributes for openRouter as well by @ahumenberger in #421
Full Changelog: v1.0.5...v1.0.6
v1.0.5
v1.0.4
What's Changed
- Update Roadmap Template with Deep Dive Template by @bauersimon in #417
- Replace "Inkscape" with "svgexport" by @Munsio in #416
Full Changelog: v1.0.3...v1.0.4
v1.0.3
v1.0.2
What's Changed
- fix, Multiply the "tests-passing" values of transpile tasks because we are transpiling for two languages and therefore we are also running twice the tests by @Munsio in #402
- Allow to set reasoning_effort for models (e.g. OpenAI's o3-mini) by @zimmski in #408
Full Changelog: v1.0.1...v1.0.2