Skip to content

Popular repositories Loading

  1. vader vader Public

    Java 10 3

  2. FinanceQA FinanceQA Public

    FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities in Large Language Models

    6 1

  3. anvil anvil Public

    Python 6 8

  4. IDE-Bench IDE-Bench Public

    Comprehensive framework for evaluating AI IDE agents on real-world, cross-stack SWE tasks

    Python 4 8

  5. anvil-swift anvil-swift Public

    Python 1

  6. FullStackBoilerplate FullStackBoilerplate Public

    Python 8

Repositories

Showing 9 of 9 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…