Skip to content
@whitecircle

White Circle

Runtime safety and alignment infrastructure for AI in the real world.

Pinned Loading

  1. circle-guard-bench circle-guard-bench Public

    First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and safeguards)

    Python 62 4

  2. killbench killbench Public

    Benchmark showing all major LLMs exhibit measurable decision biases, worsened by structured outputs that reduce safety refusals.

    Python 19 1

Repositories

Showing 2 of 2 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…