Skip to content
@SPAR-Self-Forecasting

SPAR: Self-Forecasting

Popular repositories Loading

  1. spar-self-prediction spar-self-prediction Public

    Python 1

  2. self-prediction-wrapper self-prediction-wrapper Public

    We have a list of base_prompts, e.g. "What is 2+2?". We have a prefix wrapper: "WRAPPER = 'What would you say in response to this prompt: "{p}"'. We compare the top-1 agreement, JS-divergence betwe…

    Python

  3. LLMSelfForecasting LLMSelfForecasting Public

    Framework for testing LLMs' ability to predict their own behavior in multi-turn and agentic scenarios

    Python

  4. spar-self-prediction-extensions spar-self-prediction-extensions Public

    Python

  5. scheming-self-prediction scheming-self-prediction Public

    Disentangling incapability from scheming in LLMs self-predicting their agentic trajectories.

  6. ai-psychosis-self-prediction ai-psychosis-self-prediction Public

    Self-prediction vs cross-prediction experiment on AI psychosis red-teaming scores

    Python

Repositories

Showing 6 of 6 repositories

Top languages

Loading…

Most used topics

Loading…