SPAR: Self-Forecasting

Python 1

We have a list of base_prompts, e.g. "What is 2+2?". We have a prefix wrapper: "WRAPPER = 'What would you say in response to this prompt: "{p}"'. We compare the top-1 agreement, JS-divergence betwe…

Python

LLMSelfForecasting Public

Framework for testing LLMs' ability to predict their own behavior in multi-turn and agentic scenarios

Python

spar-self-prediction-extensions Public

Python

scheming-self-prediction Public

Disentangling incapability from scheming in LLMs self-predicting their agentic trajectories.

ai-psychosis-self-prediction Public

Self-prediction vs cross-prediction experiment on AI psychosis red-teaming scores

Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SPAR: Self-Forecasting

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!