
LLM Context Window Benchmark

A visual benchmark comparing how different Large Language Models (LLMs) handle complex coding prompts, particularly for games and interactive UI. This project serves as a "context-window-in-action" gallery.

🚀 How it Works

The project is a static site that aggregates benchmark results from various models (Gemini, Claude, GPT, Grok). Each benchmark is a directory containing:

  1. prompt.txt: The exact prompt given to the models.
  2. Sub-directories for each model (e.g., gemini, claude) containing the generated index.html.
  3. modelnames.json: Maps internal model IDs to human-readable display names.
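For reference, a modelnames.json might look like the fragment below; the IDs and display names are illustrative, not the repository's actual contents:

```json
{
  "gemini": "Gemini (illustrative display name)",
  "claude": "Claude (illustrative display name)"
}
```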

πŸ— Directory Structure

.
├── create_config.sh    # Script to regenerate the gallery index
├── index.html          # Main gallery UI
├── flappy/             # Benchmark: Flappy Bird clone
│   ├── prompt.txt      # The prompt used
│   ├── gemini/         # Result from Gemini
│   │   └── index.html
│   └── claude/         # Result from Claude
│       └── index.html
└── platformer/         # Benchmark: Platformer game
    └── ...

🛠 Features

  • Side-by-Side Comparison: View model outputs for the same prompt in one interface.
  • Dynamic Config Generation: Drop in a new result folder and run create_config.sh.
  • Vanilla Implementation: No heavy frameworks, just fast, static HTML/JS.
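To illustrate the "drop a folder, run the script" workflow, here is a minimal sketch of what a config-regeneration script like create_config.sh could do: scan for benchmark directories (any directory containing a prompt.txt) and list the model sub-directories that contain an index.html. The output format and the demo folder names are assumptions; the real script may differ.

```shell
#!/usr/bin/env bash
# Sketch only: enumerate benchmarks and their model results.
set -euo pipefail

# Throwaway demo layout standing in for real benchmark folders:
mkdir -p demo/flappy/gemini demo/flappy/claude
touch demo/flappy/prompt.txt demo/flappy/gemini/index.html demo/flappy/claude/index.html
cd demo

# Emit one line per benchmark: "<benchmark>: <model> <model> ..."
for bench in */; do
  [ -f "${bench}prompt.txt" ] || continue      # benchmarks are dirs with a prompt
  models=""
  for model in "${bench}"*/; do
    if [ -f "${model}index.html" ]; then       # only folders with a result count
      models="$models $(basename "$model")"
    fi
  done
  echo "${bench%/}:${models}"
done > config.txt

cat config.txt   # e.g. "flappy: claude gemini"
```

The real script presumably emits JSON or JS consumed by index.html, but the directory-scanning core would look much like this.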

🤝 How to Contribute

We welcome contributions of new benchmarks or new model results for existing benchmarks!

1. Adding a New Model Result

If you want to add a result for a model (e.g., "DeepSeek") to an existing benchmark (e.g., flappy):

  1. Create a folder named deepseek inside flappy/.
  2. Add the generated index.html file into flappy/deepseek/.
  3. (Optional) Add the model name to flappy/modelnames.json.
  4. Run ./create_config.sh to update the site.
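The steps above can be sketched as shell commands; the deepseek folder name comes from the example, and the touched index.html is a placeholder for your model's actual generated page:

```shell
# 1-2. Create the model folder and place the generated page inside it
mkdir -p flappy/deepseek
touch flappy/deepseek/index.html   # placeholder; copy your real generated file here

# 3. (Optional) add the display name to flappy/modelnames.json
# 4. Regenerate the site config:
# ./create_config.sh
```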

2. Adding a New Benchmark

  1. Create a new root folder (e.g., tetris/).
  2. Add a prompt.txt with the prompt you used.
  3. Add folders for each model you tested.
  4. Run ./create_config.sh.
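As a concrete sketch of the steps above, using the tetris example (the prompt wording and model folder names are illustrative):

```shell
# 1. New benchmark folder, plus one folder per tested model (step 3)
mkdir -p tetris/gemini tetris/claude

# 2. Record the exact prompt you used (wording here is illustrative)
printf 'Build a Tetris clone in a single HTML file.\n' > tetris/prompt.txt

# Drop each model's generated index.html into its folder, then:
# 4. ./create_config.sh
```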

💻 Local Development

  1. Clone the repository.
  2. Serve the site with any static file server, such as npx phost or python -m http.server.
  3. After adding new folders or files, regenerate the config:
    bash create_config.sh

📜 License

See LICENSE.md for details.