Hyperparameter tuning via an LLM for a two-layer CNN on the MNIST dataset. Also an example of something that can be done, but probably shouldn't be.
Uses:
- PyTorch for training the model
- OpenRouter for LLM API calls
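A two-layer CNN of the kind being tuned might look like the following sketch; the layer sizes, dropout, and default hyperparameters here are illustrative assumptions, not the repository's actual model.

```python
import torch
import torch.nn as nn

class TwoLayerCNN(nn.Module):
    """Illustrative two-layer CNN for 28x28 MNIST digits (sizes are assumptions)."""
    def __init__(self, channels1: int = 16, channels2: int = 32, dropout: float = 0.25):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, channels1, kernel_size=3, padding=1),  # 28x28 -> 28x28
            nn.ReLU(),
            nn.MaxPool2d(2),                                    # -> 14x14
            nn.Conv2d(channels1, channels2, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),                                    # -> 7x7
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Dropout(dropout),
            nn.Linear(channels2 * 7 * 7, 10),  # 10 digit classes
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.classifier(self.features(x))
```

The channel counts, dropout rate, learning rate, and batch size are the sort of knobs the LLM is asked to tune between runs.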
Validation accuracy:
Reminder: no regression model is used here. The improvements come strictly from the LLM's reasoning about and analysis of the previous runs:
Example LLM reasoning (which leaves a lot of room for improvement):
Setup:
```shell
uv sync
```
The MNIST dataset is downloaded from Hugging Face: https://huggingface.co/datasets/ylecun/mnist
```shell
uvx --from huggingface_hub hf download ylecun/mnist --repo-type dataset --local-dir data
```
Use the `scripts/prepare_data.py` script to split the dataset into train and validation sets.
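The split itself lives in `scripts/prepare_data.py`; a minimal sketch of a shuffled index split of the kind it might perform (the 90/10 ratio and fixed seed are assumptions):

```python
import random

def split_indices(n: int, val_fraction: float = 0.1, seed: int = 0):
    """Shuffle indices 0..n-1 and split them into train/validation index lists."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible across runs
    indices = list(range(n))
    rng.shuffle(indices)
    n_val = int(n * val_fraction)
    return indices[n_val:], indices[:n_val]  # (train, val)
```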
This example runs for 5 generations with 4 training runs per generation.
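An experiment file such as `experiments/evo-mini-v3.toml` might encode that budget; the key names below are assumptions for illustration, not the file's actual schema.

```toml
# Hypothetical schema; key names are assumptions.
[evolution]
generations = 5
runs_per_generation = 4
```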
```shell
uv run evolutionary-mnist experiments/evo-mini-v3.toml
```
Future work:
- Improve the system prompt.
- Neural Architecture Search (NAS) for on-the-fly architecture exploration.
- Keep training time constant per run within each generation.
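Putting the pieces together, the loop of 5 generations with 4 runs each can be sketched as follows; `propose_hyperparams` stands in for the OpenRouter LLM call and `train_and_evaluate` for a full PyTorch training run (both are hypothetical names, with random stubs here so the sketch runs standalone).

```python
import random

def propose_hyperparams(history):
    """Stand-in for the LLM call: the real system sends the run history
    to an LLM via OpenRouter and parses its suggested settings."""
    return {"lr": random.choice([1e-2, 1e-3, 1e-4]),
            "batch_size": random.choice([32, 64, 128])}

def train_and_evaluate(params):
    """Stand-in for a PyTorch training run; returns a validation accuracy."""
    return random.random()

def evolve(generations: int = 5, runs_per_generation: int = 4):
    history = []  # (params, val_accuracy) pairs shown to the LLM each call
    for gen in range(generations):
        for _ in range(runs_per_generation):
            params = propose_hyperparams(history)
            acc = train_and_evaluate(params)
            history.append((params, acc))
    return history
```

No fitting or regression happens anywhere in this loop; the only "learning" is the LLM reading `history` and proposing the next settings.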


