MNIST Volume Control

A deep learning project where users can draw digits (000–100) to set the volume interactively.
It combines Python (PyTorch) for training and benchmarking, C++ (libtorch) for optimized inference, and a JavaScript/ONNX Runtime Web frontend for browser deployment.

Features

Digit Drawing Interface: Users draw digits in three separate boxes to form a number between 000–100.
CNN-based Recognition: Trained on MNIST in PyTorch.
Real-time Volume Control: Predicted number dynamically sets the volume bar.
Cross-Platform Inference:
- Python: Training + CPU/GPU benchmarking.
- C++: Optimized TorchScript inference with libtorch.
- Web: ONNX.js deployment for browser demo.

📂 Project Structure

mnist-volume-controller/
├── assets/
│   ├── screenshots          # Screenshots for README
│
├── cpp/                     # C++ inference (CLion)
│   ├── main.cpp
│   ├── CMakeLists.txt
│   └── mnist_cnn.pt         # TorchScript model for C++
│
├── python/                  # Training & benchmarking (PyCharm)
│   ├── train.ipynb
│   ├── benchmark_cpu.py
│   ├── export_onnx.py
│   ├── mnist_cnn.pth        # PyTorch training checkpoint
│   └── mnist_cnn.pt         # TorchScript export
│
├── web/                     # Browser demo
│   ├── volume_index.html
│   ├── volume_style.css
│   ├── volume_script.js
│   └── mnist_cnn.onnx       # Model for browser inference
│
├── README.md
└── .gitignore

🧑‍💻 Setup & Usage

1. Train the Model (Python)

Open python/train.ipynb in Jupyter or Colab.
Train CNN on MNIST.
Export trained model to TorchScript (mnist_cnn.pt) and ONNX (mnist_cnn.onnx).

2. Benchmark (Python vs C++)

Python CPU: ~0.56 ms
C++ CPU: ~0.47 ms
Benchmarked on the same input (digit.png, 1000 runs, avg per inference).

3. Run C++ Inference (Local)

cd cpp
mkdir build && cd build
cmake ..
make
./mnist_cpp digit.png

4. Run in Browser (Web)

Open web/volume_index.html in a browser.
Draw digits in boxes and click Predict.
Volume bar updates in real time.

📊 Benchmarks

Framework	Device	Avg Latency (ms)
Python	CPU	~0.56
C++	CPU	~0.47

(GPU was slower on MNIST batch=1 due to overhead; not included in demo results.)

📸 Screenshots

(placeholders — replace with actual screenshots)

One digit:
Two digits:
Exactly 100:
Above 100:

License

MIT License

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MNIST Volume Control

Features

📂 Project Structure

🧑‍💻 Setup & Usage

1. Train the Model (Python)

2. Benchmark (Python vs C++)

3. Run C++ Inference (Local)

4. Run in Browser (Web)

📊 Benchmarks

📸 Screenshots

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets/screenshots		assets/screenshots
cpp		cpp
python		python
web		web
.gitignore		.gitignore
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

MNIST Volume Control

Features

📂 Project Structure

🧑‍💻 Setup & Usage

1. Train the Model (Python)

2. Benchmark (Python vs C++)

3. Run C++ Inference (Local)

4. Run in Browser (Web)

📊 Benchmarks

📸 Screenshots

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages