Skip to content

Latest commit

Β 

History

History
23 lines (15 loc) Β· 1.11 KB

File metadata and controls

23 lines (15 loc) Β· 1.11 KB

🀝 Contributing

For the code to be accepted it must be measured and break the record. πŸš€

Previously we were adding more features / code / architectures but records weren't getting broken and there was no progress. πŸ“‰

Important

Consider more code = bad (complexity, bloat, maintenance, bugs when upgrading), unless there is a new record in the training speed / loss, which justifies adding code. ✨


πŸ›  How to contribute:

  1. πŸ” Pick a topic / task from issues (issues are general name for tasks), carefully read it and understand it
  2. 🍴 Fork the repo
  3. πŸ’» Clone it and implement the experiment, follow README
  4. πŸ“Š Benchmark your changes against a the baseline that you also measured beforehand. If hardware is limited, use free GPUs (Lightning AI, Colab) and reduce model size (n_layer, n_embd) for testing.
  5. πŸ“₯ Submit a PR with your findings and comparison data.

πŸ† Leaderboard

Please check LEADERBOARD for architecture, records and contribution history.