This repository contains coursework for the Artificial Intelligence Programming course.
It consists of two main projects:
Implementation of a classic PID controller and a neural-network-based controller using JAX for gradient-based optimization.
An educational reimplementation of MuZero, a state-of-the-art reinforcement learning algorithm by Google Deepmind that learns a model of the environment and uses MCTS for planning.
Commands to be run from the root directory of the repo.
Create a virtual environment and install dependencies:
# Create virtual environment
python3 -m venv .venv
# Activate it (Linux/macOS)
source .venv/bin/activate
# Or activate it (Windows)
.venv\Scripts\activate
# Install dependencies
pip install -r requirements.txtpython3 -m project2.training.rl_managerpython3 -m project2.playground.muzero_play_gameMake sure you have the file path for the trained model in the core/config.py file.
python3 -m project2.training.run_tensorboardpython3 -m project2.playground.playground_mcts_perfect_modelYou can also use "python -m project2..." as an alternative to "python3 -m project2...".
