🧠 Tic-Tac-Toe: Minimax vs Reinforcement Learning

This project implements a Tic-Tac-Toe game in Python where a Reinforcement Learning (Q-Learning) agent competes against a Minimax algorithm. It is based on Sentdex’s tutorial and extends it with:

Head-to-head matches between agents
Training and evaluation of the Q-Learning model
Optional human vs AI gameplay

🚀 Features

Q-Learning agent trained from scratch
Minimax opponent with perfect play
Configurable training episodes
Win/draw/loss performance tracking
Play-by-play printouts for auto-play mode

📁 File

tic-tac-toe-minimax-vs-RL.py – main script with everything in one place: training, gameplay, and visualization

🎯 Project Goal

This project is a simple, educational example of how reinforcement learning (Q-learning) can be implemented. It is not optimized to produce a perfect AI player; the goal is to show how to build and train a basic RL agent on a small game like Tic-Tac-Toe.

🧠 How It Works

The Q-Learning agent learns from trial and error, improving its strategy over thousands of games.
The Minimax player searches all possible future states from the current position and picks the move with the best guaranteed outcome.
You can pit them against each other, play yourself, or watch training performance evolve.
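
The two approaches described above can be sketched roughly as follows. This is a minimal illustration, not the code from tic-tac-toe-minimax-vs-RL.py: the board representation (a 9-tuple of "X", "O", or " "), the function names (`minimax`, `best_move`, `choose_action`, `q_update`), and the hyperparameter defaults (`alpha`, `gamma`, `epsilon`) are all assumptions made for the example.

```python
import random
from collections import defaultdict
from functools import lru_cache

# Hypothetical board: a 9-tuple of "X", "O", or " " (row-major order).
WIN_LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),
             (0, 3, 6), (1, 4, 7), (2, 5, 8),
             (0, 4, 8), (2, 4, 6)]


def winner(board):
    """Return "X" or "O" if that mark fills a line, otherwise None."""
    for a, b, c in WIN_LINES:
        if board[a] != " " and board[a] == board[b] == board[c]:
            return board[a]
    return None


def legal_moves(board):
    """Indices of the empty cells."""
    return [i for i, cell in enumerate(board) if cell == " "]


def place(board, index, mark):
    """Return a new board with `mark` written at `index`."""
    return board[:index] + (mark,) + board[index + 1:]


@lru_cache(maxsize=None)
def minimax(board, player, me):
    """Score the position for `me` with `player` to move: +1 win, 0 draw, -1 loss."""
    w = winner(board)
    if w is not None:
        return 1 if w == me else -1
    moves = legal_moves(board)
    if not moves:
        return 0  # board full, nobody won
    other = "O" if player == "X" else "X"
    scores = [minimax(place(board, i, player), other, me) for i in moves]
    # `me` maximizes the score, the opponent minimizes it.
    return max(scores) if player == me else min(scores)


def best_move(board, me):
    """Pick the legal move with the highest minimax score for `me`."""
    other = "O" if me == "X" else "X"
    return max(legal_moves(board),
               key=lambda i: minimax(place(board, i, me), other, me))


# Tabular Q-learning: Q maps (state, action) to an estimated value.
Q = defaultdict(float)


def choose_action(board, epsilon=0.1):
    """Epsilon-greedy: explore with probability epsilon, otherwise exploit Q."""
    moves = legal_moves(board)
    if random.random() < epsilon:
        return random.choice(moves)
    return max(moves, key=lambda a: Q[(board, a)])


def q_update(state, action, reward, next_state, done, alpha=0.5, gamma=0.9):
    """One Q-learning step: move Q(s, a) toward reward + gamma * max Q(s', a')."""
    future = 0.0
    if not done:
        future = max((Q[(next_state, a)] for a in legal_moves(next_state)),
                     default=0.0)
    Q[(state, action)] += alpha * (reward + gamma * future - Q[(state, action)])
```

The search is memoized with `lru_cache` because many move orders reach the same position; the Q-table, by contrast, starts at zero and only improves as `q_update` is called over many self-played or opponent-played games.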

▶️ Usage

Run the script using:

python tic-tac-toe-minimax-vs-RL.py

You'll be prompted to choose:

Which agent to train or test
Number of training games
Whether to play yourself or watch autoplay

📊 Example Output

Training in progress...
Episode 1000 | Win rate: 85% | Draw rate: 10% | Loss rate: 5%
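
A progress line in this format could be produced from a running list of game results. The sketch below is only an assumption about how such tracking might look; the function name `summary_line` and the outcome labels "win", "draw", and "loss" are hypothetical and not taken from the script.

```python
from collections import Counter


def summary_line(episode, outcomes):
    """Format win/draw/loss percentages for a list of outcome labels."""
    counts = Counter(outcomes)
    total = len(outcomes)

    def rate(label):
        # Percentage of games with this label; 0 if no games were played.
        return 100 * counts[label] / total if total else 0.0

    return (f"Episode {episode} | Win rate: {rate('win'):.0f}% | "
            f"Draw rate: {rate('draw'):.0f}% | Loss rate: {rate('loss'):.0f}%")
```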

📚 Inspired By

Tutorial series by Sentdex, which provides excellent resources on Python and AI.

📄 License

MIT License — feel free to use, modify, and share!
