A Deep Q-Learning-based AI for Antichess, a chess variant in which the objective is to lose all of your pieces or be stalemated. This project trains two reinforcement learning agents that compete against each other, using double Q-networks with delayed target updates.
## Table of Contents

- [Introduction](#introduction)
- [Project Features](#project-features)
- [Rules of Antichess](#rules-of-antichess)
- [Reinforcement Learning Approach](#reinforcement-learning-approach)
- [Installation](#installation)
- [How to Train the Model](#how-to-train-the-model)
- [Testing the Trained Model](#testing-the-trained-model)
- [Results](#results)
- [References](#references)
## Introduction

Antichess, also known as Losing Chess, is a chess variant in which players aim to lose all of their pieces or be stalemated. This project trains a Deep Q-Network (DQN) agent to play Antichess against another AI or a random strategy. The approach combines:
- Deep Q-Learning for decision-making.
- Experience Replay to stabilize training.
- Polyak Averaging to update target networks smoothly.
- Self-play for better learning.
## Project Features

✅ Fully functional Antichess game logic
✅ Deep Q-Learning with self-play training
✅ Experience replay for stable learning
✅ Polyak averaging for smooth target network updates
✅ Customizable opponent strategy (White or Black)
## Rules of Antichess

- The goal is to lose all of your pieces or be stalemated.
- Capturing is mandatory: if a capture is available, the player must make one (see the move-filter sketch below).
- The king has no special status; there is no check or checkmate, and the king can be captured like any other piece.
- Pawns promote only to queens upon reaching the last rank.
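A minimal sketch of how the forced-capture rule can be enforced during move generation. The `legal_moves()` and `is_capture()` names are illustrative assumptions, not this repository's API:

```python
def playable_moves(board):
    """Antichess move filter: if any capture is available,
    only captures may be played; otherwise any legal move is allowed."""
    moves = list(board.legal_moves())                     # ordinary chess moves
    captures = [m for m in moves if board.is_capture(m)]  # forced-capture candidates
    return captures if captures else moves
```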
More details on Antichess rules: [Wikipedia](https://en.wikipedia.org/wiki/Losing_chess)
## Reinforcement Learning Approach

This project trains two Deep Q-Learning agents through self-play, letting them improve their strategies over thousands of games.
🔹 **Bellman Equation:** used to update Q-values during training.
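In its Q-learning form, the update applied to each sampled transition $(s, a, r, s')$ is:

```math
Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]
```

where $\alpha$ is the learning rate and $\gamma$ is the discount factor.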
🔹 **Deep Q-Networks (DQN):** a neural network approximates the Q-function, with experience replay to decorrelate training samples and prevent catastrophic forgetting.
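A minimal sketch of such a replay buffer; the class and method names are illustrative, not taken from this repository:

```python
import random
from collections import deque

class ReplayBuffer:
    """Fixed-size store of (state, action, reward, next_state, done) transitions."""

    def __init__(self, capacity=100_000):
        self.buffer = deque(maxlen=capacity)  # oldest transitions are evicted first

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform random sampling breaks the temporal correlation
        # between consecutive positions in the same game.
        return random.sample(self.buffer, batch_size)

    def __len__(self):
        return len(self.buffer)
```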
🔹 **Polyak Averaging:** gradual (soft) target-network updates for more stable learning.
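The target network tracks the online network via $\theta' \leftarrow \tau \theta + (1 - \tau)\theta'$. A sketch in PyTorch, with a typical but assumed value of $\tau$:

```python
import torch

@torch.no_grad()
def polyak_update(target_net, online_net, tau=0.005):
    # theta_target <- tau * theta_online + (1 - tau) * theta_target
    for target_param, online_param in zip(target_net.parameters(),
                                          online_net.parameters()):
        target_param.mul_(1.0 - tau)
        target_param.add_(online_param, alpha=tau)
```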
## Installation

Requirements:

- Python 3.8+
- PyTorch
- NumPy
```bash
git clone https://github.com/pythagon-code/antichess-rl.git
cd antichess-rl
pip install -r requirements.txt
```
## How to Train the Model

Run the following command to train the AI:

```bash
python train.py
```
This will:
- Initialize the game board.
- Train two agents using self-play.
- Store experiences in experience replay buffers.
- Save the trained models as `white.pth` and `black.pth`.
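At a high level, one self-play training episode looks roughly like the sketch below; every name here (`env`, `white`, `black`, `select_action`, and so on) is a hypothetical outline, not this repository's actual code:

```python
import torch

# Hypothetical outline of the self-play training loop.
for episode in range(num_episodes):
    state = env.reset()
    done = False
    while not done:
        agent = white if env.white_to_move else black  # alternate agents by side
        action = agent.select_action(state)            # epsilon-greedy over Q-values
        next_state, reward, done = env.step(action)
        agent.buffer.push(state, action, reward, next_state, done)
        agent.learn()          # sample a batch, apply the Bellman update
        agent.update_target()  # Polyak-averaged target update
        state = next_state

torch.save(white.online_net.state_dict(), "white.pth")
torch.save(black.online_net.state_dict(), "black.pth")
```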
## Testing the Trained Model

To test the trained agent:

```bash
python test.py
```
- The trained AI plays against a random strategy.
- Set `agent_to_play = "white"` or `agent_to_play = "black"` to control which side the AI plays (see the sketch below).
- Results are displayed at the end.
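A rough sketch of what such an evaluation loop can look like; apart from `agent_to_play`, all names are illustrative assumptions:

```python
import random

agent_to_play = "white"  # or "black": the side the trained agent controls

wins = 0
for _ in range(num_games):
    state = env.reset()
    done = False
    while not done:
        if env.side_to_move == agent_to_play:
            action = agent.select_action(state, greedy=True)  # no exploration at test time
        else:
            action = random.choice(list(env.legal_moves()))   # random baseline opponent
        state, reward, done = env.step(action)
    wins += int(env.winner == agent_to_play)

print(f"Win rate as {agent_to_play}: {wins / num_games:.0%}")
```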
## Results

- The trained AI won 76% of its games as White against a random opponent.
- The model demonstrated strategic play, and experience replay improved convergence during training.
## References

- 📖 Bellman Equation: *Understanding the Bellman Equation in Reinforcement Learning*
- 📖 Deep Q-Learning: *Guide to Deep Q-Learning*
- 📖 Polyak Averaging: *How Polyak Averaging Improves RL Stability*