Yet another AIβugh. It's really just a game theory solver with a neural network to learn how good a particular position is.
Coup is an imperfect information social deduction game, that pits players against each other. Just like poker, players do not know other players' cards, but unlike poker, players can swap cards with a central pile to gain more information. This adds a layer of complexity in managing "chance nodes" and player actions that are dependent on it. Game Rules: Official Rulebook
π’ Basic game simulation
- π’ Handles full Coup game flow
- βͺ Better game design architecture instead of many branches
π‘ Card counting mechanism for pruning search
- π’ Create brute force tracker for validation
- π‘ Create memoized card counter for quicker querying of impossible states
- βͺ Front-end visualization
βͺ State Probability Tracker
- βͺ Use a GPU Compute Shader to compute probability of a particular card permutation state
βͺ Counterfactual Regret Minimization (CFR)
- βͺ Basic CFR
- βͺ CVFPR
- βͺ CVFPR + RL
Some papers to reference.
πRelevant Papers
π Accelerating Nash Equilibrium Convergence in Monte Carlo Settings Through Counterfactual Value Based Fictitious Play
π Student of Games: A unified learning algorithm for both perfect and imperfect information games
π Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
π Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
- Sample Implementation
π PerfectDou: Dominating DouDizhu with Perfect Information Distillationg