Fix warnings and update rand #4

edre · 2021-02-02T18:53:46Z

No description provided.

…player.

Benefits:
* API simplification: the Strategy, Winner, and Evaluator do not need to be passed the current player.
* The caller of choose_move cannot move the same player twice in a row.
* The Game implementer does not need to worry that a caller or Strategy will feed it bogus Players.
* Simplify Evaluator by automatically handling terminal states.

Drawbacks:
* Game state must keep track of current player.
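As a rough illustration of the trade-off described above, here is a minimal sketch of a game whose state tracks the current player itself. The `Nim` type, its rules, and all method names are hypothetical and are not taken from the crate; the point is only that callers pass a move, never a player.

```rust
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Player {
    First,
    Second,
}

impl Player {
    fn other(self) -> Player {
        match self {
            Player::First => Player::Second,
            Player::Second => Player::First,
        }
    }
}

/// Hypothetical toy game: players alternate taking 1 to 3 stones,
/// and whoever takes the last stone wins.
struct Nim {
    stones: u32,
    /// The drawback from the list above: the state carries the current player.
    to_move: Player,
}

impl Nim {
    fn new(stones: u32) -> Self {
        Nim { stones, to_move: Player::First }
    }

    /// Apply a move for whoever is to move. Returns false and changes
    /// nothing if the move is illegal. The caller cannot name a player,
    /// so it cannot move the same player twice in a row.
    fn apply(&mut self, take: u32) -> bool {
        if take == 0 || take > 3 || take > self.stones {
            return false;
        }
        self.stones -= take;
        self.to_move = self.to_move.other();
        true
    }

    /// Once no stones are left, the winner is the player who just moved.
    fn winner(&self) -> Option<Player> {
        if self.stones == 0 {
            Some(self.to_move.other())
        } else {
            None
        }
    }
}
```

Because `apply` flips `to_move` internally, a strategy or evaluator written against this type never needs a player argument.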

Since nodes that get cut off can return the best score so far (which they know they can't beat), we permute the initial order instead of choosing randomly from the set of equally-performing nodes. The benchmark gets about 2x faster, but individual runs are more variable, as exploring best to worst will cut off a lot more than exploring worst to best. It will always be strictly faster than the previous version.
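The idea above can be sketched as follows, under some loud assumptions: `XorShift`, `shuffle`, and `pick_first_best` are hypothetical helpers written for this illustration (the crate itself depends on `rand`), and the `score` callback stands in for a search that may return the current bound for a cut-off move. Because a cut-off move can report a score equal to the best so far, ties are not trustworthy; shuffling once up front and keeping the first strict improvement still gives a random choice among truly equal moves.

```rust
/// Tiny xorshift generator so the sketch needs no external crates.
/// The seed must be nonzero.
struct XorShift(u64);

impl XorShift {
    fn next(&mut self) -> u64 {
        let mut x = self.0;
        x ^= x << 13;
        x ^= x >> 7;
        x ^= x << 17;
        self.0 = x;
        x
    }
}

/// Fisher-Yates shuffle of the root moves.
fn shuffle<T>(items: &mut [T], rng: &mut XorShift) {
    for i in (1..items.len()).rev() {
        let j = (rng.next() % (i as u64 + 1)) as usize;
        items.swap(i, j);
    }
}

/// Keep the first move whose score strictly beats the best so far.
/// `score(m, alpha)` may return `alpha` for a move that was cut off,
/// so a tie with the current best is never treated as equally good.
fn pick_first_best<M: Copy>(
    moves: &[M],
    mut score: impl FnMut(M, i32) -> i32,
) -> Option<M> {
    let mut best = None;
    let mut alpha = i32::MIN;
    for &m in moves {
        let s = score(m, alpha);
        if s > alpha {
            alpha = s;
            best = Some(m);
        }
    }
    best
}
```

A caller would shuffle the root moves once, then run `pick_first_best` over the shuffled slice.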

This serves two purposes: 1) 64 bits seems unnecessary, and 32 bits without an enum tag allows a transposition table to fit more values. 2) A raw i32 allows parallel Strategies to compare and update values with simple atomic operations.
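To illustrate the second point, here is a minimal sketch assuming a hypothetical `SharedBest` wrapper and `best_of` helper (neither is part of the crate). With a plain `i32` evaluation, several threads can raise a shared best score using the standard library's `AtomicI32::fetch_max`, with no lock and no enum tag to compare.

```rust
use std::sync::atomic::{AtomicI32, Ordering};
use std::thread;

/// Hypothetical alias for illustration: a raw 32-bit score.
type Evaluation = i32;

/// Hypothetical shared cell holding the best score any thread has found.
struct SharedBest(AtomicI32);

impl SharedBest {
    fn new() -> Self {
        SharedBest(AtomicI32::new(Evaluation::MIN))
    }

    /// Raise the stored score to `value` if it is higher.
    /// Returns the score that was stored before the call.
    fn offer(&self, value: Evaluation) -> Evaluation {
        self.0.fetch_max(value, Ordering::SeqCst)
    }

    fn get(&self) -> Evaluation {
        self.0.load(Ordering::SeqCst)
    }
}

/// Each worker thread offers its own scores; the cell ends up
/// holding the maximum over all of them.
fn best_of(scores_per_thread: &[Vec<Evaluation>]) -> Evaluation {
    let best = SharedBest::new();
    thread::scope(|s| {
        for scores in scores_per_thread {
            let best = &best;
            s.spawn(move || {
                for &v in scores {
                    best.offer(v);
                }
            });
        }
    });
    best.get()
}
```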

Alternatives considered: I also looked into a few different tiny crates that just do basic RNG with less overhead and cruft. Unfortunately most of them default to a const seed in wasm32-unknown-unknown, so I'd need to do my own getrandom all over the place. Also whatever crate I use will be in the public API of MCTS rollouts, and fancy MCTS rollouts may actually want the fancy distributions that rand provides.

While here I sprinkled Send in some places and removed holding on to any ThreadRngs so that all Strategies can be Send types.

Switching the rollouts API to SmallRng is technically an API change but any actual usage (do I have users?) will very likely be textually compatible.

edre added 30 commits February 2, 2021 10:47

Update to a more recent version of rand.

870d760

Fix all build warnings from stable rustc.

2e14f54

Increase maximum moves to 200.

5529dd5

Benchmark gets 3% slower.

Add a rustfmt config and apply it.

a0557e9

Migrate benchmark to bencher crate to use stable rust.

ef8060f

Add connect four example game.

cb17957

Tweak winning evaluations to prolong defeat and accelerate victory.

51bd581
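One common way to get this behaviour, which may or may not match what the commit does, is to shade terminal scores by their distance from the root. The constant `WIN` and the function `terminal_score` below are hypothetical names chosen for this sketch.

```rust
/// Hypothetical magnitude for a decided game; not the crate's actual value.
const WIN: i32 = 1_000_000;

/// Score of a decided position for the side being evaluated,
/// reached `plies` plies below the root.
/// A quicker win scores higher, and a later loss scores higher.
fn terminal_score(won: bool, plies: i32) -> i32 {
    if won {
        WIN - plies
    } else {
        -WIN + plies
    }
}
```

With this shading, a search that maximises the score prefers a win in 1 ply over a win in 5, and a loss in 5 plies over a loss in 1.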

Add actual evaluator to connect four.

42769e1

Add testing harness for strategies.

2a46eed

Ensure they evaluate the same result as an optimization-free negamax.

Add an iterative search strategy and Zobrist trait.

167539b

Includes transposition table and timeout.

Add benchmarks based on connect four comparing negamax and iterative …

d258cad

…strategies.

Simplify Negamax config by removing the Options struct.

dc34591

Negamax isn't going to grow more options. It will remain the basic Strategy with the least trait requirements of a game.

Simplify IterativeSearch options.

339bab2

Take the mutable options out of the initial configuration, and prohibit setting max depth and max time at the same time.

Encapsulate replacement strategy in TranspositionTable.

74edfcc

Refactor connect4 to twiddle colors less.

e895e84

Update documentation for 0.1.0 release.

fed0f39

Tidy some doc comments.

3854eda

Hide pub testing functions.

77803cf

Add configuration for transposition table replacement strategy.

d3744a4

Add null window search option.

2e649f3

Add option for incrementing 2 depths at a time.

aee2546

Disabled in benchmark as it doesn't help for connect4.

Implement optional quiescence search at the leaf nodes.

5a1553d

Completely untested because I'm not sure what noisy moves would be in connect four.

Compute the principal variation at each iteration.

c41434c

This revealed that the principal variation often runs through cutoff nodes, which is concerning...

Refactor table bookkeeping out of giant negamax function body.

e4f135d

Port value clamping to IterativeSearch.

88192ef

integration test: ensure strategies actually pick one of the best moves

379b1e2

Narrow window after failed scout probe.

27adf8d

edre and others added 30 commits March 21, 2023 21:04

Release 0.5.0

7510412

New unified Game trait
* Unifies functions from Game, Move, Zobrist traits.
* Unifies game state semantics for mutate-in-place and copy-on-play games.
* No useful types need to implement minimax traits, so you can easily wrap games from other crates.

Fix wasm build.

a1acf12

Release 0.5.1

28bee1c

* Fix wasm build.

Add mancala example.

7fb389f

Much easier to implement without undo.

Remove Move from docs

fb410c5

Exponential algorithm for interpreting depth in mcts

9faa5cc

Update rustfmt edition to avoid deprecation warning.

45b1d33

mcts: add verbose mode

46a27d1

It verifies that unweighted rollouts are terrible.

mcts: use scoped threads instead of cloning everything

fed4272

mcts: add custom rollout policy

e451ceb

mcts: implement virtual loss

1a88855

mcts: implement endgame terminal-state propagation.

94c5d94

Apparently this simple idea was introduced in a paper called MCTS-Solver, and I don't get their simulation argument so I didn't implement it their way. My way is stupid simple.

mcts: pick random best child more uniformly

5ec9fb2

mcts: add principal variation

e2ffde1

Release 0.5.2

8e01887

* Various MCTS algorithmic improvements.
* Verbose mode for MCTS.
* Customizable move randomizer for MCTS.

Remove verbose-mode dependency on zobrist_hash.

24ff09b

Leads to confusing crashes in MCTS. The hashes were probably not that useful for humans anyway.

Release 0.5.3

32d49bb

* Don't call zobrist_hash from MCTS's verbose mode.

mcts: Factor out random_best into util library.

87538e1

Removed Copy trait bound on Game::M

70dbd40

To make it easier to implement games with arbitrarily complex moves, where heap allocation is necessary and therefore the Copy trait cannot be easily implemented (if at all).

Merge pull request #1 from Lege19/no-trait-bound-on-move

aa290f4

Removed Copy trait bound on Game::M

Consistent uses of semicolons in verbose logs.

9c9964a

Back out "Removed Copy trait bound on Game::M"

3846d5b

No followup interest to bring this to any of the strategies, and I still think games should be heavily encouraged to find ways to make moves super small and cheap. This backs out commit 70dbd40.

Fix new clippy lints.

4b89a9c

Respect the move ordering in parallel search.

92031c2

Countermoves et al will all be checked first across all threads.

Rename util.rs to common.rs in src/strategies/.

ff79d16

This keeps me from getting confused when I have two util.rs files open in my editor.

Decay old countermoves values faster.

5d030a0

Otherwise high values can stick around several moves after they were relevant.
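A minimal sketch of this kind of decay, assuming a hypothetical `CounterMoves` table indexed by move id (the crate's real table layout, bonus, and decay rate are not shown in this page): every stored value is shifted right between searches, so an entry that stops being rewarded drops to zero after a few moves.

```rust
/// Hypothetical move-ordering table: one score per move id.
struct CounterMoves {
    scores: Vec<u32>,
}

impl CounterMoves {
    fn new(num_moves: usize) -> Self {
        CounterMoves { scores: vec![0; num_moves] }
    }

    /// Reward a move that caused a cutoff; deeper cutoffs count for more.
    /// The depth-squared bonus is an assumption made for this sketch.
    fn reward(&mut self, mv: usize, depth: u32) {
        let bonus = depth.saturating_mul(depth);
        self.scores[mv] = self.scores[mv].saturating_add(bonus);
    }

    /// Age every entry by shifting it right. A larger `shift` decays faster;
    /// a shift of 32 or more clears the table.
    fn decay(&mut self, shift: u32) {
        for s in self.scores.iter_mut() {
            *s = (*s).checked_shr(shift).unwrap_or(0);
        }
    }
}
```

For example, a value of 64 decayed with `shift = 2` becomes 16, then 4, then 1, then 0.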

Release 0.5.4

1c094fd

* Updates to countermoves reordering.
* Upgrade rand dependency.

feat(stats): iterative deepening statistics table via parallel reads

0032065

Merge pull request #3 from rsarvar1a/feature-iterative-statistics

dad8275

feat(stats): iterative deepening statistics table via parallel reads

3 participants