quoridor.policy#

Different functions to evaluate the coup to play.

Functions

`play_greedy`(game)	Evaluate all possibilities and choose best one.
`play_random`(game[, rng])	Evaluate all possibilities and choose any of them.
`play_seeing_future_rec`(game[, n_future, n_sim])	Choose coup with best score after playing n coups.
`play_with_proba`(game[, rng])	Choose one of them based on a exp formula.

quoridor.policy.play_greedy(game)#: Evaluate all possibilities and choose best one.

quoridor.policy.play_random(game, rng=None)#: Evaluate all possibilities and choose any of them.

quoridor.policy.play_seeing_future_rec(game, n_future=2, n_sim=4)#: Choose coup with best score after playing n coups.

quoridor.policy.play_with_proba(game, rng=None)#

Choose one of them based on a exp formula.

This formula affect a 0 proba to all play leading to an incorrect situation and e^100 to the best move.