I remember seeing a version of the paper earlier in the year (it talked a lot about getting the bot to be aggressive to avoid stalemates).
Feels like the secret sauce has to be keeping a probability distribution over what each hidden piece is.
Bluffing in Stratego seems like it requires long-term planning (if you move a 2 like a 10, you have to keep treating it like that for the bluff to work).
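A toy sketch of what I mean by that, not anything taken from the paper: track a belief over one hidden piece's rank and update it with Bayes' rule each time the piece does something. The ranks, the prior, and the likelihood numbers below are all made up for illustration, and `update_belief` is a hypothetical helper.

```python
def update_belief(prior, likelihood):
    """Bayes' rule: posterior(rank) is proportional to prior(rank) * P(observation | rank)."""
    unnormalized = {rank: p * likelihood.get(rank, 0.0) for rank, p in prior.items()}
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("observation is impossible under the prior")
    return {rank: p / total for rank, p in unnormalized.items()}


# Assumed prior: the hidden piece is equally likely to be a 2, a 6, a 10, or a bomb.
prior = {2: 0.25, 6: 0.25, 10: 0.25, "bomb": 0.25}

# Assumed likelihoods: how often a piece of each rank would charge forward.
# Bombs cannot move, so any move rules them out entirely.
aggressive_move = {2: 0.1, 6: 0.3, 10: 0.6, "bomb": 0.0}

belief = update_belief(prior, aggressive_move)    # after one aggressive move
belief2 = update_belief(belief, aggressive_move)  # after a second one
```

Under these made-up numbers, every aggressive move shifts more weight onto "10", which is why the bluff has to be sustained: a single timid move would multiply in a factor that favors the low ranks again.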