# Simulations of Hanabi strategies

Hanabi is a cooperative card game of incomplete information.
Despite relatively [simple rules](https://boardgamegeek.com/article/10670613#10670613),
the space of Hanabi strategies is quite interesting.

This repository provides a framework for implementing Hanabi strategies.
It also explores some implementations, based on ideas from
[this paper](https://d0474d97-a-62cb3a1a-s-sites.googlegroups.com/site/rmgpgrwc/research-papers/Hanabi_final.pdf).

In particular, it contains a variant of their "information strategy", with some improvements.
This variant achieves the best results I am aware of for games with more than two players (see the results below).

Please contact me if you:
- Know of other interesting or strong strategy ideas
- Have questions about the framework or the existing strategies

Some similar projects I am aware of:
- https://github.com/rjtobin/HanSim (written for the paper mentioned above)
- https://github.com/Quuxplusone/Hanabi

## Setup

Install Rust (`rustc` and `cargo`), then print the available options:

`cargo run -- -h`

```
Usage: target/debug/rust_hanabi [options]

Options:
    -l, --loglevel LOGLEVEL
                        Log level, one of 'trace', 'debug', 'info', 'warn',
                        and 'error'
    -n, --ntrials NTRIALS
                        Number of games to simulate (default 1)
    -t, --nthreads NTHREADS
                        Number of threads to use for simulation (default 1)
    -s, --seed SEED     Seed for PRNG (default random)
    -p, --nplayers NPLAYERS
                        Number of players
    -g, --strategy STRATEGY
                        Which strategy to use. One of 'random', 'cheat', and
                        'info'
    -h, --help          Print this help menu
```

For example,

```
cargo run -- -n 10000 -s 0 -p 5 -g cheat
```

Or, if the simulation is slow (as it is for the info strategy), build in release mode and use several threads:

```
time cargo run --release -- -n 10000 -o 1000 -s 0 -t 4 -p 5 -g info
```

Or, to see a transcript of a single game:

```
cargo run -- -s 2222 -p 5 -g info -l debug | less
```

## Results

On seeds 0-9999, the average scores (out of a maximum of 25) are:

|          |   2p    |   3p    |   4p    |   5p    |
|----------|---------|---------|---------|---------|
| cheating | 24.8600 | 24.9781 | 24.9715 | 24.9583 |
| info     | 18.5909 | 24.1655 | 24.7922 | 24.8784 |

To reproduce:
```
n=10000   # number of games to simulate
t=4       # number of threads
for strategy in info cheat; do
  for p in $(seq 2 5); do
    time cargo run --release -- -n $n -s 0 -t $t -p $p -g $strategy;
  done
done
```
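
To make the roles of `-s`, `-n`, and `-t` more concrete, here is a minimal sketch of averaging scores over a range of seeds split across threads. It assumes that trial `i` uses seed `SEED + i`, which is what "seeds 0-9999" with `-s 0 -n 10000` suggests. `play_game` and `average_score` are hypothetical names, and `play_game` is a dummy stand-in for the simulator, not this repository's code.

```rust
use std::thread;

/// Hypothetical stand-in for simulating one game: maps a seed to a score in 0..=25.
/// A real simulator would shuffle the deck from the seeded PRNG and run a strategy.
fn play_game(seed: u32) -> u32 {
    // Cheap deterministic mixing so each seed yields a repeatable pseudo-score.
    let mixed = seed.wrapping_mul(2_654_435_761).rotate_left(13);
    mixed % 26
}

/// Average score over seeds first_seed..first_seed + n_trials,
/// with the work divided among n_threads workers.
fn average_score(first_seed: u32, n_trials: u32, n_threads: u32) -> f64 {
    assert!(n_trials > 0 && n_threads > 0);
    let handles: Vec<_> = (0..n_threads)
        .map(|worker| {
            thread::spawn(move || {
                // Worker k handles offsets k, k + n_threads, k + 2 * n_threads, ...
                (worker..n_trials)
                    .step_by(n_threads as usize)
                    .map(|offset| u64::from(play_game(first_seed + offset)))
                    .sum::<u64>()
            })
        })
        .collect();
    // Scores are summed as integers, so the total does not depend on thread count.
    let total: u64 = handles
        .into_iter()
        .map(|h| h.join().expect("worker thread panicked"))
        .sum();
    total as f64 / f64::from(n_trials)
}
```

Under these assumptions, `average_score(0, 10_000, 4)` corresponds to `-s 0 -n 10000 -t 4`. Because each seed fully determines its game in this sketch, changing the thread count changes only the running time, not the reported average.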