hanabi.rs/README.md

# Simulations of Hanabi strategies

Hanabi is a cooperative card game of incomplete information.
Despite relatively [simple rules](https://boardgamegeek.com/article/10670613#10670613),
the space of Hanabi strategies is quite interesting.
This project provides a framework for implementing Hanabi strategies in Rust.
It also explores some implementations, based on ideas from
[this paper](https://d0474d97-a-62cb3a1a-s-sites.googlegroups.com/site/rmgpgrwc/research-papers/Hanabi_final.pdf).
In particular, it contains an improved version of their "information strategy",
which achieves the best results I'm aware of for games with more than 2 players ([see below](#results)).

Please feel free to contact me about Hanabi strategies, or this framework.

Most similar projects I am aware of:
- https://github.com/rjtobin/HanSim (written for the paper mentioned above)
- https://github.com/Quuxplusone/Hanabi

## Setup

Install rust (rustc and cargo), and clone this git repo.

Then, in the repo root, run `cargo run -- -h` to see usage details.

For example, to simulate a 5 player game using the cheating strategy, for seeds 0-99:
```
cargo run -- -n 100 -s 0 -p 5 -g cheat
```

Or, if the simulation is slow, build with `--release` and use more threads:
```
time cargo run --release -- -n 10000 -o 1000 -s 0 -t 4 -p 5 -g info
```

Or, to see a transcript of the game with seed 222:
```
cargo run -- -s 222 -p 5 -g info -l debug | less
```

## Strategies

To write a strategy, you simply [implement a few traits](src/strategy.rs).

The framework is designed to take advantage of Rust's ownership system
so that you *can't cheat*, without using stuff like `Cell` or `Arc` or `Mutex`.

Generally, your strategy will be passed something of type `&BorrowedGameView`.
This game view contains many useful helper functions ([see here](src/game.rs)).
If you want to mutate a view, you'll want to do something like
`let mut self.view = OwnedGameView::clone_from(borrowed_view);`.
An OwnedGameView will have the same API as a borrowed one.

Some examples:

- [Basic dummy examples](src/strategies/examples.rs)
- [A cheating strategy](src/strategies/cheating.rs), using `Rc<RefCell<_>>`
- [The information strategy](src/strategies/information.rs)!

## Results

On seeds 0-9999, we have these average scores and win rates:

|       |   2p    |   3p    |   4p    |   5p    |
|-------|---------|---------|---------|---------|
|cheat  | 24.8600 | 24.9781 | 24.9715 | 24.9570 |
|       | 90.52 % | 98.12 % | 97.74 % | 96.57 % |
|info   | 22.3360 | 24.7279 | 24.8889 | 24.8991 |
|       | 09.98 % | 80.48 % | 91.37 % | 92.10 % |


To reproduce:
```
n=10000   # number of rounds to simulate
t=4       # number of threads
for strategy in info cheat; do
  for p in $(seq 2 5); do
    time cargo run --release -- -n $n -s 0 -t $t -p $p -g $strategy;
  done
done
```
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00			`# Simulations of Hanabi strategies`

improve to 24.78 (for 5 players) 2016-03-30 07:24:29 +02:00			`Hanabi is a cooperative card game of incomplete information.`
			`Despite relatively [simple rules](https://boardgamegeek.com/article/10670613#10670613),`
			`the space of Hanabi strategies is quite interesting.`
update readme 2016-04-02 23:29:46 +02:00			`This project provides a framework for implementing Hanabi strategies in Rust.`
improve to 24.78 (for 5 players) 2016-03-30 07:24:29 +02:00			`It also explores some implementations, based on ideas from`
			`[this paper](https://d0474d97-a-62cb3a1a-s-sites.googlegroups.com/site/rmgpgrwc/research-papers/Hanabi_final.pdf).`
update readme 2016-04-02 23:29:46 +02:00			`In particular, it contains an improved version of their "information strategy",`
			`which achieves the best results I'm aware of for games with more than 2 players ([see below](#results)).`
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00
update readme 2016-04-02 23:29:46 +02:00			`Please feel free to contact me about Hanabi strategies, or this framework.`
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00
update readme 2016-04-02 23:29:46 +02:00			`Most similar projects I am aware of:`
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00			`- https://github.com/rjtobin/HanSim (written for the paper mentioned above)`
			`- https://github.com/Quuxplusone/Hanabi`

			`## Setup`

update readme 2016-04-02 23:29:46 +02:00			`Install rust (rustc and cargo), and clone this git repo.`
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00
update readme 2016-04-02 23:29:46 +02:00			Then, in the repo root, run `cargo run -- -h` to see usage details.
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00
update readme 2016-04-02 23:29:46 +02:00			`For example, to simulate a 5 player game using the cheating strategy, for seeds 0-99:`
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00			```
update readme 2016-04-02 23:29:46 +02:00			`cargo run -- -n 100 -s 0 -p 5 -g cheat`
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00			```

update readme 2016-04-02 23:29:46 +02:00			Or, if the simulation is slow, build with `--release` and use more threads:
make color = char 2016-03-31 19:17:22 +02:00			```
smart hinting, silencing/configuring of progress output 2016-04-01 11:08:46 +02:00			`time cargo run --release -- -n 10000 -o 1000 -s 0 -t 4 -p 5 -g info`
			```

update readme 2016-04-02 23:29:46 +02:00			`Or, to see a transcript of the game with seed 222:`
smart hinting, silencing/configuring of progress output 2016-04-01 11:08:46 +02:00			```
update readme 2016-04-02 23:29:46 +02:00			`cargo run -- -s 222 -p 5 -g info -l debug \| less`
make color = char 2016-03-31 19:17:22 +02:00			```
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00
update readme 2016-04-02 23:29:46 +02:00			`## Strategies`

minor cleanups 2016-04-04 09:26:42 +02:00			`To write a strategy, you simply [implement a few traits](src/strategy.rs).`
update readme 2016-04-02 23:29:46 +02:00
			`The framework is designed to take advantage of Rust's ownership system`
			so that you can't cheat, without using stuff like `Cell` or `Arc` or `Mutex`.

			Generally, your strategy will be passed something of type `&BorrowedGameView`.
			`This game view contains many useful helper functions ([see here](src/game.rs)).`
			`If you want to mutate a view, you'll want to do something like`
			`let mut self.view = OwnedGameView::clone_from(borrowed_view);`.
			`An OwnedGameView will have the same API as a borrowed one.`

			`Some examples:`

			`- [Basic dummy examples](src/strategies/examples.rs)`
			- [A cheating strategy](src/strategies/cheating.rs), using `Rc<RefCell<_>>`
			`- [The information strategy](src/strategies/information.rs)!`

fix sorting wrong order bug... some cleanup, update results 2016-03-31 09:02:09 +02:00			`## Results`
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00
update readme 2016-04-02 23:29:46 +02:00			`On seeds 0-9999, we have these average scores and win rates:`
smart hinting, silencing/configuring of progress output 2016-04-01 11:08:46 +02:00
Fix broken table 2018-03-24 03:29:59 +01:00			`\| \| 2p \| 3p \| 4p \| 5p \|`
			`\|-------\|---------\|---------\|---------\|---------\|`
			`\|cheat \| 24.8600 \| 24.9781 \| 24.9715 \| 24.9570 \|`
			`\| \| 90.52 % \| 98.12 % \| 97.74 % \| 96.57 % \|`
If a player discards because nobody needed a hint, make that common knowledge "Needing a hint" is defined as having a playable card, but not knowing any specific card is playable. Thus, if a player doesn't know any playable cards and someone else discards, the player (publicly) concludes that all their cards are unplayable. 2019-02-26 18:47:07 +01:00			`\|info \| 22.3360 \| 24.7279 \| 24.8889 \| 24.8991 \|`
			`\| \| 09.98 % \| 80.48 % \| 91.37 % \| 92.10 % \|`
improvements, cleanup, readme 2016-03-19 22:14:29 +01:00

various cleanups, fixes 2016-04-01 09:14:13 +02:00			`To reproduce:`
			```
smart hinting, silencing/configuring of progress output 2016-04-01 11:08:46 +02:00			`n=10000 # number of rounds to simulate`
			`t=4 # number of threads`
various cleanups, fixes 2016-04-01 09:14:13 +02:00			`for strategy in info cheat; do`
			`for p in $(seq 2 5); do`
smart hinting, silencing/configuring of progress output 2016-04-01 11:08:46 +02:00			`time cargo run --release -- -n $n -s 0 -t $t -p $p -g $strategy;`
various cleanups, fixes 2016-04-01 09:14:13 +02:00			`done`
			`done`
			```