holdem

🃏 OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning (forked from wenkesj/holdem)

Installation

git clone https://github.com/VinQbator/holdem.git
cd holdem
pip install .

It is also highly recommended to install https://github.com/mitpokerbots/pbots_calc to get a fast C backend for Monte Carlo equity simulations. (Make sure to add the compiled library's directory to LD_LIBRARY_PATH; this step is not mentioned in that project's instructions.)
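For example, assuming the pbots_calc shared library was built under ~/pbots_calc (a hypothetical location; adjust to wherever you actually built it):

export LD_LIBRARY_PATH="$HOME/pbots_calc:$LD_LIBRARY_PATH"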

Description

This is the first OpenAI Gym No-Limit Texas Hold'em* (NLH) environment written in Python. It's an experiment to build a Gym environment that is synchronous and supports any number of players, while also appealing to the general public that wants to learn how to "solve" NLH.

*Python 3 supports arbitrary-length integers 💸

Right now, this is a work in progress, but I believe the API is mature enough for some preliminary experiments. Join us in making some interesting progress on multi-agent Gym environments.

Usage

There is limited documentation at the moment. I'll try to make this less painful to understand.

Example

Run python example.py from the repo root.

env = holdem.TexasHoldemEnv(n_seats, max_limit=100000, all_in_equity_reward=False, equity_steps=100, autoreset_stacks=True, debug=False)

Creates a Gym environment representing an NLH table from the following parameters (a construction sketch follows the list):

  • n_seats - number of seats at the table. No players are initially allocated to the table; you must call env.add_player(seat_id, ...) to populate it.
  • max_limit - used to define the gym.spaces API for the class, in support of gym.spaces.Discrete. It does not actually impose any NLH betting limit.
  • all_in_equity_reward - use Monte Carlo simulation to pay out winnings and rewards from the environment based on each player's equity in all-in situations.
  • equity_steps - number of Monte Carlo simulations to run when estimating equity.
  • autoreset_stacks - automatically reset stacks after every hand.
  • debug - print debug statements during play; this will probably be removed in the future.
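As a minimal construction sketch, using the default values shown above (the 4-seat count is an arbitrary choice for illustration):

import holdem

# Create a 4-seat NLH table; no players are seated yet.
env = holdem.TexasHoldemEnv(
    n_seats=4,
    max_limit=100000,
    all_in_equity_reward=False,
    equity_steps=100,
    autoreset_stacks=True,
    debug=False,
)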

env.add_player(seat_id, stack=2500)

Adds a player to the table at the specified seat (seat_id), with stack giving the initial amount of chips allocated to the player. If the table does not have enough seats according to the n_seats passed to the constructor, a gym.error.Error will be raised.
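For example, seating two players with 2500-chip stacks (seat ids are assumed to be 0-indexed):

# Seat two players at seats 0 and 1.
env.add_player(0, stack=2500)
env.add_player(1, stack=2500)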

(player_states, community_states) = env.reset()

Calling env.reset resets the NLH table to a new hand state and returns the initial observation. Further reset behavior is reserved for a future portion of the API; like much of this multi-agent setup, it is not standard in Gym environments and is a work in progress.

The observation returned is a tuple of the following, by index (a sketch of unpacking it follows the list):

  1. player_states - a tuple where each entry is tuple(player_info, player_hand); all infos and hands can be gathered with (player_infos, player_hands) = zip(*player_states).

    • player_infos - a list of int features describing the individual player. It can be conveniently indexed with utils.player_table, e.g. player_infos[0][player_table.STACK]. The first entry is always the player that needs to act now. It contains the following fields:

      • CURRENT_BET - amount already put in the pot on this street
      • STACK - player's current stack
      • IS_IN_POT - whether the player still has cards (has not folded)
      • HAS_ACTED - whether the player has acted on this street
      • IS_ALL_IN - whether the player is all in
      • LAST_SIDEPOT - player's last sidepot
    • player_hands - a list of int features describing the cards in the player's pocket. The values are encoded using the treys.Card integer representation.

  2. community_states - a tuple(community_infos, community_cards) where:

    • community_infos - a list of public information common to everyone at the table. Positions here are absolute seat positions, not indexes into the players list (subtract TO_ACT_POS from BUTTON_POS, modulo the number of players, to get the dealer's index in the players list):

      • BUTTON_POS - location of the dealer button, where the big blind is posted.
      • SMALL_BLIND - the current small blind amount.
      • POT - the current total amount in the community pot.
      • LAST_RAISE - the last posted raise amount.
      • MINRAISE - the minimum required raise amount, if above 0.
      • TO_CALL - the TOTAL amount required to call, including what you have already put in the pot.
      • TO_ACT_POS - the position of the player currently required to act.
    • community_cards - a list of int features describing the cards on the board. The values are encoded using the treys.Card integer representation. The list always contains 5 ints, where -1 represents that no card has been dealt yet.
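Putting the pieces together, here is a minimal sketch of unpacking an observation. player_table follows utils.player_table as described above; community_table is assumed to be the analogous index helper for community_infos, and treys is assumed to be installed:

from treys import Card

from holdem.utils import player_table, community_table  # import path and community_table are assumptions

(player_states, community_states) = env.reset()
(player_infos, player_hands) = zip(*player_states)
(community_infos, community_cards) = community_states

# The first player_infos entry is always the player that needs to act now.
my_stack = player_infos[0][player_table.STACK]
to_call = community_infos[community_table.TO_CALL]

# Decode the board; -1 marks a card that has not been dealt yet.
board = [Card.int_to_str(c) for c in community_cards if c != -1]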

Credits

Huge thanks to @BigBadBurrow for pointing out a ton of bugs!
