DeepStack

DeepStack is an artificial intelligence computer program designed to play two-player poker, specifically heads up no-limit Texas hold 'em. It is the first computer program to outplay human professionals in this game.

Background

is a key benchmark game in academic community and substantial amount of research was done to find optimal strategies against worst case adversaries.
While human professionals have been outplayed in large perfect information games, such as Chess, decades before, imperfect information games require much more complex recursive reasoning.
Prior popular approaches relied mainly on simplification of the game by using abstractions. However, abstractions in imperfect-information games often result in highly-exploitable strategies.
Instead, DeepStack uses several algorithmic innovations, such as the use of neural networks and continual resolving.
The program was developed by an international team from Charles University, Czech Technical University and University of Alberta.

Algorithm

At the core of the program is the use of neural networks for determining the value of specific card combinations.
The networks are trained only on a small number of games states and are used to generalize to situations not seen during training.
The program uses search with the neural networks and continual resolving to ensure strategy found at each step is consistent with the strategy used in previous steps.
The search procedure uses counterfactual regret minimization to iteratively update strategy in its lookahead tree, and the neural networks are used for leaf evaluation. The leaf evaluation avoids reasoning about the entire remainder of the game by substituting the computation beyond a certain depth with a fast approximate estimate.

2016 tournament with professional players

In a study completed December 2016, DeepStack defeated 11 professional poker players by playing 44,000 hands of poker. Over all games played, DeepStack won 49 big blinds/100, over four standard deviations from zero, making it the first computer program to beat professional poker players in heads-up no-limit Texas hold'em poker.

Competing approaches

Concurrently with DeepStack, a competing approach from Carnegie Mellon University research group was published, called Libratus. On January 11 to 31, 2017, Libratus was pitted in a tournament against four top-class human poker players. The algorithm was also published in Science. Libratus does not use neural networks for leaf evaluation. Experts argue that using learning with neural networks is more general and it has been indeed used in subsequent works that generalize to other games with imperfect information

Reception by the poker community

Dara O'Kearney, an Irish poker professional who completed 456 hands, claimed that DeepStack played in a style similar to one used by some human players, based on game theory.