Saturday, November 23, 2024

How computers made poker a game for nerds

Must read

Thousands of poker players have gathered in Las Vegas this month for their biggest tournament series of the year: the World Series of Poker (WSOP).

For many professional players, the WSOP is an annual pilgrimage. For amateurs, it may be the culmination of years of saving to afford the steep $10,000 entry fee for the main event, which has drawn a record-breaking 10,112 participants this year. The top 1,517 players will walk away with at least $15,000, while the overall winner will claim $10 million at the final table on July 17.

To get “in the money” — that is, to make more than they paid to enter the tournament — even the best players need a fair bit of luck. These days, though, most poker players are not heedless, risk-seeking gamblers. They’re diligent students of the game who have often spent thousands of hours drilling the most optimal strategies, often with the help of a computer.

“It used to be that poker was a game played by degenerates, people who live on the edge,” said Doug Polk, a professional player who has won a WSOP event three times. “But what’s funny about that is, if you boil down poker to its essence, it’s a math game. And so that means that nerds are going to eventually win.”

We’ve come a long way from the image of a tuxedoed James Bond sipping his shaken-not-stirred martini in Casino Royale. Real-life players are instead reaching for Red Bull to power through 10 straight hours of play amid the constant clinking of poker chips. On the windowless casino floor, they don sunglasses to shield any glimpse of their eyes that could give away their position to other players and wear noise-canceling headphones to silence everything but the calculations running in their heads.

They are trying to play, albeit imperfectly, a strategy computer algorithms have revealed as game theory optimal, what players call GTO: the mathematically proven way to become unexploitable to other players while maximizing winnings. How closely players can hew to GTO can often determine their success, especially since the competition has never been fiercer: Anyone can now study GTO with an app on their phone.

“We’ve really seen a shift away from sort of the old-school gambler vibe toward a much more intelligent and sophisticated group of players,” said Polk, who is also a co-founder of the poker training company Upswing Poker and co-owner of the Lodge Card Club in Texas.

But has the relentless pursuit of optimization also made poker a bit … robotic?

How poker became so damn huge

Poker in some form dates back a thousand years, but French colonists imported its European predecessor Poque to the southern US in the 1800s, after which English speakers adopted the game and made it their own.

It took hold in the saloons on the American frontier in the 1870s. US soldiers in World War I made the game more broadly popular. After 1970, when the first World Series of Poker was held, poker tournaments became more commonplace, producing star players like Amarillo Slim and Puggy Pearson (who dropped out of school in the fifth grade). This growth in popularity also spawned the first generation of poker strategy books.

The 1990s saw poker go mainstream as tournaments were increasingly televised in the style of live sports. Films like Rounders, where Matt Damon’s character plays underground poker games to help his friend repay a loan shark, elevated the game in popular culture. The first online poker sites emerged, paving the way for a poker boom in the 2000s.

No-Limit Hold’em became the most popular version of the game. In case you’re unfamiliar, here’s how it works:

As the rules suggest, poker is a game of incomplete information. Players know there are 52 cards in a deck, divided into four suits with 13 cards each. While they can calculate the probabilities of certain cards being in their opponent’s hand or appearing on the table, they cannot know these outcomes for certain. The choices they make around whether to fold, check, bet — and how much to bet — depend on these probabilities, as well as their anticipation of how their opponents will play.

Historically, poker players primarily relied on intuition and experience to make these decisions. They adapted their strategies to exploit any patterns they observed in their opponents’ play. For instance, a player might bluff too often; any opponent who realizes this will exploit that tendency by calling more often and not folding outright, forcing the player to show their weak hand at the end of the betting rounds. Players might also provide subconscious visual tells, such as glancing at their chips when they’re feeling good about their hand and are planning to bet or raise. An opponent might want to proceed cautiously in challenging them.

But this style of play, known as “exploitative play,” isn’t an exact science. Many players developed idiosyncratic styles rooted in their own strengths and weaknesses, leading to diverse approaches and debates about the optimal strategy.

“I do have some nostalgia for the days when you could just put your pants on and go play,” said Eric Seidel, a pro player who won his first of 10 WSOP events in 1992 and has been inducted into the Poker Hall of Fame. “There weren’t really any solutions to things. People were just making things up as they went along and getting by on experience and judgment.”

The rise of GTO and computer solvers, however, completely changed the game.

Time for some game theory

Game theory itself isn’t new. Mathematician John von Neumann and economist Oskar Morgenstern developed it in the 1950s to establish a mathematical way to understand optimal strategies in competitive games, and their work has been applied to a wide range of fields from international relations to the study of animal behavior.

A key concept in game theory that you may remember if you’ve ever taken Economics 101 is a Nash Equilibrium: a state in which each player’s strategy is optimal given the strategies of all other players. In games like poker, a player is unexploitable in this state — it is their best strategy to maximize their winnings in the long run, and no other player can do better by changing their strategy alone. That doesn’t mean they won’t lose a particular hand since luck of the card draw still plays a role, but over many hands, they’ll do better than they would pursuing any other strategy.

After Bill Chen’s seminal book The Mathematics of Poker came out in 2006, top players increasingly started incorporating game theoretical concepts in their play. But GTO really exploded in the mid-2010s with the development of computer poker solvers such as PioSolver or Simple Postflop. These solvers use profit-maximizing algorithms that analyze and predict players’ strategies and payoffs to essentially find the Nash Equilibrium at any point in a game.

Players can practice by running past hands by the solver or asking it to spit out optimal strategies in a certain scenario. They can also play against a computer with apps like GTO Wizard, which will then tell them what they should have done to play optimally. With repetition, GTO strategies start to sink in.

“Some top players spend hours running simulations and memorizing the solutions,” said Liv Boeree, a former professional poker player and the only woman in history to win both a WSOP event and a European Poker Tour event. “This is vastly different to the old-school days where players ‘trained’ through playing, and perhaps honing, general principles. We now have access to much more granular data on optimal play in specific situations.”

Poker, she added, has “evolved from an art into a science.”

However, no human can actually play perfect GTO strategy. Performing the necessary calculations without the help of a computer is just too complicated. That’s because there is an astronomical number of game states in poker: depending on the betting rules and the number of their chips, a player can make a bet of any size and so can their opponents, and there are 2,598,960 possible hands. In practice, players are only approximating GTO; at best, that’s what their opponents are doing as well. This means that players can deviate from textbook GTO to exploit weaknesses in other players and maximize their profits.

Some players do this exceptionally well. Take Phil Ivey or Daniel Negreanu, who came on the scene before GTO was widespread but are still top poker players known for their abilities to read and exploit their opponents. But deviating from GTO can also be risky because it can backfire and open a player to exploitation themselves.

“The beauty of poker is that there are so many different situations that can arise and so many different ways to think about the different spots and the combinations of hands and how likely things are that even if you’re really good, you’re going to be constantly making mistakes,” Polk said.

At the same time, however, Polk said that even simplified GTO strategies can allow a player to reap many of the benefits of textbook GTO. This means that, for most successful players, “poker has become more about executing the math and those strategies at a high level, rather than staring your opponent down in the eye and knowing that he doesn’t have the goods.”

Where does poker go from here?

Not all of the changes wrought by GTO are for the better of the game. For one, it raises a practical problem: It’s now easier than ever to cheat, especially online.

In 2011, the US government cracked down on unregulated online poker sites in an event known by the poker community as “Black Friday.” But such sites have nonetheless proliferated internationally — take the Canada- and Ireland-based site GG Poker, for which Negreanu is a spokesperson — and made a major resurgence during the pandemic.

Some more reputable sites, including GG Poker, have tried to implement anti-cheating software to detect when players are using computer solvers to assist them in real time, banning some accounts and confiscating their winnings.

But enforcement can be spotty, and determined individuals can still find ways to cheat. In one 2020 case, Fedor Kruse, a former video game streamer who became prominent in online high-stakes tournaments, was accused of using real-time assistance software to play GTO by using a second computer that he allegedly called the “dream machine.”

Polk said the only way for a poker player to protect themselves from rampant cheating is to not play online at all.

“It’s really much harder for people to be successful online than it used to be,” Seidel said. “The majority of players are honest, but the ones that aren’t are definitely a problem. And some sites are better than others at policing that kind of thing.”

But beyond cheating, some players think GTO has changed the soul of poker in a way that makes it less fun to play.

“I think it’s worse for the game, frankly, because the days of just no one having any idea are gone,” said Polk, who has somewhat stepped back from the game and focused on his businesses after playing what he estimates to be 7 to 8 million hands of poker in his lifetime. “It’s just turned into who wants to spend the time necessary to execute a high level. I don’t love that … I just don’t feel like it’s that interesting anymore.”

Boeree agrees that GTO “takes away a bit of [poker’s] romantic Wild West character.” But she said it’s not all bad; just different.

For one thing, GTO players tend to play more aggressively than old-school players. That’s because the theory incorporates a balanced mix of frequent value bets and bluffs: GTO players tailor their bet sizes to extract maximum value from their strong hands by encouraging calls from worse hands, using bluffs to keep opponents guessing and prevent exploitation.

This can be exciting to watch in that it frequently leads to bigger pots, crazier bluffs, and dramatic showdowns. For amateurs who don’t know what they’re doing, it’s a lot easier to make costly mistakes when facing GTO opponents. Sometimes, hands are resolved more quickly because a big bet could intimidate players into folding. But thinking through GTO strategy on the spot can also be time-consuming, making some hands go slower.

When playing opponents who came on the scene in the 1990s, Polk said he has noticed that their “style tends to be more conservative. It tends to be more of wait till you have a good hand.”

“Whereas the modern poker strategies, they’re very aggressive. They really go after people in places. They bluff a lot. They attack pots. They make your opponent’s life difficult,” he added.

The democratization of poker knowledge has made top players’ “edges much smaller and the competition fierce,” Boeree said. If every player at a table is playing GTO, the play becomes more about focusing on maintaining GTO and less about reading opponents, which can seem mechanical.

“I’d argue it’s also made it more intellectually fascinating, as the mathematical building blocks of the game are now truly visible for the first time,” she said. “There’s still plenty of room for creativity, especially in live play where you’re dealing with real human faces and emotions. Even the best players still make mistakes and give off tells that can in theory be exploited!”

Seidel said he doesn’t have as much time to sit in front of a computer and study GTO in the way that some younger players might. His experience has helped balance that out, ensuring he’s still one of the world’s best players: He has already notched two major tournament victories this year. The changes in the game, however, are part of what keep him coming back after more than three decades of playing professionally at a top level.

“I do find it’s an exciting time to play with players who have really studied the game,” he said. “It’s really interesting to me because I feel like, as much as I’m a player, I’m also a fan of the game. I really enjoy watching the game develop and trying my best to keep up.”

Latest article