Poker Driving Artificial Intelligence Research

Poker Driving Artificial Intelligence Research 212

Posted by samzenpus on Monday August 21, 2006 @01:35PM from the I-wouldn't-fold-yet-dave dept.

J-Hawker writes "The Canadian Press has a story about a University of Alberta team that is using Texas Hold-'em to study artificial intelligence. Poker seems to be a much more useful game for this research than chess. From the article: 'Poker has what are currently some of the biggest challenges to (artificial intelligence) systems, and uncertainty is the primary hurdle that we're facing,' said Michael Bowling, adding that the University of Alberta program was able to use its opponents' actions to infer certain things about their hands. 'The same techniques, the same principles that we're developing to build poker systems are the same principles that can be applied to many other problems. The nice thing about chess as a property of the game is what we call perfect information. You look at the board, you know where all the pieces are, you know whose turn it is — you have complete knowledge of the game,' he said. 'But in the real world, knowing everything is just so rare. Everything we do all day long is all about partial information. So poker's much more representative of what the real world's like, and in that sense it becomes a much harder problem.'"

Poker Driving Artificial Intelligence Research

This discussion has been archived. No new comments can be posted.

Search 212 Comments Log In/Create an Account

Comments Filter:

Straight Forward Evaluation (Score:5, Interesting)

by eldavojohn ( 898314 ) * writes: <eldavojohn.gmail@com> on Monday August 21, 2006 @01:37PM (#15949746) Journal

Poker seems to be a much more useful game for this research than chess.
This shouldn't be a surprise. Poker has the advantage of always being able to simply evaluate your chip count. Chess doesn't. You can't enumerate chess games through the entire gamespace so the initial opening moves are based on libraries or heuristics. In response to the machine not knowing all aspects of the "game space," I thought that there were a lot of developments in the field that allowed these to be accounted for. What ever happened to good old Trial and Error [wikipedia.org] or Fuzzy Systems [austinlinks.com]? Aren't these viable strategies when playing poker?

What confuses me is how the poker openings differ. I would speculate that a program would be some heuristic relating the ratio of bluffing to "playing the odds." I have gambling friends that play poker all the time and they have these rules that they follow when they play initially against people. They say it's the best until you "know" the people you're playing. Once you can read them then you deviate from the rules. The real irony is that the most successful people I know adhere to a system until they learn someone's movements. Sounds to me like I would write an application that specializes in playing the odds until it recognizes a historical action that statistically reveals the player is bluffing/not bluffing.

Simply put, unless you knew someone's reputation as being a bluffer, you would play the opening hand always the same way. Aren't we forced to program the "AI" of the poker software as being this simple heuristic? Will programs ever be able to "read" players intelligently or will they rely on Markov models & statistics they develop from playing against the same human over and over?

Most unfortunate is the fact that the primary reason my friends gamble is they don't experience the same kind of rush while playing other games as they do with poker because it's more social than other games. If we program applications to beat humans, where does the "social aspect" of the game go?

Even more interesting is the network of poker bots [msn.com] that are set up and running some of the web sites that host poker players. Imagine sitting down at a table of five with four of the other seats taken. Now imagine that these aren't humans but instead bots on four different IP addresses that are sharing card information over an IP connection so that they can leverage odds over you and stop themselves from making stupid mistakes (i.e. they share a card on the table for a pair but really need three of a kind to pose a threat). There's a reason why the percentages fluctuate on TV when cards are revealed whether they be in the flop or in another player's hand.

Re:Straight Forward Evaluation (Score:5, Interesting)

by bdonalds ( 989355 ) writes: on Monday August 21, 2006 @01:43PM (#15949790) Homepage

Now imagine that these aren't humans but instead bots on four different IP addresses that are sharing card information over an IP connection so that they can leverage odds over you and stop themselves from making stupid mistakes
Just to address a small part of your post- Bots Schmots! This is a problem already with humans. I used to like to play Euchre and the like online, but too many times it became obvious my opponents were communicating to each other and ruined the fun.

Re:Straight Forward Evaluation (Score:3, Interesting)

by Pulzar ( 81031 ) writes: on Monday August 21, 2006 @01:52PM (#15949854)

Simply put, unless you knew someone's reputation as being a bluffer, you would play the opening hand always the same way. Aren't we forced to program the "AI" of the poker software as being this simple heuristic?

The simpler the heuristic used to program the AI, the easier it will be for the opponents to figure out what the bot is doing. A big difference between a mediocre and a successful poker player is the ability to vary their play significantly enough to make it hard for anybody to put them on a hand, without impacting their play so much that they are playing badly.

There are many systems out there developed for the "opening hand", as you call it, and, yes, AI can be programmed to play the preflop game fairly well. After the flop, though, it's a whole different game. As much as you hear about odds in poker, it's not a matter of simple math to calculate them and play "proper odds". You only know your odds if you know exactly what every opponent has... and that's where simple heuristics fail miserably.

Finally, even if you knew everybody's cards, even then you would still need to know exactly how much they are going to bet (if at all) in the future rounds of betting in order to calculate the exact odds you're getting. Once again, that's something that's still very hard for bots to figure out.

Not anytime soon. (Score:4, Interesting)

by Lejade ( 31993 ) * writes: <olivier AT mekensleep DOT com> on Monday August 21, 2006 @01:52PM (#15949858) Homepage Journal

Before anyone goes off about how AIs will eventually replace us, my company runs a (GPL and GNU/linux friendly) poker site [pok3d.com] and the last thing i am worried about is bots taking over humans in no-limit games. To win consistently against serious players an AI would need to be a LOT smarter than what the guys from Alberta have. It would need to have a serious grasp of human psychology. It might happen, eventually, but by then society might have changed so much that "money" might also be an obsolete concept...

And even if such software existed, it would basically mean that you couldn't win at online poker anymore because skill would not be relevant anymore. That wouldn't be very different from the current situation with player-versus-casino luck games (like roulette or slots).
And we can all see how poorly these are doing, right? :)

Already bots playing (Score:4, Interesting)

by slapyslapslap ( 995769 ) writes: on Monday August 21, 2006 @01:55PM (#15949882)

There are already bots playing against unsuspecting people at the online casinos. I'm not sure how much AI is involved, but apparently they play better than most humans.

Limit versus No-Limit Texas Hold'em (Score:2, Interesting)

by harryk ( 17509 ) writes: <jofficer@@@gmail...com> on Monday August 21, 2006 @02:03PM (#15949943) Homepage

there are a few things that stand out, about this level of developing.

First, they are playing limit hold'em, which I assume to mean pot-limit texas hold'em. While thats fine, and you'll find plenty pf people that play Pot-Limit, its still a very different game than No-Limit hold'em.

A second thing that I am inferring from the game, is that they are playing heads-up, meaning 1 on 1. Again, this is cool, and I think its a great first step, I still relate that back to Chess. Now if they can take that same AI and play against 8 or 9 other players, effectively, then I'll be impressed.

Re:Straight Forward Evaluation (Score:5, Interesting)

by Propagandhi ( 570791 ) writes: on Monday August 21, 2006 @02:03PM (#15949945) Journal

Aren't we forced to program the "AI" of the poker software as being this simple heuristic? Will programs ever be able to "read" players intelligently or will they rely on Markov models & statistics they develop from playing against the same human over and over?

Playing poker with 100% consistency is no way to be an excellent poker player. It's easy to make a bot that follows a set of statistics which give it a good chance to win regardless of how their opponent ha played in the past, but if the bot takes into account the player's past actions then it can improve its chances of success. Taking into account the opponent's aggressiveness becomes especially important late in a tournament style match (when other players have been eliminated), most bots aren't designed to play in these situations (hence why you don't see many bots in tournaments, playing instead at the normal tables).

The bot would, ideally, be as good as a very observant player, noting those who bluff and those who don't. Obviously noting 1 or 2 bluffs or non-bluffs would not be enough to make a decision, but over the course of a long tournament, or even better a poker playing career, this information would become very useful. The bot would learn its opponents, and this is what makes it an interesting problem.

Even more interesting is the network of poker bots that are set up and running some of the web sites that host poker players.

I'd argue that cheating at online poker isn't very interesting at all. Humans can do the exact same thing, and online poker companies monitor game's to ensure that there isn't an uncommonly high percentage of people in the same area playing any game. Obviously it might be easier to distribute the bots across the country, but I think it's still more likely (today) to run into actual players grifting you in this manner.

There's a reason why the percentages fluctuate on TV when cards are revealed whether they be in the flop or in another player's hand.

Quantum physics, right? You can accurately determine the odds of winning, or the cards in hand, but not both at the same time? Swear I read something about this somewhere.

Re:Straight Forward Evaluation (Score:2, Interesting)

by The Mad Debugger ( 952795 ) writes: on Monday August 21, 2006 @02:16PM (#15950025)

What confuses me is how the poker openings differ. I would speculate that a program would be some heuristic relating the ratio of bluffing to "playing the odds." I have gambling friends that play poker all the time and they have these rules that they follow when they play initially against people. They say it's the best until you "know" the people you're playing. Once you can read them then you deviate from the rules.
and
Simply put, unless you knew someone's reputation as being a bluffer, you would play the opening hand always the same way. Aren't we forced to program the "AI" of the poker software as being this simple heuristic? Will programs ever be able to "read" players intelligently or will they rely on Markov models & statistics they develop from playing against the same human over and over?
Unfortunately it's not quite that simple. First of all, the opening strategies have quite a lot of inputs. Your action varies based on the cards in your hand, the amount of money on the table, your position in the game (did you act first, or last, or somewhere in between), who is in or out, how they've acted (called or raised), etc.

Second, good poker strategy is not just reactive. It includes active attempts to probe for information by examining the reactions of others at the table. In general your strategy remains fairly constant, but you adjust your behavior (how aggressively do you play), based on the information that you've learned by observing the play of others at the table.

Third, good poker strategy is intentionally deceptive. Sometimes you might specifically decide to play the same hand a different way the second time than the first. Sometimes you will bet your cards for value, and others you won't.

In the long run, you're going to end up with an estimation of each player, and how well/poorly agressively/meekly they're playing, and that's information that you form over time. It's also hard to come by sometimes, because when a player folds, you generally don't get to see their cards, and it can be very expensive to take lots of hands all the way to a showdown just to see what the other player had. At the same time, you're trying to hide as much information as you can from the other players at the table, and possibly create a false image of your intents.

That's what this article is about there is a *lot* there, which makes it a very interesting problem. Your behaviors aren't nearly as fixed as they are in chess. It's further complicated by conflicting opinions as to the best play of certain hands and situations.

If you're interested in the mathematical and game theory aspects of poker, check out "The Theory of Poker" by David Slansky. It has lots of great discussion on the mathematical basis of decisionmaking in poker, including theory of bluffing, etc. Of course, as I just mentioned, it's not the only opinion on how poker should be played, but it's a good starting point.

Most unfortunate is the fact that the primary reason my friends gamble is they don't experience the same kind of rush while playing other games as they do with poker because it's more social than other games. If we program applications to beat humans, where does the "social aspect" of the game go?
Poker bots can be mostly sucessful because there are a lot of iditots who play mathematically unsound poker, and are pretty much begging to give their money away. If your bot playes sound poker, it doesn't matter if you give up some money to players who are playing better. Your expectation is positive because there is so much money there to be had.

If you don't like the idea of bots, get some friends, some beer, and play in your kitchen. It makes it much harder to have colluding computers (as you described) take your cash. Plus, then, you also get to drink beer.

Re:Already bots playing (Score:3, Interesting)

by UnanimousCoward ( 9841 ) writes: on Monday August 21, 2006 @02:17PM (#15950031) Homepage Journal

The bot issue is orthogonal to AI research WRT hold'em at this point. The theory behind deploying bots is playing 'solid' poker in low-stakes games (since that's where the 'bad' players are), winning pennies or small bucks per hour, and massively scaling up. The AI angle is, of course, more intriguing against 'good' players.

Re:Straight Forward Evaluation (Score:5, Interesting)

by rcs1000 ( 462363 ) * writes: <rcs1000@NoSpaM.gmail.com> on Monday August 21, 2006 @02:18PM (#15950042)

Two things:

(1) Knowing the cards of the other players is a small, but significant, advantage. Say you've got two hearts, and your three buddies have a heart each. Well, you're chance of getting another three hearts on the table are significantly affected. (Likewise, if they have none, it increases the chance you'll want to stay in and catch the flop.)

(2) Much more serious, though, is collusion in betting. You and your buddy can conspire to raise the pot *as much as you like*. In a fixed raise game, this is an enormous advantage. Another player cannot just "call" and see the next card, as there will always be a player still to call who can reraise.

Personally, though, I love bots. I'm happy to play them all day long. (So long as they're not colluding, of course...)

Cheers,

Robert

Re:Innovation through vices (Score:3, Interesting)

by tbone1 ( 309237 ) writes: on Monday August 21, 2006 @02:18PM (#15950043) Homepage

Aside from military use (which to some might be a vice as well), isn't it interesting how much of our innovation nowadays is centered around profiting from people's vices (gambling, sex/porn, etc)
Considering that a lot of naval technology of the 16th, 17th, and 18th centuries was stimulated by things like warfare, tobacco, sugar/rum, tea, coffee, and the slave trade, is this really surprising?

Computer Go (Score:5, Interesting)

by dahl_ag ( 415660 ) writes: on Monday August 21, 2006 @02:21PM (#15950071)

While it probably doesn't have nearly the financial motivation that poker does, the AI behind Computer Go [wikipedia.org] also represents a huge challenge [wikipedia.org]. The rules of Go are very simple, but it is impossible to 'solve' using brute-force techniques like you might use with something like chess.

Poker isn't the best! (Score:3, Interesting)

by PresidentEnder ( 849024 ) writes: <wyvernender@@@gmail...com> on Monday August 21, 2006 @02:28PM (#15950122) Journal

Poker is only one of many double-blind, "real-world" games out there. I like the idea of making an AI learn poker (poker masters are more like human beings than chess masters, certainly), but it is my humble opinion that Kriegspiel [wikipedia.org] is where it's really at.

Re:Straight Forward Evaluation (Score:3, Interesting)

by Relic of the Future ( 118669 ) writes: <dales@digitEINST ... minus physicist> on Monday August 21, 2006 @02:46PM (#15950245)

When it's REAL MONEY on the line, any advantage can be significant enough to warant taking advantage of it.
That said, uses (and abuses and detection) of out-of-band communication isn't what this research is about; those are concerns for someone else's research project. It's a problem that has plagued poker (and euchre and bridge and a thousand other partial-information games) since the game was invented. That's not a technological problem or a decision making problem, it's a social problem, and A.I. hasn't quite yet advanced to the point where we can worry about it trying to cheat at cards. (Although I welcome the day that we do have to worry about that.)

Limit vs. No-Limit (Score:3, Interesting)

by BadBlood ( 134525 ) writes: on Monday August 21, 2006 @02:49PM (#15950265)

One distinction to make is that bots can be and have been successful playing against human opponents in limit poker, where the bet size is fixed on each betting round.

In no-limit poker, when each bet has the potential to cost your opponent all of their money/chips, the decision making process is more critical and mistakes more costly. Variance in no-limit poker is much larger and the AI required to determine whether your opponent is bluffing or has "the nuts" becomes a much larger problem to solve.

Re:Straight Forward Evaluation (Score:3, Interesting)

by Pulzar ( 81031 ) writes: on Monday August 21, 2006 @03:51PM (#15950734)

(1) Knowing the cards of the other players is a small, but significant, advantage. Say you've got two hearts, and your three buddies have a heart each.

In general, 4 guys playing together on one table is hard to do more than a couple of times before being flagged on any poker site. So, in most cases, you'll have one buddy telling you that he has no heart (affecting the odds by a negligible amount, and the most likely case), one heart (affecting the odds somewhat, and somewhat less likely to happen), or two (which will happen the least often, but will affect the odds the most).

So, it depends on the definition of 'significant', but the small edge that you will get once every 40-50 hands (probability of having 2 suited cards and flopping a flush draw) is not enough to significantly influence the outcome of a game. The quality of your play in other 39 out of 40 hands will be the deciding factor.

(2) Much more serious, though, is collusion in betting.

Very true, but this is, luckily, very easily detectable when all of the cards are visible to somebody (i.e. the online poker room operator). So, it's something that you can only do for a short period of time before somebody complains to the operator, at which point it will be obvious that two players playing at the same table a lot are following this particular betting pattern.

It's probably only worth it to sit down at the highest limits possible, try to scoop some money very quickly, and never play again at that site. It's not a very good long-term strategy, though :).

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Poker Driving Artificial Intelligence Research 212

Poker Driving Artificial Intelligence Research More Login

Poker Driving Artificial Intelligence Research

Straight Forward Evaluation (Score:5, Interesting)

Re:Straight Forward Evaluation (Score:5, Interesting)

Re:Straight Forward Evaluation (Score:3, Interesting)

Not anytime soon. (Score:4, Interesting)

Already bots playing (Score:4, Interesting)

Limit versus No-Limit Texas Hold'em (Score:2, Interesting)

Re:Straight Forward Evaluation (Score:5, Interesting)

Re:Straight Forward Evaluation (Score:2, Interesting)

Re:Already bots playing (Score:3, Interesting)

Re:Straight Forward Evaluation (Score:5, Interesting)

Re:Innovation through vices (Score:3, Interesting)

Computer Go (Score:5, Interesting)

Poker isn't the best! (Score:3, Interesting)

Re:Straight Forward Evaluation (Score:3, Interesting)

Limit vs. No-Limit (Score:3, Interesting)

Re:Straight Forward Evaluation (Score:3, Interesting)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot