Poker Driving Artificial Intelligence Research

Poker Driving Artificial Intelligence Research 212

Posted by samzenpus on Monday August 21, 2006 @01:35PM from the I-wouldn't-fold-yet-dave dept.

J-Hawker writes "The Canadian Press has a story about a University of Alberta team that is using Texas Hold-'em to study artificial intelligence. Poker seems to be a much more useful game for this research than chess. From the article: 'Poker has what are currently some of the biggest challenges to (artificial intelligence) systems, and uncertainty is the primary hurdle that we're facing,' said Michael Bowling, adding that the University of Alberta program was able to use its opponents' actions to infer certain things about their hands. 'The same techniques, the same principles that we're developing to build poker systems are the same principles that can be applied to many other problems. The nice thing about chess as a property of the game is what we call perfect information. You look at the board, you know where all the pieces are, you know whose turn it is — you have complete knowledge of the game,' he said. 'But in the real world, knowing everything is just so rare. Everything we do all day long is all about partial information. So poker's much more representative of what the real world's like, and in that sense it becomes a much harder problem.'"

Poker Driving Artificial Intelligence Research

This discussion has been archived. No new comments can be posted.

Search 212 Comments Log In/Create an Account

Comments Filter:

Re:Bluffing (Score:5, Informative)

by Cherita Chen ( 936355 ) writes: on Monday August 21, 2006 @01:53PM (#15949871) Homepage

Most good players don't actually "Bluff" in the sense that they are totally full of crap, and have no hand. Most bluffs are calculated risks based on the overall odds of "Improving" the hand, such as the case with four of a suit and two cards left to be turned over. In that case, the overall odds of hitting the flush are good enough to bet on (unless of course there is a pair showing on the board, which would indicate a possible full-house), even though the player has no "real" hand yet. Situations such as this can be quantified. Granted there are some real morons out there who will try to "bluff" with nothing, they are relatively rare and don't usually last long.

Re:Bluffing (Score:5, Informative)

by AuMatar ( 183847 ) writes: on Monday August 21, 2006 @01:59PM (#15949918)

Good poker players constantly semi-bluff and do continuation bets. They do plain bluffs in some situations too. Its all about reading your opponent- if I think an opponent will lay down a hand if he doesn't have a pair, I will always bet on the flop even with nothing- the odds favor him having nothing as well, and if he always lays down non-pairs I'll win more money by betting than I lose.

Sudoku is trivial for an AI (Score:3, Informative)

by Harmonious Botch ( 921977 ) writes: on Monday August 21, 2006 @02:11PM (#15949994) Homepage Journal

...given a large enough stack. I think I could write a perfect Sudoku program in about 30 lines of code. Most of it would be a reursive routine.

Re:Straight Forward Evaluation (Score:5, Informative)

by brunes69 ( 86786 ) writes: <[slashdot] [at] [keirstead.org]> on Monday August 21, 2006 @02:17PM (#15950039)

What confuses me is how the poker openings differ. I would speculate that a program would be some heuristic relating the ratio of bluffing to "playing the odds." I have gambling friends that play poker all the time and they have these rules that they follow when they play initially against people. They say it's the best until you "know" the people you're playing. Once you can read them then you deviate from the rules. The real irony is that the most successful people I know adhere to a system until they learn someone's movements. Sounds to me like I would write an application that specializes in playing the odds until it recognizes a historical action that statistically reveals the player is bluffing/not bluffing.
You can tell you don't play much poker.
Part of what differentiates a pro player to an amatur player in poker, is the ability to "project an image". A pro player will purposefully *project* an image of a bluffer, or a tight player, so that they can exploit that image of themselves when they see fit in the game.
Thusly, it is very difficlt to get a "read" on a good poker player, because not only do you not know what cards they have, but you don't know how they would play for any two given cards, so you can't use their behaviour to prdict the cards they have.
In the end, the above description is what any decent player is aiming for while they play.
Because of this, a computer can have a hard time going beyond implied odds calulations in determining how to play a hand - and any pro ill tell you, implied odds are a good starting point, but they won't make you money in the long run.

Re:If you are really interested... (Score:2, Informative)

by Anonymous Coward writes: on Monday August 21, 2006 @02:18PM (#15950049)

Why don't you read about the software? This is just a toolkit. This software provides a .DLL that will allow your machine to interface with online poker sites and gather statistical data. There is also a mode that allows for automated play. It is important to note that you really need to be skilled (software/mathematics) to effectivly use the automated software. Again, if anyone is really interested in AI/online poker, this is THE tool to get you started.

Not true at all (Score:3, Informative)

by brunes69 ( 86786 ) writes: <[slashdot] [at] [keirstead.org]> on Monday August 21, 2006 @02:22PM (#15950080)

If you are chasing a flush, knowing the suits of the other person's two cards can ajust your odds bu a huge factor. For example, if you were four to a flush post-flop (giving you odds of 38/100 to hit your flush), and you all of a sudden know that two cards not in play are *not* of your suit, that ups the odds to 43/100 - this is a huge odd jump in hold-em, and can mean the difference between folding and going all-in in a race situation.

Two players colluding in a game is a huge problem. Even a marginal advantage equals to a huge advantage played out over time.

Re:More info than a real player? (Score:3, Informative)

by Best ID Ever! ( 712255 ) writes: on Monday August 21, 2006 @02:23PM (#15950082)

It's possible they're smart enough to start the programs with the same starting conditions in each case, i.e. no knowledge of their opponents hands.

The effect of luck in poker win rates can still be seen over even 100000 hands. Google for "poker" and "confidence interval" for some in depth discussions on it.

Some Actual Factual Information (Score:1, Informative)

by Anonymous Coward writes: on Monday August 21, 2006 @03:02PM (#15950329)

Hi, I've been studying poker bots for a year or so. I'm not a /. member but somebody brought this to my attention and I thought I would post some useful resources.

The Alberta folks are the world's leading (publicly known) authorities on poker AI. (There may be others who are using their knowledge to cheat at online poker, but they're not talking).

Here is the link to the Alberta web site:

http://www.cs.ualberta.ca/~games/poker/ [ualberta.ca]

(Note: you have scroll down a ways to the doc links).

If you want to read just one paper about computer poker, read "The Challenge of Poker". Here is a link (PDF warning):

http://www.cs.ualberta.ca/~darse/Papers/AIJ02.pdf [ualberta.ca]

Some of the Alberta guys spun off into a local company that makes an excellent software product, Poker Academy Pro:

http://www.poker-academy.com/poker-software/ [poker-academy.com]

This software written in Java and offers a programming API so you can plug your own bots into their game. The programming API has an online forum:

http://www.poker-academy.com/forums/viewforum.php? f=3 [poker-academy.com]

Here is the resources page of one of Alberta guys, now a senior developer working on Poker Academy:

http://spaz.ca/poker/ [spaz.ca]

And here is a blog written by a guy who has some excellent LGPL code available (left side of page);

http://pokerforprogrammers.blogspot.com/ [blogspot.com]

Note: do NOT use the version of this software on the Code Project - it is out of date and buggy - get the download from this blog.

There's lots of other stuff of variable quality...all the pages linked here are high-quality content in the sense that the authors have thought deeply about the AI problem and/or are very skilled programmers.

Enjoy,
Jeff

Re:Straight Forward Evaluation (Score:3, Informative)

by enjo13 ( 444114 ) writes: on Monday August 21, 2006 @04:21PM (#15950999) Homepage

The bot would, ideally, be as good as a very observant player, noting those who bluff and those who don't. Obviously noting 1 or 2 bluffs or non-bluffs would not be enough to make a decision, but over the course of a long tournament, or even better a poker playing career, this information would become very useful. The bot would learn its opponents, and this is what makes it an interesting problem.

A large part of what makes Hold'Em a unique challenge is that you really don't have a lot of deterministic information to go off of. A great many hands never reach the stage where you GET to see other players cards. So the data set is inherently incomplete. You can't know that a player likes to bluff, because an effective bluffer never actually shows his cards. After all, the whole idea of bluffing is that you never make it to the end of the hand where you HAVE to show your cards.So its not a simple problem of looking at past history to determine future play, because the past history is quite incomplete. You can know how often a player bluffs and gets called on it, but there may be another player who is actually bluffing much more often that AI can't know about through simple deterministic means.

This is why players have the option of showing their winning hand after they've won a pot, even if it doesn't make it to the final draw. Actually making it to the final card is not that common in practice (I would guess about 1/3 of hands). A good player will intentionally create the PERCEPTION of a certain playstyle by showing (or not) cards at the end of the hand. They can use that in the future to change their playstyle and catch unsuspecting players. A purely historical approach to data (based on a LOT of unknown outcomes) just doesn't work well here.

Re: No more info than a real player. (Score:2, Informative)

by eisbaer4 ( 195961 ) writes: on Monday August 21, 2006 @09:03PM (#15952713) Homepage

I am a member of the Alberta team.

The article did not explain this part of the competition very well. The bots did not have access to any hidden information. The opponent's cards were only revealed at a showdown.

Here's how it works. A mini-match (1000 games) was played, with all of the cards recorded. Then a completely separate mini-match was played with the same cards, but with the positions reversed. This is possible because fresh copies of the programs can be started with a clean slate, having no memory of the previous match. (In fact, the two halves of the mini-match aren't even played on the same computer). Many of these duplicate matches were then played (40 for the fast competition, 12 for the slow competition) between each pair of players.

It is essential to run the matches in duplicate, in order to equalize the opportunities each side gets. The natural fluctation (variance) due to luck is huge in poker, making it very difficult to separate the signal from the noise. The problem is much worse than most people imagine, making accurate measurement of performance a sticky problem. The duplicate matches go a long way toward reducing that variance, and revealing the true long-term expectation.

[On a related note, I created a technique for direct assessment of poker decisions, which cuts away much of the noise. It is described in detail in my PhD thesis, which should be available from the CPRG website (http://games.cs.ualberta.ca/poker/ [ualberta.ca]) in September. A detailed analysis of the AAAI matches will also be available.]

Incidentally, there are a lot of incorrect statements being made in this thread. See the post titled "Some Actual Factual Information" for links to some of the original sources.

- Darse.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Poker Driving Artificial Intelligence Research 212

Poker Driving Artificial Intelligence Research More Login

Poker Driving Artificial Intelligence Research

Re:Bluffing (Score:5, Informative)

Re:Bluffing (Score:5, Informative)

Sudoku is trivial for an AI (Score:3, Informative)

Re:Straight Forward Evaluation (Score:5, Informative)

Re:If you are really interested... (Score:2, Informative)

Not true at all (Score:3, Informative)

Re:More info than a real player? (Score:3, Informative)

Some Actual Factual Information (Score:1, Informative)

Re:Straight Forward Evaluation (Score:3, Informative)

Re: No more info than a real player. (Score:2, Informative)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot