<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="article-commentary">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2018.00210</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>General Commentary</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Commentary: Heads-up limit hold&#x00027;em poker is solved</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Newall</surname> <given-names>Philip W. S.</given-names></name>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/476742/overview"/>
</contrib>
</contrib-group>
<aff><institution>Technical University of Munich</institution>, <addr-line>Munich</addr-line>, <country>Germany</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Ulrich Hoffrage, University of Lausanne, Switzerland</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Adam Goodie, University of Georgia, United States</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Philip W. S. Newall <email>pnew&#x00040;tum.de</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Cognition, a section of the journal Frontiers in Psychology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>21</day>
<month>02</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="collection">
<year>2018</year>
</pub-date>
<volume>9</volume>
<elocation-id>210</elocation-id>
<history>
<date date-type="received">
<day>13</day>
<month>09</month>
<year>2017</year>
</date>
<date date-type="accepted">
<day>08</day>
<month>02</month>
<year>2018</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2018 Newall.</copyright-statement>
<copyright-year>2018</copyright-year>
<copyright-holder>Newall</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<related-article id="RA1" related-article-type="commentary-article" journal-id="Science" journal-id-type="nlm-ta" vol="347" page="145" xlink:href="25574016" ext-link-type="pubmed">A commentary on <article-title>Heads-up limit hold&#x00027;em poker is solved</article-title> by Bowling, M., Burch, N., Johanson, M., and Tammelin, O. (2015). Science 347, 145&#x02013;149. doi: <object-id>10.1126/science.1259433</object-id></related-article>
<kwd-group>
<kwd>games</kwd>
<kwd>game theory</kwd>
<kwd>expertise</kwd>
<kwd>artificial intelligence</kwd>
<kwd>decision making</kwd>
</kwd-group>
<counts>
<fig-count count="0"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="21"/>
<page-count count="3"/>
<word-count count="1671"/>
</counts>
</article-meta>
</front>
<body>
<p>The game of poker, with its tactics of bluffing and deception, has frequently captured the imagination. In one example from popular culture, James Bond defeats a terrorist financier at the poker table in the film <italic>Casino Royale</italic>. Bond&#x00027;s poker skill reflects his abilities as a spy: Spotting lies and deception, and thinking one move ahead of his opponent. But like other domains of human skill, poker has been affected by the rise of the machines. In 2015, a supercomputer with 48 CPUs running for 68 days &#x0201C;solved&#x0201D; heads-up limit hold&#x00027;em poker, the simplest poker game played for money in casinos and online (Bowling et al., <xref ref-type="bibr" rid="B3">2015</xref>). This computer cannot be beaten, even in a human lifetime of play. This commentary analyzes the perfect strategy from Bowling et al.&#x00027;s target article to ask: Does the computer&#x00027;s strategy in the game&#x00027;s key initial decision reflect poker expert wisdom, or does the computer play entirely differently?</p>
<p>Games are a common domain for testing the relative skills of experts and computers. In 1997 Garry Kasparov famously lost to Deep Blue in chess, and in 2016 Lee Sedol lost to AlphaGo. In 2008 a similar expert-computer match occurred for heads-up limit hold&#x00027;em poker (CPRG, <xref ref-type="bibr" rid="B8">2008</xref>). A team of seven professionals played &#x0201C;Polaris,&#x0201D; a computer designed by researchers from the University of Alberta (who later built the 2015 supercomputer). Polaris was the overall winner, although the professional Matt Hawrilenko, who was viewed by many as the most-skilled in this poker game (Brodie, <xref ref-type="bibr" rid="B4">2008</xref>; Arnett, <xref ref-type="bibr" rid="B1">2009</xref>; Nalbone, <xref ref-type="bibr" rid="B13">2011</xref>), emerged a net winner.</p>
<p>Even a relatively simple card game involving two players and a 52 pack of cards can create significant complexity. More precisely, there are 3.16 &#x000D7; 10<sup>17</sup> potential game states in this poker game (Bowling et al., <xref ref-type="bibr" rid="B3">2015</xref>). Humans must initially simplify complex problems to learn and improve their performance (Dreyfus and Dreyfus, <xref ref-type="bibr" rid="B9">1986</xref>). We use simple &#x0201C;heuristics&#x0201D; even for problems much simpler than this poker game (Gigerenzer et al., <xref ref-type="bibr" rid="B10">1999</xref>; Hertwig et al., <xref ref-type="bibr" rid="B11">2013</xref>). Poker theorists suggest two relevant simplifying principles: aggression and information hiding (Chen and Ankenman, <xref ref-type="bibr" rid="B6">2006</xref>). It is generally better to be aggressive by <italic>raising</italic> the stakes, rather than equalling the stakes by <italic>calling</italic>. It is generally better to hide information by playing many hands the same way, rather than having a unique strategy for specific hands.</p>
<p>Here is a simple strategy as the first player on the first round reflecting these principles (this situation is both important and relatively simple to analyze). This player can play any individual hand by <italic>folding</italic> (putting no more money in, and immediately forfeiting the hand), <italic>calling</italic> (equalling the bet), or by <italic>raising</italic> (doubling the bet). The strategy involves finding a single threshold point: All hands weaker than this are <italic>folded</italic>, and those stronger are played by <italic>raising</italic> (<italic>raise-or-fold</italic>). <italic>Calling</italic>, as a potential strategy, is never considered (on the initial first round decision; calling might be done later on). Now, the first player&#x00027;s first round strategy must specify play in one additional scenario. If the second player <italic>re-raises</italic>, then the first player is revisited with the fold, call, raise trilemma. (If the second player folds the hand immediately ends; if the second player calls play moves onto the second round.) In this case folding is inadvisable from a risk-reward perspective (Sklansky, <xref ref-type="bibr" rid="B20">1999</xref>). A similar argument means raising accomplishes little (because a skilled second player will never fold). Therefore, <italic>always-calling</italic> is the recommended simple strategy (information hiding trumps aggression in this instance where the principles conflict). So the first player should <italic>raise-or-fold</italic> (based on initial hand strength), and then <italic>always-call</italic>. The entire first round strategy boils down to a single hand: The worst hand worth raising. An effective strategy could not be simpler.</p>
<p>Matt Hawrilenko followed this strategy in the 2008 match (Newall, <xref ref-type="bibr" rid="B14">2011</xref>). Over 1,000 hands, he raised 86.8%, otherwise folding. When facing a re-raise he called every time. Computers do not face the same computational constraints as humans. So it is perhaps not surprising that Polaris, the 2008 computer, used a similar yet more-complex strategy. Polaris raised 85.0% (1.8% lower than Hawrilenko), but called 2.4% (rather than never). When facing a re-raise, Polaris called 83.6% of the time, otherwise raising (compared to Hawrilenko calling 100%).</p>
<p>Polaris from 2008 is significantly weaker at poker and used fewer computational resources than &#x0201C;Cepheus,&#x0201D; the unbeatable 2015 agent (Bowling et al., <xref ref-type="bibr" rid="B3">2015</xref>). So how does Cepheus compare? Surprisingly, the more complex computer agent actually uses a simpler strategy. Table <xref ref-type="table" rid="T1">1</xref> compares the three strategies&#x00027; observable behavior (combining data from Newall, <xref ref-type="bibr" rid="B14">2011</xref>; Bowling et al., <xref ref-type="bibr" rid="B3">2015</xref>). Cepheus initially raises 82.54%, only calling a miniscule 0.06%. Cepheus&#x00027;s initial calling frequency is closer to Hawrilenko&#x00027;s than Polaris&#x00027;s. Cepheus calls 99.1% when facing a re-raise, again much closer to Hawrilenko than Polaris. But you would not be recommended to copy these rare plays. According to one of the study&#x00027;s co-authors, Cepheus&#x00027;s deviations from Hawrilenko&#x00027;s simpler strategy are, &#x0201C;most likely part of the noise that makes it &#x02018;essentially&#x02019; solved and not just solved&#x0201D; (Burch, <xref ref-type="bibr" rid="B5">2015</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Action frequencies as the first player.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Action</bold></th>
<th valign="top" align="center"><bold>Matt Hawrilenko (2008)</bold></th>
<th valign="top" align="center"><bold>Polaris (2008)</bold></th>
<th valign="top" align="center"><bold>Cepheus (2015)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" colspan="4" style="background-color:#bbbdc0"><bold>INITIAL DECISION</bold></td>
</tr>
<tr>
<td valign="top" align="left">Fold</td>
<td valign="top" align="center">13.2%</td>
<td valign="top" align="center">12.6%</td>
<td valign="top" align="center">17.4%</td>
</tr>
<tr>
<td valign="top" align="left">Call</td>
<td valign="top" align="center">0%</td>
<td valign="top" align="center">2.4%</td>
<td valign="top" align="center">0.06%</td>
</tr>
<tr>
<td valign="top" align="left">Raise</td>
<td valign="top" align="center">86.8%</td>
<td valign="top" align="center">85.0%</td>
<td valign="top" align="center">82.54%</td>
</tr>
<tr>
<td valign="top" align="left" colspan="4" style="background-color:#bbbdc0"><bold>WHEN FACING A RE-RAISE</bold></td>
</tr>
<tr>
<td valign="top" align="left">Fold</td>
<td valign="top" align="center">0%</td>
<td valign="top" align="center">0%</td>
<td valign="top" align="center">0%</td>
</tr>
<tr>
<td valign="top" align="left">Call</td>
<td valign="top" align="center">100%</td>
<td valign="top" align="center">83.6%</td>
<td valign="top" align="center">99.9%</td>
</tr>
<tr>
<td valign="top" align="left">Raise</td>
<td valign="top" align="center">0%</td>
<td valign="top" align="center">16.4%</td>
<td valign="top" align="center">0.1%</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In conclusion, the expert&#x00027;s strategy in 2008 in this key situation closely matched the unbeatable 2015 computer&#x00027;s strategic frequencies. This is another example of expert performance approaching perfect game theoretic strategy (Walker and Wooders, <xref ref-type="bibr" rid="B21">2001</xref>; Chiappori et al., <xref ref-type="bibr" rid="B7">2002</xref>; Palacios-Huerta, <xref ref-type="bibr" rid="B16">2003</xref>). Although the 2015 computer is unbeatable, the expert&#x00027;s knowledge is more robust to other poker games (Lake et al., <xref ref-type="bibr" rid="B12">2016</xref>). And the expert can adjust strategy to take greater advantage of opponents&#x00027; mistakes. (Newall, <xref ref-type="bibr" rid="B15">2013</xref>, explores less crucial situations in this poker game where computers play unlike most experts.) Simple heuristics have been recommended for changing environments (Bookstaber and Langsam, <xref ref-type="bibr" rid="B2">1985</xref>), for when computational resources are limited (Simon, <xref ref-type="bibr" rid="B19">1955</xref>), and when effort must be economized (Shah and Oppenheimer, <xref ref-type="bibr" rid="B18">2008</xref>). Yet it appears that the supercomputer&#x00027;s optimization led it to a simple strategy like the expert&#x00027;s, even when none of these arguments applied (Parpart et al., <xref ref-type="bibr" rid="B17">2017</xref>).</p>
<sec id="s1">
<title>Author contributions</title>
<p>The author confirms being the sole contributor of this work and approved it for publication.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Arnett</surname> <given-names>K.</given-names></name></person-group> (<year>2009</year>). <source>A Poker Life &#x02013; Matt Hawrilenko</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.cardplayer.com/poker-news/7826-a-poker-life-matt-hawrilenko">http://www.cardplayer.com/poker-news/7826-a-poker-life-matt-hawrilenko</ext-link></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bookstaber</surname> <given-names>R.</given-names></name> <name><surname>Langsam</surname> <given-names>J.</given-names></name></person-group> (<year>1985</year>). <article-title>On the optimality of coarse behavior rules</article-title>. <source>J. Theor. Biol.</source> <volume>116</volume>, <fpage>161</fpage>&#x02013;<lpage>193</lpage>. <pub-id pub-id-type="doi">10.1016/S0022-5193(85)80262-9</pub-id><pub-id pub-id-type="pmid">4058019</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowling</surname> <given-names>M.</given-names></name> <name><surname>Burch</surname> <given-names>N.</given-names></name> <name><surname>Johanson</surname> <given-names>M.</given-names></name> <name><surname>Tammelin</surname> <given-names>O.</given-names></name></person-group> (<year>2015</year>). <article-title>Heads-up limit hold&#x00027;em poker is solved</article-title>. <source>Science</source> <volume>347</volume>, <fpage>145</fpage>&#x02013;<lpage>149</lpage>. <pub-id pub-id-type="doi">10.1126/science.1259433</pub-id><pub-id pub-id-type="pmid">25574016</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Brodie</surname> <given-names>R.</given-names></name></person-group> (<year>2008</year>). <source>A Heads-Up for Human Poker Players</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.liontales.com/2008/07/">http://www.liontales.com/2008/07/</ext-link></citation></ref>
<ref id="B5">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Burch</surname> <given-names>N.</given-names></name></person-group> (<year>2015</year>). <source>Re: Computers Conquer Texas Hold&#x00027;em Poker for First Time</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://forumserver.twoplustwo.com/showpost.php?p=45771353&#x00026;postcount=31">http://forumserver.twoplustwo.com/showpost.php?p=45771353&#x00026;postcount=31</ext-link></citation></ref>
<ref id="B6">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>B.</given-names></name> <name><surname>Ankenman</surname> <given-names>J.</given-names></name></person-group> (<year>2006</year>). <source>The Mathematics of Poker</source>. <publisher-loc>Pittsburgh, PA</publisher-loc>: <publisher-name>ConJelCo LLC</publisher-name>.</citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chiappori</surname> <given-names>P.</given-names></name> <name><surname>Levitt</surname> <given-names>S.</given-names></name> <name><surname>Groseclose</surname> <given-names>T.</given-names></name></person-group> (<year>2002</year>). <article-title>Testing mixed-strategy equilibria when players are heterogeneous: the case of penalty kicks in soccer</article-title>. <source>Am. Econ. Rev.</source> <volume>92</volume>, <fpage>1138</fpage>&#x02013;<lpage>1151</lpage>. <pub-id pub-id-type="doi">10.1257/00028280260344678</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="web"><person-group person-group-type="author"><collab>CPRG</collab></person-group> (<year>2008</year>). <source>The Second Man-Machine Poker Competition</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://webdocs.cs.ualberta.ca/&#x0007E;games/poker/man-machine/">http://webdocs.cs.ualberta.ca/&#x0007E;games/poker/man-machine/</ext-link></citation></ref>
<ref id="B9">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Dreyfus</surname> <given-names>H. L.</given-names></name> <name><surname>Dreyfus</surname> <given-names>S. E.</given-names></name></person-group> (<year>1986</year>). <source>Mind Over Machine</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>The Free Press</publisher-name>.</citation></ref>
<ref id="B10">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gigerenzer</surname> <given-names>G.</given-names></name> <name><surname>Todd</surname> <given-names>P. M.</given-names></name> <collab>The ABC Research Group</collab></person-group> (<year>1999</year>). <source>Simple Heuristics That Make Us Smart</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>. <pub-id pub-id-type="pmid">11301545</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hertwig</surname> <given-names>R.</given-names></name> <name><surname>Hoffrage</surname> <given-names>U.</given-names></name> <collab>The ABC Research Group</collab></person-group> (<year>2013</year>). <source>Simple Heuristics in a Social World</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lake</surname> <given-names>B. M.</given-names></name> <name><surname>Ullman</surname> <given-names>T. D.</given-names></name> <name><surname>Tenenbaum</surname> <given-names>J. B.</given-names></name> <name><surname>Gershman</surname> <given-names>S. J.</given-names></name></person-group> (<year>2016</year>). <article-title>Building machines that learn and think like people</article-title>. <source>arXiv:1604.00289.</source><pub-id pub-id-type="pmid">27881212</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Nalbone</surname> <given-names>J.</given-names></name></person-group> (<year>2011</year>). <source>Princeton Grad Made Millions Playing Online Poker Until the Feds Upped the Ante</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.nj.com/mercer/index.ssf/2011/06/princeton_grad_was_making_a_fi.html">http://www.nj.com/mercer/index.ssf/2011/06/princeton_grad_was_making_a_fi.html</ext-link></citation></ref>
<ref id="B14">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Newall</surname> <given-names>P.</given-names></name></person-group> (<year>2011</year>). <source>The Intelligent Poker Player</source>. <publisher-loc>Las Vegas, NV</publisher-loc>: <publisher-name>Two Plus Two Publishing</publisher-name>.</citation></ref>
<ref id="B15">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Newall</surname> <given-names>P.</given-names></name></person-group> (<year>2013</year>). <source>Further Limit Hold&#x00027;em: Exploring the Model Poker Game</source>. <publisher-loc>Las Vegas, NV</publisher-loc>: <publisher-name>Two Plus Two Publishing</publisher-name>.</citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Palacios-Huerta</surname> <given-names>I.</given-names></name></person-group> (<year>2003</year>). <article-title>Professionals play minimax</article-title>. <source>Rev. Econ. Stud.</source> <volume>70</volume>, <fpage>395</fpage>&#x02013;<lpage>415</lpage>. <pub-id pub-id-type="doi">10.1111/1467-937X.00249</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Parpart</surname> <given-names>P.</given-names></name> <name><surname>Jones</surname> <given-names>M.</given-names></name> <name><surname>Love</surname> <given-names>B. C.</given-names></name></person-group> (<year>2017</year>). <source>Heuristics as Bayesian Inference under Extreme Priors</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://psyarxiv.com/qkbt5">https://psyarxiv.com/qkbt5</ext-link></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shah</surname> <given-names>A. K.</given-names></name> <name><surname>Oppenheimer</surname> <given-names>D. M.</given-names></name></person-group> (<year>2008</year>). <article-title>Heuristics made easy: an effort-reduction framework</article-title>. <source>Psychol. Bull.</source> <volume>134</volume>, <fpage>207</fpage>&#x02013;<lpage>222</lpage>. <pub-id pub-id-type="doi">10.1037/0033-2909.134.2.207</pub-id><pub-id pub-id-type="pmid">18298269</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simon</surname> <given-names>H. A.</given-names></name></person-group> (<year>1955</year>). <article-title>A behavioral model of rational choice</article-title>. <source>Q. J. Econ.</source> <volume>69</volume>, <fpage>99</fpage>&#x02013;<lpage>118</lpage>. <pub-id pub-id-type="doi">10.2307/1884852</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Sklansky</surname> <given-names>D.</given-names></name></person-group> (<year>1999</year>). <source>The Theory of Poker</source>. <publisher-loc>Las Vegas, NV</publisher-loc>: <publisher-name>Two Plus Two Publishing</publisher-name>.</citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walker</surname> <given-names>M.</given-names></name> <name><surname>Wooders</surname> <given-names>J.</given-names></name></person-group> (<year>2001</year>). <article-title>Minimax play at wimbledon</article-title>. <source>Am. Econ. Rev.</source> <volume>91</volume>, <fpage>1521</fpage>&#x02013;<lpage>1538</lpage>. <pub-id pub-id-type="doi">10.1257/aer.91.5.1521</pub-id></citation></ref>
</ref-list> 
</back>
</article>