<?xml version="1.0" encoding="UTF-8" standalone="yes"?><oembed><version><![CDATA[1.0]]></version><provider_name><![CDATA[Azimuth]]></provider_name><provider_url><![CDATA[https://johncarlosbaez.wordpress.com]]></provider_url><author_name><![CDATA[John Baez]]></author_name><author_url><![CDATA[https://johncarlosbaez.wordpress.com/author/johncarlosbaez/]]></author_url><title><![CDATA[Information Geometry (Part&nbsp;13)]]></title><type><![CDATA[link]]></type><html><![CDATA[<p><a href="https://johncarlosbaez.wordpress.com/2012/06/24/information-geometry-part-12/">Last time</a> I gave a sketchy overview of evolutionary game theory.  Now let&#8217;s get serious.</p>
<p>I&#8217;ll start by explaining &#8216;Nash equilibria&#8217; for 2-person games.  These are situations where neither player can profit by changing what they&#8217;re doing.  Then I&#8217;ll introduce &#8216;mixed strategies&#8217;, where the players can choose among several strategies with different probabilities.  Then I&#8217;ll introduce evolutionary game theory, where we think of each strategy as a <i>species</i>, and its probability as <i>the fraction of organisms that belong to that species</i>.</p>
<p>Back in <a href="https://johncarlosbaez.wordpress.com/2012/06/01/information-geometry-part-9/">Part 9</a>, I told you about the &#8216;replicator equation&#8217;, which says how these fractions change with time thanks to natural selection.   Now we&#8217;ll see how this leads to the idea of an &#8216;evolutionarily stable strategy&#8217;.   And finally, we&#8217;ll see that when evolution takes us toward such a stable strategy, the amount of information the organisms have &#8216;left to learn&#8217; keeps decreasing!</p>
<h3> Nash equilibria </h3>
<p>We can describe a certain kind of two-person game using a <b>payoff matrix</b>, which is an <img src='https://s0.wp.com/latex.php?latex=n+%5Ctimes+n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n &#92;times n' title='n &#92;times n' class='latex' /> matrix <img src='https://s0.wp.com/latex.php?latex=A_%7Bij%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{ij}' title='A_{ij}' class='latex' /> of real numbers.  We think of <img src='https://s0.wp.com/latex.php?latex=A_%7Bij%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{ij}' title='A_{ij}' class='latex' /> as the payoff that either player gets if they choose strategy <img src='https://s0.wp.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i' title='i' class='latex' /> and their opponent chooses strategy <img src='https://s0.wp.com/latex.php?latex=j.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='j.' title='j.' class='latex' /></p>
<p>Note that in this kind of game, there&#8217;s no significant difference between the &#8216;first player&#8217; and the &#8216;second player&#8217;: <i>either</i> player wins an amount <img src='https://s0.wp.com/latex.php?latex=A_%7Bij%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{ij}' title='A_{ij}' class='latex' /> if they choose strategy <img src='https://s0.wp.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i' title='i' class='latex' /> and their opponent chooses strategy <img src='https://s0.wp.com/latex.php?latex=j.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='j.' title='j.' class='latex' />  So, this kind of game is called <b>symmetric</b> even though the matrix <img src='https://s0.wp.com/latex.php?latex=A_%7Bij%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{ij}' title='A_{ij}' class='latex' /> may not be symmetric.  Indeed, it&#8217;s common for this matrix to be antisymmetric, meaning <img src='https://s0.wp.com/latex.php?latex=A_%7Bij%7D+%3D+-+A_%7Bji%7D%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{ij} = - A_{ji},' title='A_{ij} = - A_{ji},' class='latex' /> since in this case what one player wins, the other loses.  Games with this extra property are called <b>zero-sum games</b>.  But we won&#8217;t limit ourselves to those!</p>
<p>We say a strategy <img src='https://s0.wp.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i' title='i' class='latex' /> is a <b>symmetric Nash equilibrium</b> if</p>
<p><img src='https://s0.wp.com/latex.php?latex=A_%7Bii%7D+%5Cge+A_%7Bji%7D+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{ii} &#92;ge A_{ji} ' title='A_{ii} &#92;ge A_{ji} ' class='latex' /></p>
<p>for all <img src='https://s0.wp.com/latex.php?latex=j.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='j.' title='j.' class='latex' />  This means that if both players use strategy <img src='https://s0.wp.com/latex.php?latex=i%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i,' title='i,' class='latex' /> neither gains anything by switching to another strategy.</p>
<p>For example, suppose our matrix is</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cleft%28+%5Cbegin%7Barray%7D%7Brr%7D+-1+%26+-12+%5C%5C++0+%26+-3+%5Cend%7Barray%7D+%5Cright%29+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;left( &#92;begin{array}{rr} -1 &amp; -12 &#92;&#92;  0 &amp; -3 &#92;end{array} &#92;right) ' title='&#92;left( &#92;begin{array}{rr} -1 &amp; -12 &#92;&#92;  0 &amp; -3 &#92;end{array} &#92;right) ' class='latex' /></p>
<p>Then we&#8217;ve got the Prisoner&#8217;s Dilemma exactly as described last time!  Here strategy 1 is <b>cooperate</b> and strategy 2 is <b>defect</b>.  If a player cooperates and so does his opponent, he wins</p>
<p><img src='https://s0.wp.com/latex.php?latex=A_%7B11%7D+%3D+-1+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{11} = -1 ' title='A_{11} = -1 ' class='latex' /></p>
<p>meaning he gets one month in jail.   We include a minus sign because &#8216;winning a month in jail&#8217; is not a good thing.   If the player cooperates but his opponent defects, he gets a whole year in jail:</p>
<p><img src='https://s0.wp.com/latex.php?latex=A_%7B12%7D+%3D+-12&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{12} = -12' title='A_{12} = -12' class='latex' /></p>
<p>If he defects but his opponent cooperates, he doesn&#8217;t go to jail at all:</p>
<p><img src='https://s0.wp.com/latex.php?latex=A_%7B21%7D+%3D+0&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{21} = 0' title='A_{21} = 0' class='latex' /></p>
<p>And if they both defect, they both get three months in jail:</p>
<p><img src='https://s0.wp.com/latex.php?latex=A_%7B22%7D+%3D+-3&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{22} = -3' title='A_{22} = -3' class='latex' /></p>
<p>You can see that defecting is a Nash equilibrium, since</p>
<p><img src='https://s0.wp.com/latex.php?latex=A_%7B22%7D+%5Cge+A_%7B12%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{22} &#92;ge A_{12}' title='A_{22} &#92;ge A_{12}' class='latex' /></p>
<p>So, oddly, if our prisoners know game theory and believe Nash equilibria are best, they&#8217;ll both be worse off than if they cooperate and don&#8217;t betray each other.</p>
<div align="center">
<img width="170" src="http://math.ucr.edu/home/baez/prisoner's_dilemma_left.jpg" /> &nbsp;&nbsp;&nbsp;&nbsp; <img width="170" src="http://math.ucr.edu/home/baez/prisoner's_dilemma_right.jpg" /></div>
<h3> Nash equilibria  for mixed strategies </h3>
<p>So far we&#8217;ve been assuming that with 100% certainty, each player chooses one strategy <img src='https://s0.wp.com/latex.php?latex=i+%3D+1%2C2%2C3%2C%5Cdots%2C+n.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i = 1,2,3,&#92;dots, n.' title='i = 1,2,3,&#92;dots, n.' class='latex' />  Since we&#8217;ll be considering more general strategies in a minute, let&#8217;s call these <b>pure strategies</b>.</p>
<p>Now let&#8217;s throw some probability theory into the stew!  Let&#8217;s allow the players to pick different pure strategies with different probabilities.  So, we define a <b>mixed strategy</b> to be a probability distribution on the set of pure strategies.  In other words, it&#8217;s a list of <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> nonnegative numbers</p>
<p><img src='https://s0.wp.com/latex.php?latex=p_i+%5Cge+0+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p_i &#92;ge 0 ' title='p_i &#92;ge 0 ' class='latex' /></p>
<p>that sum to one:</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Csum_%7Bi%3D1%7D%5En+p_i+%3D+1+%7D+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;sum_{i=1}^n p_i = 1 } ' title='&#92;displaystyle{ &#92;sum_{i=1}^n p_i = 1 } ' class='latex' /></p>
<p>Say I choose the mixed strategy <img src='https://s0.wp.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p' title='p' class='latex' /> while you, my opponent, choose the mixed strategy <img src='https://s0.wp.com/latex.php?latex=q.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q.' title='q.' class='latex' />  Say our choices are made independently.  Then the probability that I choose the pure strategy <img src='https://s0.wp.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i' title='i' class='latex' /> while you chose <img src='https://s0.wp.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='j' title='j' class='latex' /> is</p>
<p><img src='https://s0.wp.com/latex.php?latex=p_i+q_j&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p_i q_j' title='p_i q_j' class='latex' /></p>
<p>so the expected value of my winnings is</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Csum_%7Bi%2Cj+%3D+1%7D%5En+p_i+A_%7Bij%7D+q_j+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;sum_{i,j = 1}^n p_i A_{ij} q_j }' title='&#92;displaystyle{ &#92;sum_{i,j = 1}^n p_i A_{ij} q_j }' class='latex' /></p>
<p>or using vector notation</p>
<p><img src='https://s0.wp.com/latex.php?latex=p+%5Ccdot+A+q+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p &#92;cdot A q ' title='p &#92;cdot A q ' class='latex' /></p>
<p>where the dot is the usual dot product on <img src='https://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D%5En.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathbb{R}^n.' title='&#92;mathbb{R}^n.' class='latex' /></p>
<p>We can easily adapt the concept of Nash equilibrium to mixed strategies.  A mixed strategy <img src='https://s0.wp.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q' title='q' class='latex' /> is a <b>symmetric  Nash equilibrium</b> if for any other mixed strategy <img src='https://s0.wp.com/latex.php?latex=p%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p,' title='p,' class='latex' /></p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+q+%5Cge++p+%5Ccdot+A+q+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A q &#92;ge  p &#92;cdot A q ' title='q &#92;cdot A q &#92;ge  p &#92;cdot A q ' class='latex' /></p>
<p>This means that if both you and I are playing the mixed strategy <img src='https://s0.wp.com/latex.php?latex=q%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q,' title='q,' class='latex' /> I can&#8217;t improve my expected winnings by unilaterally switching to the mixed strategy <img src='https://s0.wp.com/latex.php?latex=p.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p.' title='p.' class='latex' />  And neither can you, because the game is symmetric!</p>
<p>If this were a course on game theory, I would now do some examples.  But it&#8217;s not, so I&#8217;ll just send you to page 6 of <a href="http://www.ssc.wisc.edu/~whs/research/egt.pdf">Sandholm&#8217;s paper</a>: he looks at some famous games like &#8216;hawks and doves&#8217; and &#8216;rock paper scissors&#8217;.</p>
<h3> Evolutionarily stable strategies </h3>
<p>We&#8217;re finally ready to discuss evolutionarily stable strategies.  To do this, let&#8217;s reinterpret the &#8216;pure strategies&#8217; <img src='https://s0.wp.com/latex.php?latex=i+%3D+1%2C2%2C3%2C+%5Cdots+n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i = 1,2,3, &#92;dots n' title='i = 1,2,3, &#92;dots n' class='latex' /> as <b>species</b>.  Here I don&#8217;t necessarily mean species in the classic biological sense: I just mean different kinds of self-replicating entities, or <b>replicators</b>.  For example, they could be different <a href="http://en.wikipedia.org/wiki/Allele">alleles</a> of the same gene.</p>
<p>Similarly, we&#8217;ll reinterpret the &#8216;mixed strategy&#8217; <img src='https://s0.wp.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p' title='p' class='latex' /> as describing a mixed population of replicators, where the fraction of replicators belonging to the <img src='https://s0.wp.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i' title='i' class='latex' />th species is <img src='https://s0.wp.com/latex.php?latex=p_i.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p_i.' title='p_i.' class='latex' />  These numbers are still probabilities: <img src='https://s0.wp.com/latex.php?latex=p_i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p_i' title='p_i' class='latex' /> is the probability that a randomly chosen replicator will belong to the <img src='https://s0.wp.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i' title='i' class='latex' />th species.</p>
<p>We&#8217;ll reinterpret the payoff matrix <img src='https://s0.wp.com/latex.php?latex=A_%7Bij%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_{ij}' title='A_{ij}' class='latex' /> as a <b>fitness matrix</b>.  In our earlier discussion of the replicator equation, we assumed that the population <img src='https://s0.wp.com/latex.php?latex=P_i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P_i' title='P_i' class='latex' /> of the <img src='https://s0.wp.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i' title='i' class='latex' />th species grew according to the replicator equation</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Cfrac%7Bd+P_i%7D%7Bd+t%7D+%3D+f_i%28P_1%2C+%5Cdots%2C+P_n%29+P_i+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;frac{d P_i}{d t} = f_i(P_1, &#92;dots, P_n) P_i }' title='&#92;displaystyle{ &#92;frac{d P_i}{d t} = f_i(P_1, &#92;dots, P_n) P_i }' class='latex' /></p>
<p>where the <b>fitness function</b> <img src='https://s0.wp.com/latex.php?latex=f_i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='f_i' title='f_i' class='latex' /> is any smooth function of the populations of each kind of replicator.</p>
<p>But in evolutionary game theory it&#8217;s common to start by looking at a simple special case where</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7Bf_i%28P_1%2C+%5Cdots%2C+P_n%29+++%3D+%5Csum_%7Bj%3D1%7D%5En+A_%7Bij%7D+p_j+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{f_i(P_1, &#92;dots, P_n)   = &#92;sum_{j=1}^n A_{ij} p_j }' title='&#92;displaystyle{f_i(P_1, &#92;dots, P_n)   = &#92;sum_{j=1}^n A_{ij} p_j }' class='latex' /></p>
<p>where</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+p_j+%3D+%5Cfrac%7BP_j%7D%7B%5Csum_k+P_k%7D+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ p_j = &#92;frac{P_j}{&#92;sum_k P_k} }' title='&#92;displaystyle{ p_j = &#92;frac{P_j}{&#92;sum_k P_k} }' class='latex' /></p>
<p>is the fraction of replicators who belong to the <img src='https://s0.wp.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='j' title='j' class='latex' />th species.</p>
<p>What does this mean?  The idea is that we have a well-mixed population of game players&#8212;or replicators.   Each one has its own pure strategy&#8212;or species.   Each one randomly roams around and &#8216;plays games&#8217; with each other replicator it meets.  It gets to reproduce at a rate proportional to its expected winnings.</p>
<p>This is unrealistic in all sorts of ways, but it&#8217;s mathematically cute, and it&#8217;s been studied a lot, so it&#8217;s good to know about.  Today I&#8217;ll explain evolutionarily stable strategies only in this special case.  Later I&#8217;ll go back to the general case.</p>
<p>Suppose that we select a sample of replicators from the overall population.  What is the mean fitness of the replicators in this sample?  For this, we need to know the probability that a replicator from this sample belongs to the <img src='https://s0.wp.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i' title='i' class='latex' />th species.  Say it&#8217;s <img src='https://s0.wp.com/latex.php?latex=q_j.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q_j.' title='q_j.' class='latex' />  Then the mean fitness of our sample is</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Csum_%7Bi%2Cj%3D1%7D%5En+q_i+A_%7Bij%7D+p_j+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;sum_{i,j=1}^n q_i A_{ij} p_j }' title='&#92;displaystyle{ &#92;sum_{i,j=1}^n q_i A_{ij} p_j }' class='latex' /></p>
<p>This is just a weighted average of the fitnesses in our earlier formula.   But using the magic of vectors, we can write this sum as</p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+p+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A p ' title='q &#92;cdot A p ' class='latex' /></p>
<p>We already saw this type of expression in the last section!  It&#8217;s my expected winnings if I play the mixed strategy <img src='https://s0.wp.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q' title='q' class='latex' /> and you play the mixed strategy <img src='https://s0.wp.com/latex.php?latex=p.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p.' title='p.' class='latex' /></p>
<p>John Maynard Smith defined <img src='https://s0.wp.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q' title='q' class='latex' /> to be <b>evolutionarily stable strategy</b> if when we add a small population of &#8216;invaders&#8217; distributed according to any other probability distribution <img src='https://s0.wp.com/latex.php?latex=p%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p,' title='p,' class='latex' /> the original population is more fit than the invaders.</p>
<p>In simple terms: a small &#8216;invading&#8217; population will do worse than the population as a whole.</p>
<p>Mathematically, this means:</p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+%28%281-%5Cepsilon%29q+%2B+%5Cepsilon+p%29+%3E++p+%5Ccdot+A+%28%281-%5Cepsilon%29q+%2B+%5Cepsilon+p%29+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A ((1-&#92;epsilon)q + &#92;epsilon p) &gt;  p &#92;cdot A ((1-&#92;epsilon)q + &#92;epsilon p) ' title='q &#92;cdot A ((1-&#92;epsilon)q + &#92;epsilon p) &gt;  p &#92;cdot A ((1-&#92;epsilon)q + &#92;epsilon p) ' class='latex' /></p>
<p>for all mixed strategies <img src='https://s0.wp.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p' title='p' class='latex' /> and all sufficiently small <img src='https://s0.wp.com/latex.php?latex=%5Cepsilon+%5Cge+0+.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;epsilon &#92;ge 0 .' title='&#92;epsilon &#92;ge 0 .' class='latex' />   Here</p>
<p><img src='https://s0.wp.com/latex.php?latex=%281-%5Cepsilon%29q+%2B+%5Cepsilon+p+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='(1-&#92;epsilon)q + &#92;epsilon p ' title='(1-&#92;epsilon)q + &#92;epsilon p ' class='latex' /></p>
<p>is the population we get by replacing an <img src='https://s0.wp.com/latex.php?latex=%5Cepsilon&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;epsilon' title='&#92;epsilon' class='latex' />-sized portion of our original population by invaders.</p>
<p><b>Puzzle:</b>  Show that <img src='https://s0.wp.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q' title='q' class='latex' /> is an evolutionarily stable strategy if and only these two conditions hold for all mixed stategies <img src='https://s0.wp.com/latex.php?latex=p%3A&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p:' title='p:' class='latex' /></p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+q+%5Cge+p+%5Ccdot+A+q&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A q &#92;ge p &#92;cdot A q' title='q &#92;cdot A q &#92;ge p &#92;cdot A q' class='latex' /></p>
<p>and also, for all <img src='https://s0.wp.com/latex.php?latex=q+%5Cne+p&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;ne p' title='q &#92;ne p' class='latex' />,</p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+q+%3D+p+%5Ccdot+A+q+%5C%3B+%5Cimplies+%5C%3B+q+%5Ccdot+A+p+%3E+p+%5Ccdot+A+p&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A q = p &#92;cdot A q &#92;; &#92;implies &#92;; q &#92;cdot A p &gt; p &#92;cdot A p' title='q &#92;cdot A q = p &#92;cdot A q &#92;; &#92;implies &#92;; q &#92;cdot A p &gt; p &#92;cdot A p' class='latex' /></p>
<p>The first condition says that <img src='https://s0.wp.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q' title='q' class='latex' /> is a symmetric Nash equilibrium.  In other words, the invaders can&#8217;t on average be <i>better</i> playing against the original population than members of the original population are.    The second says that if the invaders are <i>just as good</i> at playing against the original population, they must be worse at playing against each other!  The combination of these conditions means the invaders won&#8217;t take over.</p>
<p>Again, I should do some examples&#8230; but instead I&#8217;ll refer you to page 9 of <a href="http://www.ssc.wisc.edu/~whs/research/egt.pdf">Sandholm&#8217;s paper</a>, and also these course notes:</p>
<p>&bull; Samuel Alizon and Daniel Cownden, <a href="http://dcownden.files.wordpress.com/2009/01/notes6.pdf">Evolutionary games and evolutionarily stable strategies</a>.</p>
<p>&bull; Samuel Alizon and Daniel Cownden, <a href="http://dcownden.files.wordpress.com/2009/01/notes8.pdf">Replicator dynamics</a>.</p>
<h3> The decrease of relative information </h3>
<p>Now comes the punchline&#8230; but with a slight surprise twist at the end.  <a href="https://johncarlosbaez.wordpress.com/2012/06/07/information-geometry-part-11/">Last time</a> we let</p>
<p><img src='https://s0.wp.com/latex.php?latex=P+%3D+%28P_1%2C+%5Cdots+%2C+P_n%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P = (P_1, &#92;dots , P_n)' title='P = (P_1, &#92;dots , P_n)' class='latex' /></p>
<p>be a population that evolves with time according to the replicator equation, and we let <img src='https://s0.wp.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p' title='p' class='latex' /> be the corresponding probability distribution.  We supposed <img src='https://s0.wp.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q' title='q' class='latex' /> was some fixed probability distribution.   We saw that the relative information</p>
<p><img src='https://s0.wp.com/latex.php?latex=I%28q%2Cp%29+%3D+%5Cdisplaystyle%7B+%5Csum_i+%5Cln+%5Cleft%28%5Cfrac%7Bq_i%7D%7B+p_i+%7D%5Cright%29+q_i+%7D+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='I(q,p) = &#92;displaystyle{ &#92;sum_i &#92;ln &#92;left(&#92;frac{q_i}{ p_i }&#92;right) q_i } ' title='I(q,p) = &#92;displaystyle{ &#92;sum_i &#92;ln &#92;left(&#92;frac{q_i}{ p_i }&#92;right) q_i } ' class='latex' /></p>
<p>obeys</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Cfrac%7Bd%7D%7Bdt%7D+I%28q%2Cp%29+%3D++%28p+-+q%29+%7D+%5Ccdot+f%28P%29+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;frac{d}{dt} I(q,p) =  (p - q) } &#92;cdot f(P) ' title='&#92;displaystyle{ &#92;frac{d}{dt} I(q,p) =  (p - q) } &#92;cdot f(P) ' class='latex' /></p>
<p>where <img src='https://s0.wp.com/latex.php?latex=f%28P%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='f(P)' title='f(P)' class='latex' /> is the vector of fitness functions.  So, this relative information can never increase if</p>
<p><img src='https://s0.wp.com/latex.php?latex=%28p+-+q%29+%5Ccdot+f%28P%29+%5Cle+0+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='(p - q) &#92;cdot f(P) &#92;le 0 ' title='(p - q) &#92;cdot f(P) &#92;le 0 ' class='latex' /></p>
<p>for all <img src='https://s0.wp.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P' title='P' class='latex' />.</p>
<p>We can adapt this to the special case we&#8217;re looking at now.  Remember, right now we&#8217;re assuming</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7Bf_i%28P_1%2C+%5Cdots%2C+P_n%29+++%3D+%5Csum_%7Bj%3D1%7D%5En+A_%7Bij%7D+p_j+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{f_i(P_1, &#92;dots, P_n)   = &#92;sum_{j=1}^n A_{ij} p_j }' title='&#92;displaystyle{f_i(P_1, &#92;dots, P_n)   = &#92;sum_{j=1}^n A_{ij} p_j }' class='latex' /></p>
<p>so</p>
<p><img src='https://s0.wp.com/latex.php?latex=f%28P%29+%3D+A+p+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='f(P) = A p ' title='f(P) = A p ' class='latex' /></p>
<p>Thus, the relative information will never increase if</p>
<p><img src='https://s0.wp.com/latex.php?latex=%28p+-+q%29+%5Ccdot+A+p+%5Cle+0&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='(p - q) &#92;cdot A p &#92;le 0' title='(p - q) &#92;cdot A p &#92;le 0' class='latex' /></p>
<p>or in other words,</p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+p+%5Cge+p+%5Ccdot+A+p++%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad++%281%29+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A p &#92;ge p &#92;cdot A p  &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad  (1) ' title='q &#92;cdot A p &#92;ge p &#92;cdot A p  &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad  (1) ' class='latex' /></p>
<p>Now, this looks very similar to the conditions for an evolutionary stable strategy as stated in the Puzzle above.  <i>But it&#8217;s not the same!</i>  That&#8217;s the surprise twist.</p>
<p>Remember, the Puzzle says that <img src='https://s0.wp.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q' title='q' class='latex' /> is an evolutionarily stable state if for all mixed strategies <img src='https://s0.wp.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p' title='p' class='latex' /> we have</p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+q+%5Cge+p+%5Ccdot+A+q++%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%282%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A q &#92;ge p &#92;cdot A q  &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad (2)' title='q &#92;cdot A q &#92;ge p &#92;cdot A q  &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad (2)' class='latex' /></p>
<p>and also</p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+q+%3D+p+%5Ccdot+A+q+%5C%3B+%5Cimplies+%5C%3B+q+%5Ccdot+A+p+%3E+p+%5Ccdot+A+p++%5Cqquad+%5C%3B+%283%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A q = p &#92;cdot A q &#92;; &#92;implies &#92;; q &#92;cdot A p &gt; p &#92;cdot A p  &#92;qquad &#92;; (3)' title='q &#92;cdot A q = p &#92;cdot A q &#92;; &#92;implies &#92;; q &#92;cdot A p &gt; p &#92;cdot A p  &#92;qquad &#92;; (3)' class='latex' /></p>
<p>Note that condition (1), the one we want, is <i>neither</i> condition (2) <i>nor</i> condition (3)!  This drove me crazy for almost a day.</p>
<div align="center">
<img src="https://i1.wp.com/math.ucr.edu/home/baez/emoticons/spiral_eyes.gif" alt="" /></div>
<p>I kept thinking I&#8217;d made a mistake, like mixing up <img src='https://s0.wp.com/latex.php?latex=p&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='p' title='p' class='latex' /> and <img src='https://s0.wp.com/latex.php?latex=q&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q' title='q' class='latex' /> somewhere.  You&#8217;ve got to mind your p&#8217;s and q&#8217;s in this game!</p>
<p>But the solution turned out to be this.  After Maynard Smith came up with his definition of &#8216;evolutionarily stable state&#8217;, another guy came up with a different definition:</p>
<p>&bull; Bernhard Thomas, On evolutionarily stable sets, <i><a href="http://www.springerlink.com/content/g7812u72h6110m26/?MUD=MP">J. Math. Biology</a></i> <b>22</b> (1985), 105&#8211;115.</p>
<p>For him, an <b>evolutionarily stable strategy</b> obeys</p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+q+%5Cge+p+%5Ccdot+A+q++%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%282%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A q &#92;ge p &#92;cdot A q  &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad (2)' title='q &#92;cdot A q &#92;ge p &#92;cdot A q  &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad (2)' class='latex' /></p>
<p>and also</p>
<p><img src='https://s0.wp.com/latex.php?latex=q+%5Ccdot+A+p+%5Cge+p+%5Ccdot+A+p++%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad+%5Cqquad++%281%29+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='q &#92;cdot A p &#92;ge p &#92;cdot A p  &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad  (1) ' title='q &#92;cdot A p &#92;ge p &#92;cdot A p  &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad &#92;qquad  (1) ' class='latex' /></p>
<p>Condition (1) is stronger than condition (3), so he renamed Maynard Smith&#8217;s evolutionarily stable strategies <b>weakly  evolutionarily stable strategies</b>.  And condition (1) guarantees that the relative information <img src='https://s0.wp.com/latex.php?latex=I%28q%2Cp%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='I(q,p)' title='I(q,p)' class='latex' /> can never increase.  So, now we&#8217;re happy.</p>
<p>Except for one thing: why should we switch from Maynard Smith&#8217;s perfectly sensible concept of evolutionarily stable state to this new stronger one?  I don&#8217;t really know, except that</p>
<p>&bull; it&#8217;s not much stronger</p>
<p>and</p>
<p>&bull; it lets us prove the theorem we want!</p>
<p>So, it&#8217;s a small mystery for me to mull over.  If you have any good ideas, let me know.</p>
]]></html><thumbnail_url><![CDATA[http://math.ucr.edu/home/baez/prisoner]]></thumbnail_url><thumbnail_height><![CDATA[]]></thumbnail_height><thumbnail_width><![CDATA[]]></thumbnail_width></oembed>