<?xml version="1.0" encoding="UTF-8" standalone="yes"?><oembed><version><![CDATA[1.0]]></version><provider_name><![CDATA[Azimuth]]></provider_name><provider_url><![CDATA[https://johncarlosbaez.wordpress.com]]></provider_url><author_name><![CDATA[John Baez]]></author_name><author_url><![CDATA[https://johncarlosbaez.wordpress.com/author/johncarlosbaez/]]></author_url><title><![CDATA[The Mathematics of Biodiversity (Part&nbsp;1)]]></title><type><![CDATA[link]]></type><html><![CDATA[<p>I&#8217;m in Barcelona now, and I want to blog about this:</p>
<p>&bull; <a href="http://www.crm.cat/en/Activities/Pages/ActivityDescriptions/Research-Program-on-Mathematics-of-Biodiversity.aspx">Research Program on the Mathematics of Biodiversity</a>, June-July 2012, Centre de Recerca Matemàtica, Barcelona, Spain.  Organized by Ben Allen, Silvia Cuadrado, Tom Leinster, Richard Reeve and John Woolliams.</p>
<p>We&#8217;re having daily informal talks and there&#8217;s no way I can blog about all of them, talk to people here, and still get enough work done.  So, I&#8217;ll just mention a few things that strike me!  For example, this morning Lou Jost told me about an interesting paper by <a href="http://en.wikipedia.org/wiki/I._J._Good">I. J. Good</a>.  </p>
<p>I&#8217;d known of I. J. Good as one of the guys who came up with the concept of a &#8216;technological singularity&#8217;.  In 1963 he wrote:</p>
<blockquote><p>
Let an ultraintelligent machine be defined as a machine that can far surpass all the intellectual activities of any man however clever. Since the design of machines is one of these intellectual activities, an ultraintelligent machine could design even better machines; there would then unquestionably be an &#8216;intelligence explosion,&#8217; and the intelligence of man would be left far behind. Thus the first ultraintelligent machine is the last invention that man need ever make.
</p></blockquote>
<p>He was a British mathematician who worked as a cryptologist at Bletchley Park with Alan Turing.  After World War II, he continued to work with Turing on the design of computers and Bayesian statistics at the University of Manchester.  Later he moved to the US.  In 1968, thanks to his interest in artificial intelligence, he served as consultant for Stanley Kubrick&#8217;s film <i>2001: A Space Odyssey</i>.  He died in 2009.</p>
<p>Good was also a big chess enthusiast, and worked on writing programs to play chess. He&#8217;s the guy in front here:</p>
<div align="center"><a href="http://chessprogramming.wikispaces.com/Jack+Good?responseToken=088059b641f2f9d6dc6fe57e92d26c489"><img width="400" src="https://i1.wp.com/spec.lib.vt.edu/imagebase/vtarchive/screen/VTA0006.jpg" /></a></div>
<p>If you want to learn more about his work on chess, click on this photo!</p>
<p>But the paper Lou Jost mentioned is on a rather different subject:</p>
<p>&bull; Irving John Good, <a href="http://www.ling.upenn.edu/courses/cogs502/GoodTuring1953.pdf">The population frequency of species and the estimation of population parameters</a>, <i>Biometrika</i> <b>40</b> (1953), 237&#8211;264.</p>
<p>Let me just state one result, sloppily, without any details or precise hypotheses!  </p>
<p><b>Puzzle:</b> Suppose you go into the jungles of Ecuador and start collecting orchids.  You count the number of orchids of each different species that you find.  You get a list of numbers, something like this:</p>
<p><img src='https://s0.wp.com/latex.php?latex=14%2C+10%2C+8%2C+6%2C+2%2C+1%2C+1%2C+1&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='14, 10, 8, 6, 2, 1, 1, 1' title='14, 10, 8, 6, 2, 1, 1, 1' class='latex' /></p>
<p>What is the chance that the next orchid you find will belong to a new species?  </p>
<p>Good gives a rule of thumb for solving problems of this type:</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Cfrac%7Bn_1%7D%7BN%7D+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;frac{n_1}{N} }' title='&#92;displaystyle{ &#92;frac{n_1}{N} }' class='latex' /></p>
<p>Here <img src='https://s0.wp.com/latex.php?latex=N&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='N' title='N' class='latex' /> is the total number of orchid you collected, and <img src='https://s0.wp.com/latex.php?latex=n_i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n_i' title='n_i' class='latex' /> is the number of species for which you found exactly <img src='https://s0.wp.com/latex.php?latex=i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='i' title='i' class='latex' /> orchids of that species.  In our example, </p>
<p><img src='https://s0.wp.com/latex.php?latex=n_1+%3D+3&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n_1 = 3' title='n_1 = 3' class='latex' /></p>
<p>since we found just one orchid of three different species: those are the three 1&#8217;s at the end of our list.  Furthermore,</p>
<p><img src='https://s0.wp.com/latex.php?latex=N+%3D+14+%2B+10+%2B+8+%2B+6+%2B+2+%2B+1+%2B+1+%3D+42&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='N = 14 + 10 + 8 + 6 + 2 + 1 + 1 = 42' title='N = 14 + 10 + 8 + 6 + 2 + 1 + 1 = 42' class='latex' /></p>
<p>So here is Good&#8217;s estimate the chance that the next orchid you collect will be of a new species:</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Cfrac%7Bn_1%7D%7BN%7D+%3D+%5Cfrac%7B3%7D%7B42%7D+%7D+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;frac{n_1}{N} = &#92;frac{3}{42} } ' title='&#92;displaystyle{ &#92;frac{n_1}{N} = &#92;frac{3}{42} } ' class='latex' /></p>
<p>Good&#8217;s argument is nontrivial&#8212;and of course it depends on some assumptions on the nature of the distribution of populations of different species!  Since he doesn&#8217;t state these assumptions succinctly and I haven&#8217;t read the paper carefully yet, I&#8217;m afraid you&#8217;ll have to read the paper to find out what they are.</p>
<p>Of course the math works for samples of <i>anything</i> that comes in distinct types, not just species of organisms!  Good considers four examples: </p>
<p>&bull; moths captured in a light-trap at Rothamsted, England,</p>
<p>&bull; words in American newspapers,</p>
<p>&bull; nouns in Macaulay&#8217;s essay on Bacon, </p>
<p>&bull; chess openings in games published by the <i>British Chess Magazine</i> in 1951.</p>
<p>By comparing a small sample to a bigger one, he studies how well his rule works in practice, and apparently it does okay.  </p>
<p>In his paper, I. J. Good thanks Alan Turing for coming up with the basic idea.   In fact he says Turing gave an &#8216;intuitive demonstration&#8217; of it&#8212;but he doesn&#8217;t give this intuitive demonstration, and according to Lou Jost he actually admits somewhere that he forgot it.   </p>
<p>You can read more about the idea here:</p>
<p>&bull; <a href="http://en.wikipedia.org/wiki/Good%E2%80%93Turing_frequency_estimation">Good&#8211;Turing frequency estimation</a>.</p>
<p>By the way, <a href="http://www.loujost.com/">Lou Jost</a> is not only an expert on <a href="http://www.loujost.com/Statistics%20and%20Physics/Diversity%20and%20Similarity/JostEntropy%20AndDiversity.pdf">biodiversity and its relation to entropy</a>!  He lives in the jungles of Ecuador and has discovered over 60 new species of orchids, including <a href="http://en.wikinews.org/wiki/American_botanist_Lou_Jost_discovers_world%27s_smallest_orchid">the world&#8217;s smallest:</a></p>
<div align="center"><a href="http://en.wikinews.org/wiki/American_botanist_Lou_Jost_discovers_world%27s_smallest_orchid"><img src="https://i1.wp.com/upload.wikimedia.org/wikipedia/commons/thumb/8/85/Platystele_P5313ruler2.jpg/220px-Platystele_P5313ruler2.jpg" /></a></div>
<p>He found it in Ecuador, and the petals are just a few cells thick!  (Typically, the news reports say he found it in Bolivia and the petals are just one cell thick.)</p>
<p>He said:</p>
<blockquote><p>
I found it among the roots of another plant that I had collected, another small orchid which I took back to grow in my greenhouse to get it to flower. A few months later I saw that down among the roots was a tiny little plant that I realised was more interesting than the bigger orchid. Looking at the flower is often the best way to be able to identify which species of orchid you&#8217;ve got hold of – and can tell you whether you&#8217;re looking at an unknown species or not.
</p></blockquote>
]]></html><thumbnail_url><![CDATA[https://i1.wp.com/spec.lib.vt.edu/imagebase/vtarchive/screen/VTA0006.jpg?fit=440%2C330]]></thumbnail_url><thumbnail_height><![CDATA[330]]></thumbnail_height><thumbnail_width><![CDATA[243]]></thumbnail_width></oembed>