<?xml version="1.0" encoding="UTF-8" standalone="yes"?><oembed><version><![CDATA[1.0]]></version><provider_name><![CDATA[Azimuth]]></provider_name><provider_url><![CDATA[https://johncarlosbaez.wordpress.com]]></provider_url><author_name><![CDATA[John Baez]]></author_name><author_url><![CDATA[https://johncarlosbaez.wordpress.com/author/johncarlosbaez/]]></author_url><title><![CDATA[Maximum Entropy and&nbsp;Ecology]]></title><type><![CDATA[link]]></type><html><![CDATA[<p>I already talked about <a href="https://johncarlosbaez.wordpress.com/2012/10/27/john-harte/">John Harte&#8217;s book on how to stop global warming</a>.   Since I&#8217;m trying to apply information theory and thermodynamics to ecology, I was also interested in this book of his:</p>
<p>&bull; <a href="http://socrates.berkeley.edu/~hartelab/">John Harte</a>, <i>Maximum Entropy and Ecology</i>, Oxford U. Press, Oxford, 2011.</p>
<p>There&#8217;s a lot in this book, and I haven&#8217;t absorbed it all, but let me try to briefly summarize his <b>maximum entropy theory of ecology</b>. This aims to be &#8220;a comprehensive, parsimonious, and testable theory of the distribution, abundance, and energetics of species across spatial scales&#8221;.  One great thing is that he makes quantitative predictions using this theory and compares them to a lot of real-world data.  But let me just tell you about the theory.  </p>
<p>It&#8217;s heavily based on the <a href="http://en.wikipedia.org/wiki/Principle_of_maximum_entropy">principle of maximum entropy</a> (MaxEnt for short), and there are two parts:</p>
<blockquote><p>
Two MaxEnt calculations are at the core of the theory: the first yields all the metrics that describe abundance and energy distributions, and the second describes the spatial scaling properties of species&#8217; distributions.
</p></blockquote>
<h3> Abundance and energy distributions </h3>
<p>The first part of Harte&#8217;s theory is all about a conditional probability distribution </p>
<p><img src='https://s0.wp.com/latex.php?latex=R%28n%2C%5Cepsilon+%7C+S_0%2C+N_0%2C+E_0%29+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='R(n,&#92;epsilon | S_0, N_0, E_0) ' title='R(n,&#92;epsilon | S_0, N_0, E_0) ' class='latex' /></p>
<p>which he calls the <b>ecosystem structure function</b>.  Here:</p>
<p>&bull; <img src='https://s0.wp.com/latex.php?latex=S_0&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='S_0' title='S_0' class='latex' />: the total number of species under consideration in some area.</p>
<p>&bull; <img src='https://s0.wp.com/latex.php?latex=N_0&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='N_0' title='N_0' class='latex' />: the total number of individuals under consideration in that area.</p>
<p>&bull; <img src='https://s0.wp.com/latex.php?latex=E_0&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='E_0' title='E_0' class='latex' />: the total rate of metabolic energy consumption of all these individuals.</p>
<p>Given this, </p>
<p><img src='https://s0.wp.com/latex.php?latex=R%28n%2C%5Cepsilon+%7C+S_0%2C+N_0%2C+E_0%29+%5C%2C+d+%5Cepsilon+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='R(n,&#92;epsilon | S_0, N_0, E_0) &#92;, d &#92;epsilon ' title='R(n,&#92;epsilon | S_0, N_0, E_0) &#92;, d &#92;epsilon ' class='latex' /></p>
<p>is the probability that given <img src='https://s0.wp.com/latex.php?latex=S_0%2C+N_0%2C+E_0%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='S_0, N_0, E_0,' title='S_0, N_0, E_0,' class='latex' /> if a species is picked from the collection of species, then it has <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> individuals, and if an individual is picked at random from that species, then its rate of metabolic energy consumption is in the interval <img src='https://s0.wp.com/latex.php?latex=%28%5Cepsilon%2C+%5Cepsilon+%2B+d+%5Cepsilon%29.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='(&#92;epsilon, &#92;epsilon + d &#92;epsilon).' title='(&#92;epsilon, &#92;epsilon + d &#92;epsilon).' class='latex' /></p>
<p>Here of course <img src='https://s0.wp.com/latex.php?latex=d+%5Cepsilon&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='d &#92;epsilon' title='d &#92;epsilon' class='latex' /> is &#8216;infinitesimal&#8217;, meaning that we take a limit where it goes to zero to make this idea precise (if we&#8217;re doing analytical work) or take it to be very small (if we&#8217;re estimating <img src='https://s0.wp.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='R' title='R' class='latex' /> from data).  </p>
<p>I believe that when we &#8216;pick a species&#8217; we&#8217;re treating them all as equally probable, not weighting them according to their number of individuals.  </p>
<p>Clearly <img src='https://s0.wp.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='R' title='R' class='latex' /> obeys some constraints.  First, since it&#8217;s a probability distribution, it obeys the normalization condition:</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Csum_n+%5Cint+d+%5Cepsilon+%5C%3B+R%28n%2C%5Cepsilon+%7C+S_0%2C+N_0%2C+E_0%29+%3D+1+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;sum_n &#92;int d &#92;epsilon &#92;; R(n,&#92;epsilon | S_0, N_0, E_0) = 1 }' title='&#92;displaystyle{ &#92;sum_n &#92;int d &#92;epsilon &#92;; R(n,&#92;epsilon | S_0, N_0, E_0) = 1 }' class='latex' /></p>
<p>Second, since the average number of individuals per species is <img src='https://s0.wp.com/latex.php?latex=N_0%2FS_0%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='N_0/S_0,' title='N_0/S_0,' class='latex' /> we have:</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Csum_n+%5Cint+d+%5Cepsilon+%5C%3B+n+R%28n%2C%5Cepsilon+%7C+S_0%2C+N_0%2C+E_0%29+%3D+N_0+%2F+S_0+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;sum_n &#92;int d &#92;epsilon &#92;; n R(n,&#92;epsilon | S_0, N_0, E_0) = N_0 / S_0 }' title='&#92;displaystyle{ &#92;sum_n &#92;int d &#92;epsilon &#92;; n R(n,&#92;epsilon | S_0, N_0, E_0) = N_0 / S_0 }' class='latex' /></p>
<p>Third, since the average over species of the total rate of metabolic energy consumption of individuals within the species is <img src='https://s0.wp.com/latex.php?latex=E_0%2F+S_0%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='E_0/ S_0,' title='E_0/ S_0,' class='latex' /> we have:</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Csum_n+%5Cint+d+%5Cepsilon+%5C%3B+n+%5Cepsilon+R%28n%2C%5Cepsilon+%7C+S_0%2C+N_0%2C+E_0%29+%3D+E_0+%2F+S_0+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;sum_n &#92;int d &#92;epsilon &#92;; n &#92;epsilon R(n,&#92;epsilon | S_0, N_0, E_0) = E_0 / S_0 }' title='&#92;displaystyle{ &#92;sum_n &#92;int d &#92;epsilon &#92;; n &#92;epsilon R(n,&#92;epsilon | S_0, N_0, E_0) = E_0 / S_0 }' class='latex' /></p>
<p>Harte&#8217;s theory is that <img src='https://s0.wp.com/latex.php?latex=R&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='R' title='R' class='latex' /> <i>maximizes entropy subject to these three constraints</i>.  Here entropy is defined by</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+-+%5Csum_n+%5Cint+d+%5Cepsilon+%5C%3B+R%28n%2C%5Cepsilon+%7C+S_0%2C+N_0%2C+E_0%29+%5Cln%28R%28n%2C%5Cepsilon+%7C+S_0%2C+N_0%2C+E_0%29%29+%7D+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ - &#92;sum_n &#92;int d &#92;epsilon &#92;; R(n,&#92;epsilon | S_0, N_0, E_0) &#92;ln(R(n,&#92;epsilon | S_0, N_0, E_0)) } ' title='&#92;displaystyle{ - &#92;sum_n &#92;int d &#92;epsilon &#92;; R(n,&#92;epsilon | S_0, N_0, E_0) &#92;ln(R(n,&#92;epsilon | S_0, N_0, E_0)) } ' class='latex' /></p>
<p>Harte uses this theory to calculate <img src='https://s0.wp.com/latex.php?latex=R%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='R,' title='R,' class='latex' /> and tests the results against data from about 20 ecosystems.  For example, he predicts the abundance of species as a function of their rank, with rank 1 being the most abundant, rank 2 being the second most abundant, and so on.  And he gets results like this:</p>
<div align="center">
<a href="http://math.ucr.edu/home/baez/ecological/harte_species_abundance_distribution.jpg"><img src="https://i1.wp.com/math.ucr.edu/home/baez/ecological/harte_species_abundance_distribution.jpg" /></a></div>
<p>The data here are from:</p>
<p>&bull; Green, Harte, and Ostling&#8217;s work on a serpentine grassland, </p>
<p>&bull; Luquillo&#8217;s work on a 10.24-hectare tropical forest, and</p>
<p>&bull; Cocoli&#8217;s work on a 2-hectare wet tropical forest.</p>
<p>The fit looks good to me&#8230; but I should emphasize that I haven&#8217;t had time to study these matters in detail.  For more, you can read this paper, at least if your institution subscribes to this journal:</p>
<p>&bull; J. Harte, T. Zillio, E. Conlisk and A. Smith, Maximum entropy and the state-variable approach to macroecology, <a href="http://www.esajournals.org/doi/full/10.1890/07-1369.1"><i>Ecology</i></a> <b>89</b> (2008), 2700&ndash;2711.</p>
<h3> Spatial abundance distribution </h3>
<p>The second part of Harte&#8217;s theory is all about a conditional probability distribution </p>
<p><img src='https://s0.wp.com/latex.php?latex=%5CPi%28n+%7C+A%2C+n_0%2C+A_0%29+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;Pi(n | A, n_0, A_0) ' title='&#92;Pi(n | A, n_0, A_0) ' class='latex' /></p>
<p>This is the probability that <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> individuals of a species are found in a region of area <img src='https://s0.wp.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A' title='A' class='latex' /> given that it has <img src='https://s0.wp.com/latex.php?latex=n_0&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n_0' title='n_0' class='latex' /> individuals in a larger region of area <img src='https://s0.wp.com/latex.php?latex=A_0.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A_0.' title='A_0.' class='latex' /></p>
<p><img src='https://s0.wp.com/latex.php?latex=%5CPi&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;Pi' title='&#92;Pi' class='latex' /> obeys two constraints.  First, since it&#8217;s a probability distribution, it obeys the normalization condition:</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Csum_n++%5CPi%28n+%7C+A%2C+n_0%2C+A_0%29+%3D+1+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;sum_n  &#92;Pi(n | A, n_0, A_0) = 1 }' title='&#92;displaystyle{ &#92;sum_n  &#92;Pi(n | A, n_0, A_0) = 1 }' class='latex' /></p>
<p>Second, since the mean value of <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> across regions of area <img src='https://s0.wp.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='A' title='A' class='latex' /> equals <img src='https://s0.wp.com/latex.php?latex=n_0+A%2FA_0%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n_0 A/A_0,' title='n_0 A/A_0,' class='latex' /> we have</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B+%5Csum_n+n+%5CPi%28n+%7C+A%2C+n_0%2C+A_0%29+%3D+n_0+A%2FA_0+%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{ &#92;sum_n n &#92;Pi(n | A, n_0, A_0) = n_0 A/A_0 }' title='&#92;displaystyle{ &#92;sum_n n &#92;Pi(n | A, n_0, A_0) = n_0 A/A_0 }' class='latex' /></p>
<p>Harte&#8217;s theory is that <img src='https://s0.wp.com/latex.php?latex=%5CPi&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;Pi' title='&#92;Pi' class='latex' /> <i>maximizes entropy subject to these two constraints</i>.   Here entropy is defined by</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cdisplaystyle%7B-+%5Csum_n++%5CPi%28n+%7C+A%2C+n_0%2C+A_0%29%5Cln%28%5CPi%28n+%7C+A%2C+n_0%2C+A_0%29%29+%7D+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;displaystyle{- &#92;sum_n  &#92;Pi(n | A, n_0, A_0)&#92;ln(&#92;Pi(n | A, n_0, A_0)) } ' title='&#92;displaystyle{- &#92;sum_n  &#92;Pi(n | A, n_0, A_0)&#92;ln(&#92;Pi(n | A, n_0, A_0)) } ' class='latex' /></p>
<p>Harte explains two approaches to use this idea to derive &#8216;scaling laws&#8217; for how <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> varies with <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' />.   And again, he compares his predictions to real-world data, and get results that look good to my (amateur, hasty) eye!</p>
<p>I hope sometime I can dig deeper into this subject.  Do you have any ideas, or knowledge about this stuff?</p>
]]></html><thumbnail_url><![CDATA[https://i1.wp.com/math.ucr.edu/home/baez/ecological/harte_species_abundance_distribution.jpg?fit=440%2C330]]></thumbnail_url><thumbnail_height><![CDATA[330]]></thumbnail_height><thumbnail_width><![CDATA[193]]></thumbnail_width></oembed>