<?xml version="1.0" encoding="UTF-8" standalone="yes"?><oembed><version><![CDATA[1.0]]></version><provider_name><![CDATA[Azimuth]]></provider_name><provider_url><![CDATA[https://johncarlosbaez.wordpress.com]]></provider_url><author_name><![CDATA[John Baez]]></author_name><author_url><![CDATA[https://johncarlosbaez.wordpress.com/author/johncarlosbaez/]]></author_url><title><![CDATA[Networks and Population Biology (Part&nbsp;3)]]></title><type><![CDATA[link]]></type><html><![CDATA[<p>I think today I&#8217;ll focus on one aspect of the talks Susan Holmes gave today: the space of phylogenetic trees.   Her talks were full of interesting applications to genetics, but I&#8217;m afraid my summary will drift off into a mathematical daydream inspired by what she said!  Luckily you can see her actual talk uncontaminated by my reveries here:</p>
<p>&bull; Susan Holmes, <a href="http://www-stat.stanford.edu/~susan/talks/SG2Treespace.pdf">Treespace: distances between trees</a>.</p>
<p>It&#8217;s based on this paper:</p>
<p>&bull; Louis Billera, Susan Holmes and Karen Vogtmann, <a href="http://www-stat.stanford.edu/~susan/papers/lap.pdf">Geometry of the space of phylogenetic trees</a>, <i>Advances in Applied Mathematics</i> <b>27</b> (2001), 733-767.</p>
<p>As I mentioned <a href="https://johncarlosbaez.wordpress.com/2011/05/04/networks-and-population-biology-part-2/">last time</a>, a <a href="http://en.wikipedia.org/wiki/Phylogenetic_tree">phylogenetic tree</a> is something like this:</p>
<div align="center"><a href="http://en.wikipedia.org/wiki/Phylogenetic_tree"><img src="https://i1.wp.com/upload.wikimedia.org/wikipedia/commons/thumb/7/70/Phylogenetic_tree.svg/450px-Phylogenetic_tree.svg.png" /></a>
</div>
<p>More mathematically, what we see here is a <a href="http://en.wikipedia.org/wiki/Tree_%28graph_theory%29">tree</a> (a connected graph with no circuits), with a distinguished <a href="http://en.wikipedia.org/wiki/Vertex_%28graph_theory%29">vertex</a> called the <b>root</b>, and <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> vertices of <a href="http://en.wikipedia.org/wiki/Degree_%28graph_theory%29">degree</a> 1, called <b>leaves</b>, that are labeled with elements from some <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' />-element set. We shall call such a thing a <b>leaf-labelled rooted tree</b>.  </p>
<p>Now, the tree above is actually a <b><a href="http://en.wikipedia.org/wiki/Binary_tree">binary</a></b> tree, meaning that as we move up an edge, away from the root, it either branches into two new edges or ends in a leaf.  (More precisely: each vertex that doesn&#8217;t have degree 1 has degree 3.)  This makes sense in biology because while species often split into two as they evolve, it is less likely for a species to split into <i>three</i> all at once.  </p>
<p>So, the phylogenetic trees we see in biology are usually leaf-labeled rooted binary trees.  However, we often want to <i>guess</i> such a tree from some data.  In this game, trees that aren&#8217;t binary become important too!  </p>
<p>Why?  Well, each edge of the tree can be labeled with a number saying how much evolution occurred along that edge: for example, how many DNA base pairs changed.  But as this number goes to zero, we get a tree that&#8217;s not binary anymore.  So, we think of non-binary trees as conceptually useful &#8216;intermediate cases&#8217; between binary trees.</p>
<p>This idea immediately leads us to consider a topological space consisting of phylogenetic trees which are not necessarily binary.  And at this point in the lecture I drifted off into a daydream about &#8216;operads&#8217;, which are a nice piece of mathematics that&#8217;s closely connected to this idea.  </p>
<p>So, I will deviate slightly from Holmes and define a <b>phylogenetic tree</b> to be a leaf-labeled rooted tree where each edge is labeled by a number called its <b>length</b>.  This length must be positive for every edge except the edge incident to the root; for that edge any nonnegative length is allowed.</p>
<p>Let&#8217;s write <img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPhyl%7D_n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{Phyl}_n' title='&#92;mathrm{Phyl}_n' class='latex' /> for the set of phylogenetic trees with <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> leaves.  This becomes a topological space in a fairly obvious way.   For example, there&#8217;s a continuous path in <img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPhyl%7D_3&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{Phyl}_3' title='&#92;mathrm{Phyl}_3' class='latex' /> that looks like this:</p>
<div align="center">
<img src="https://i2.wp.com/math.ucr.edu/home/baez/networks/tree_rotation.jpg" alt="" />
</div>
<p>Moreover we have this fact:</p>
<p><b>Theorem.</b> There is a topological operad called the <b>phylogenetic operad</b>, or <img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPhyl%7D%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{Phyl},' title='&#92;mathrm{Phyl},' class='latex' /> whose space of <i>n</i>-ary operations is <img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPhyl%7D_n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{Phyl}_n' title='&#92;mathrm{Phyl}_n' class='latex' /> for <img src='https://s0.wp.com/latex.php?latex=n+%5Cge+1&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n &#92;ge 1' title='n &#92;ge 1' class='latex' /> and the empty set for <img src='https://s0.wp.com/latex.php?latex=n+%3D+0&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n = 0' title='n = 0' class='latex' />.</p>
<p>If you don&#8217;t know what an <a href="http://math.ucr.edu/home/baez/operad.pdf">operad</a> is, don&#8217;t be scared.  This mainly just means that you can glue a bunch of phylogenetic trees to the top of another one and get a new phylogenetic tree!  More precisely, suppose you have a phylogenetic tree with <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> leaves, say <img src='https://s0.wp.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T' title='T' class='latex' />. And suppose you have <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> more, say <img src='https://s0.wp.com/latex.php?latex=T_1%2C+%5Cdots%2C+T_n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T_1, &#92;dots, T_n' title='T_1, &#92;dots, T_n' class='latex' />.  Then you can glue the roots of <img src='https://s0.wp.com/latex.php?latex=T_1%2C+%5Cdots%2C+T_n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T_1, &#92;dots, T_n' title='T_1, &#92;dots, T_n' class='latex' /> to the leaves of <img src='https://s0.wp.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T' title='T' class='latex' /> to get a new phylogenetic tree called <img src='https://s0.wp.com/latex.php?latex=T+%5Ccirc+%28T_1%2C+%5Cdots%2C+T_n%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T &#92;circ (T_1, &#92;dots, T_n)' title='T &#92;circ (T_1, &#92;dots, T_n)' class='latex' />.  Furthermore, this gluing operation obeys some rules which look incredibly intimidating when you write them out using symbols, but pathetically obvious when you draw them using pictures of trees.  And these rules are the <a href="http://www.math.uchicago.edu/~may/PAPERS/handout.pdf">definition of an operad</a>.</p>
<p>I would like to know if mathematicians have studied the operad <img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPhyl%7D.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{Phyl}.' title='&#92;mathrm{Phyl}.' class='latex' />  It&#8217;s closely related to Stasheff&#8217;s <a href="http://www.google.com./search?q=associahedron">associahedron</a> operad,  but importantly different.   Operads have &#8216;algebras&#8217;, and the algebras of the associahedron operad are topological spaces with a product that&#8217;s &#8216;associative up to coherent homotopy&#8217;.  I believe algebras of the phylogenetic operad are topological spaces with a <i>commutative</i> product that&#8217;s associative up to coherent homotopy.   Has someone studied these? </p>
<p>In their paper Holmes and her coauthors discuss the associahedron in relation to their own work, but they don&#8217;t mention operads.  I&#8217;ve found another paper that mentions &#8216;the space of phylogenetic trees&#8217;:</p>
<p>&bull; David Speyer and Bernd Sturmfels, <a href="http://arxiv.org/abs/math/0304218">The tropical Grassmannian</a>, <i>Adv. Geom.</i> <b>4</b> (2004), 389–411.</p>
<p>but they don&#8217;t seem to study the operad aspect either. </p>
<p>Perhaps one reason is that Holmes and her coauthors deliberately decide to ignore the labellings on the edges incident to the leaves.  So, they get a space of phylogenetic trees with <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> leaves whose product with <img src='https://s0.wp.com/latex.php?latex=%280%2C%5Cinfty%29%5En&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='(0,&#92;infty)^n' title='(0,&#92;infty)^n' class='latex' /> is the space I&#8217;m calling <img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPhyl%7D_n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{Phyl}_n' title='&#92;mathrm{Phyl}_n' class='latex' />.   As they mention, this simplifies the <i>geometry</i> a bit.  However, it&#8217;s not so nice if you want an operad that accurately describes how you can build a big phylogenetic tree from smaller ones.</p>
<p>They don&#8217;t care about operads; they do some wonderful things with the geometry of their space of phylogenetic trees.   They construct a natural metric on it, and show it&#8217;s a <a href="http://en.wikipedia.org/wiki/CAT%28k%29_space">CAT(0)</a> space in the sense of Gromov.  This means that the triangles in this space are more skinny than those in Euclidean space&mdash;more like triangles in hyperbolic space:</p>
<div align="center"><img src="https://i1.wp.com/upload.wikimedia.org/wikipedia/commons/thumb/f/f8/Uniform_tiling_73-t2.png/400px-Uniform_tiling_73-t2.png" /></div>
<p>They study geodesics in this space&mdash;even though it&#8217;s not a manifold, but something more singular.  And so on!  </p>
<p>There&#8217;s a lot of great geometry here.  But for Holmes, all this is just preparation for doing some genomics&mdash; for example, designing statistical tests to measure how reliable the phylogenetic trees guessed from data actually are.  And for that aspect, try this:</p>
<p>&bull; Susan Holmes, <a href="http://www-stat.stanford.edu/~susan/papers/chapihp04.pdf">Statistical approach to tests involving phylogenies</a>, in O. Gascuel, editor, <i>Mathematics of Evolution and Phylogeny</i>, Oxford U. Press, Oxford, 2007.</p>
]]></html><thumbnail_url><![CDATA[https://i1.wp.com/upload.wikimedia.org/wikipedia/commons/thumb/7/70/Phylogenetic_tree.svg/450px-Phylogenetic_tree.svg.png?fit=440%2C330]]></thumbnail_url><thumbnail_height><![CDATA[297]]></thumbnail_height><thumbnail_width><![CDATA[440]]></thumbnail_width></oembed>