<?xml version="1.0" encoding="UTF-8" standalone="yes"?><oembed><version><![CDATA[1.0]]></version><provider_name><![CDATA[Azimuth]]></provider_name><provider_url><![CDATA[https://johncarlosbaez.wordpress.com]]></provider_url><author_name><![CDATA[John Baez]]></author_name><author_url><![CDATA[https://johncarlosbaez.wordpress.com/author/johncarlosbaez/]]></author_url><title><![CDATA[Logic, Probability and&nbsp;Reflection]]></title><type><![CDATA[link]]></type><html><![CDATA[<p>&nbsp;</p>
<div align="center">
<img width="450" src="https://i1.wp.com/math.ucr.edu/home/baez/mathematical/MIRI_workshop.jpg" /></div>
<p>Last week I attended the <a href="http://intelligence.org/">Machine Intelligence Research Institute&#8217;s</a> sixth <a href="http://intelligence.org/2013/07/24/miris-december-2013-workshop/">Workshop on Logic, Probability, and Reflection</a>.  This one was in Berkeley, where the institute has its headquarters.</p>
<p>You may know this institute under their previous name: the Singularity Institute.  It seems to be the brainchild of Eliezer Yudkowsky, a well-known advocate of &#8216;friendly artificial intelligence&#8217;, whom I interviewed in <a href="https://johncarlosbaez.wordpress.com/2011/03/07/this-weeks-finds-week-311/">week311</a>, <a href="https://johncarlosbaez.wordpress.com/2011/03/14/this-weeks-finds-week-312/">week312</a> and <a href="https://johncarlosbaez.wordpress.com/2011/03/25/this-weeks-finds-week-313/">week313</a> of <i>This Week&#8217;s Finds</i>.  He takes an approach to artificial intelligence that&#8217;s heavily influenced by mathematical logic, and I got invited to the workshop because I <a href="https://johncarlosbaez.wordpress.com/?s=tarski%27s+theorem">blogged about</a> a paper he wrote with Mihaly Barasz, Paul Christiano and Marcello Herreshoff on probability theory and logic.  </p>
<p>I only have the energy to lay the groundwork for a good explanation of what happened in the workshop.  So, after you read my post, please read this:</p>
<p>&bull; Benja Fallenstein, <a href="http://lesswrong.com/lw/jed/results_from_miris_december_workshop/">Results from MIRI&#8217;s December workshop</a>, <i>Less Wrong</i>, 28 December 2013.</p>
<p>The workshop had two main themes, so let me tell you what they were.</p>
<h3> Scientific induction in mathematics </h3>
<p>The first theme is related to that paper I just mentioned.   How should a rational agent assign probabilities to statements in mathematics?  Of course an omniscient being could assign </p>
<p>&bull; <b>probability 1</b> to every mathematical statement that&#8217;s provable,</p>
<p>&bull; <b>probability 0</b> to every statement whose negation is provable, </p>
<p>and </p>
<p>&bull; <img src="https://i1.wp.com/math.ucr.edu/home/baez/emoticons/yuck.gif" /> to every statement that is neither provable nor disprovable.  </p>
<p>But a real-world rational agent will never have time to check all proofs, so there will always be lots of statements it&#8217;s not sure about.  Actual mathematicians always have conjectures, like the <a href="https://en.wikipedia.org/wiki/Twin_prime_conjecture">Twin Prime Conjecture,</a> that we consider plausible even though nobody has proved them.  And whenever we do research, we&#8217;re constantly estimating how likely it is for statements to be true, and changing our estimates as new evidence comes in.   In other words, we use <a href="https://en.wikipedia.org/wiki/Inductive_reasoning">scientific induction</a> in mathematics.</p>
<p>How could we automate this?  Most of us don&#8217;t consciously assign numerical probabilities to mathematical statements.  But maybe an AI mathematician should.  If so, what rules should it follow?</p>
<p>It&#8217;s natural to try a version of <a href="http://www.scholarpedia.org/article/Algorithmic_probability">Solomonoff induction</a>, where our probability estimate, before any evidence comes in, favors statements that are <i>simple</i>.  However, this runs up against problems.  If you&#8217;re interested in learning more about this, try:</p>
<p>&bull; Jeremy Hahn, <a href="https://intelligence.org/wp-content/uploads/2013/12/scientific-induction-in-probabilistic-mathematics.pdf">Scientific induction in probabilistic mathematics</a>.  </p>
<p>It&#8217;s a summary of ideas people came up with during the workshop.  I would like to explain them sometime, but for now I should move on.</p>
<h3> The L&ouml;bian obstacle </h3>
<p>The second main theme was the &#8216;L&ouml;bian obstacle&#8217;.  <a href="https://en.wikipedia.org/wiki/L%C3%B6b%27s_theorem">L&ouml;b&#8217;s theorem</a> is the flip side of <a href="https://en.wikipedia.org/wiki/G%C3%B6del%27s_incompleteness_theorems#First_incompleteness_theorem">G&ouml;del&#8217;s first incompleteness theorem</a>, less famous but just as shocking.  It seems to put limitations on how much a perfectly rational being can trust itself.  </p>
<p>Since it&#8217;s the day after Christmas, let&#8217;s ease our way into these deep waters with the Santa Claus paradox, also known as <a href="https://en.wikipedia.org/wiki/Curry%27s_paradox">Curry&#8217;s paradox</a>.</p>
<p>If you have a child who is worried that Santa Claus might not exist, you can reassure them using this sentence:</p>
<div align="center">
<b>If this sentence is true, Santa Claus exists.</b>
</div>
<p>Call it P, for short.  </p>
<p>Assume, for the sake of argument, that P is true. Then what it says is true: “If P is true, Santa Claus exists.” And we’re assuming P is true. So, Santa Claus exists.</p>
<p>So, we&#8217;ve proved that <i>if P is true, Santa Claus exists</i>.</p>
<p>But that&#8217;s just what P says!  </p>
<p>So, P is true.</p>
<p>So, Santa Claus exists!</p>
<div align="center">
<img width="250" src="https://i2.wp.com/math.ucr.edu/home/baez/mathematical/santa_claus.jpg" />
</div>
<p>There must be something wrong about this argument, even if Santa Claus <i>does</i> exist, because if it were valid you could you use it to prove anything at all.  The self-reference is obviously suspicious.  The sentence in question is a variant of the <a href="https://en.wikipedia.org/wiki/Liar_paradox">Liar Paradox</a>:</p>
<div align="center">
<b>This sentence is false.</b>
</div>
<p>since we can rewrite the Liar Paradox as</p>
<div align="center">
<b>If this sentence is true, 0 = 1.</b>
</div>
<p>and then replace &#8220;0=1&#8221; by any false statement you like.</p>
<p>However, G&ouml;del figured out a way to squeeze solid insights from these dubious self-referential sentences.  He did this by creating a statement in the language of arithmetic, referring to nothing but numbers, which nonetheless manages to <i>effectively</i> say </p>
<div align="center">
<b>This sentence is unprovable.</b>
</div>
<p>If it were provable, you&#8217;d get a contradiction!  So, either arithmetic is inconsistent or this sentence is unprovable.  But if it&#8217;s unprovable, it&#8217;s true.  So, <i>there are true but unprovable statements in arithmetic&#8230; unless arithmetic is inconsistent!</i>  This discovery shook the world of mathematics.</p>
<p>Here I&#8217;m being quite sloppy, just to get the idea across.  </p>
<p>For one thing, when I&#8217;m saying &#8216;provable&#8217;, I mean provable <i>given some specific axioms</i> for arithmetic, like the Peano axioms.   If we change our axioms, different statements will be provable.  </p>
<p>For another, the concept of &#8216;true&#8217; statements in arithmetic is often shunned by logicians.  That may sound shocking, but there are many reasons for this: for example, Tarski showed that <a href="https://en.wikipedia.org/wiki/Tarski%27s_undefinability_theorem">the truth of statements about arithmetic is undefinable in arithmetic</a>.  &#8216;Provability&#8217; is much easier to deal with.  </p>
<p>So, a better way of thinking about G&ouml;del&#8217;s result is that he constructed a statement that is neither provable nor disprovable from Peano&#8217;s axioms of arithmetic, unless those axioms are inconsistent (in which case we can prove everything, but it&#8217;s all worthless).  </p>
<p>Furthermore, this result applies not just to Peano&#8217;s axioms but to any stronger set of axioms, as long as you can write a computer program to list those axioms.</p>
<p>In 1952, the logician Leon Henkin flipped G&ouml;del&#8217;s idea around and asked about a sentence in the language of arithmetic that says:</p>
<div align="center">
<b>This sentence is provable.</b>
</div>
<p>He asked: is this provable or not?  The answer is much less obvious than for G&ouml;del&#8217;s sentence.  Play around with it and see what I mean.</p>
<p>But in 1954, Martin Hugo Löb showed that Henkin&#8217;s sentence <i>is</i> provable!  </p>
<p>And Henkin noticed something amazing: Löb&#8217;s proof shows much more.  </p>
<p>At this point it pays to become a bit more precise.  Let us write <img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPA%7D+%5Cvdash+P&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{PA} &#92;vdash P' title='&#92;mathrm{PA} &#92;vdash P' class='latex' /> to mean the statement <img src='https://s0.wp.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P' title='P' class='latex' /> is provable from the Peano axioms of arithmetic.  G&ouml;del figured out how to encode statements in arithmetic as numbers, so let&#8217;s write <img src='https://s0.wp.com/latex.php?latex=%5C%23+P&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;# P' title='&#92;# P' class='latex' /> for the G&ouml;del number of any statement <img src='https://s0.wp.com/latex.php?latex=P.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P.' title='P.' class='latex' />  And G&ouml;del figured out how to write a statement in arithmetic, say </p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BProvable%7D%28n%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{Provable}(n)' title='&#92;mathrm{Provable}(n)' class='latex' /></p>
<p>which says that the statement with G&ouml;del number <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> is provable using the Peano axioms.</p>
<p>Using this terminology, what Henkin originally did was find a number <img src='https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n' title='n' class='latex' /> such that the sentence</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BProvable%7D%28n%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{Provable}(n)' title='&#92;mathrm{Provable}(n)' class='latex' /></p>
<p>has G&ouml;del number <img src='https://s0.wp.com/latex.php?latex=n.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='n.' title='n.' class='latex' />   So, this sentence says</p>
<div align="center">
<b>This sentence is provable from the Peano axioms of arithmetic.</b>
</div>
<p>What L&ouml;b did was show</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPA%7D+%5Cvdash+%5Cmathrm%7BProvable%7D%28n%29&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{PA} &#92;vdash &#92;mathrm{Provable}(n)' title='&#92;mathrm{PA} &#92;vdash &#92;mathrm{Provable}(n)' class='latex' /></p>
<p>In other words, he showed that Henkin sentence really <i>is</i> provable from the Peano axioms!  </p>
<p>What Henkin then did is prove that for <i>any</i> sentence <img src='https://s0.wp.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P' title='P' class='latex' /> in the language of arithmetic, if</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPA%7D+%5Cvdash+%5Cmathrm%7BProvable%7D%28%5C%23+P%29+%5Cimplies+P+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{PA} &#92;vdash &#92;mathrm{Provable}(&#92;# P) &#92;implies P ' title='&#92;mathrm{PA} &#92;vdash &#92;mathrm{Provable}(&#92;# P) &#92;implies P ' class='latex' /></p>
<p>then </p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPA%7D+%5Cvdash+P+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{PA} &#92;vdash P ' title='&#92;mathrm{PA} &#92;vdash P ' class='latex' /></p>
<p>In other words, suppose we can prove that the provability of <img src='https://s0.wp.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P' title='P' class='latex' /> implies <img src='https://s0.wp.com/latex.php?latex=P.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P.' title='P.' class='latex' />  Then we can prove <img src='https://s0.wp.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P' title='P' class='latex' />!</p>
<p>At first this merely sounds nightmarishly complicated.  But if you think about it long enough, you&#8217;ll see it&#8217;s downright terrifying!   For example, suppose <img src='https://s0.wp.com/latex.php?latex=P&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P' title='P' class='latex' /> is some famous open question in arithmetic, like the Twin Prime Conjecture.   You might hope to prove</p>
<div align="center">
<b>The provability of the Twin Prime Conjecture implies the Twin Prime Conjecture.</b></div>
<p>Indeed, that seems like a perfectly reasonable thing to want.  But it turns out that proving this is as hard as proving the Twin Prime Conjecture!  Why?  Because if we can prove the boldface sentence above, L&ouml;b and Henkin&#8217;s work instantly gives us a proof of Twin Prime Conjecture!  </p>
<p>What does all this have to do with artificial intelligence?  </p>
<p>Well, what I just said is true not only for Peano arithmetic, but any set of axioms <i>including</i> Peano arithmetic that a computer program can list.  Suppose your highly logical AI mathematician has some such set of axioms, say <img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BAI%7D.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{AI}.' title='&#92;mathrm{AI}.' class='latex' />   You might want it to trust itself. In other words, you might want </p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BAI%7D+%5Cvdash+%5Cmathrm%7BProvable%7D%28%5C%23+P%29+%5Cimplies+P+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{AI} &#92;vdash &#92;mathrm{Provable}(&#92;# P) &#92;implies P ' title='&#92;mathrm{AI} &#92;vdash &#92;mathrm{Provable}(&#92;# P) &#92;implies P ' class='latex' /></p>
<p>for every sentence <img src='https://s0.wp.com/latex.php?latex=P.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P.' title='P.' class='latex' />   This says, roughly, that <i>whatever the AI can prove it can prove, it can prove</i>.  </p>
<p>But then L&ouml;b&#8217;s theorem would kick in and give</p>
<p><img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BAI%7D+%5Cvdash+P+&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{AI} &#92;vdash P ' title='&#92;mathrm{AI} &#92;vdash P ' class='latex' /></p>
<p>for every sentence <img src='https://s0.wp.com/latex.php?latex=P.&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='P.' title='P.' class='latex' />  And this would be disastrous: our AI would be inconsistent, because it could prove <i>everything!</i></p>
<p>This is just the beginning of the problems.  It gets more involved when we consider AI&#8217;s that spawn new AI&#8217;s and want to trust them.  For more see:</p>
<p>&bull; Eliezer Yudkowsky and Marcello Herreshoff, <a href="http://intelligence.org/files/TilingAgents.pdf">Tiling agents for self-modifying AI, and the Löbian obstacle</a>.</p>
<p>At workshop various people made progress on this issue, which is recorded in these summaries:</p>
<p>&bull; Eliezer Yudkowsky, <a href="https://intelligence.org/wp-content/uploads/2013/12/procrastination-paradox.pdf">The procrastination paradox</a>.</p>
<blockquote><p><b>Abstract.</b> A theorem by Marcello Herresho, Benja Fallenstein, and Stuart Armstrong shows that if there exists an infinite series of theories <img src='https://s0.wp.com/latex.php?latex=T_i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T_i' title='T_i' class='latex' /> extending <img src='https://s0.wp.com/latex.php?latex=%5Cmathrm%7BPA%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;mathrm{PA}' title='&#92;mathrm{PA}' class='latex' /> where each <img src='https://s0.wp.com/latex.php?latex=T_i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T_i' title='T_i' class='latex' /> proves the soundness of <img src='https://s0.wp.com/latex.php?latex=T_%7Bi%2B1%7D&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T_{i+1}' title='T_{i+1}' class='latex' />, then all the <img src='https://s0.wp.com/latex.php?latex=T_i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T_i' title='T_i' class='latex' /> must have only nonstandard models. We call this the Procrastination Theorem for reasons which will become apparent.
</p></blockquote>
<p>&bull; Benja Fallenstein, <a href="https://intelligence.org/wp-content/uploads/2013/12/consistency-waterfall.pdf">An infinitely descending sequence of sound theories each proving the next consistent</a>.</p>
<p>Here Fallenstein constructs a different sequence of theories <img src='https://s0.wp.com/latex.php?latex=T_i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T_i' title='T_i' class='latex' /> extending Peano arithmetic such that each <img src='https://s0.wp.com/latex.php?latex=T_i&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T_i' title='T_i' class='latex' /> proves the consistency of <img src='https://s0.wp.com/latex.php?latex=T_%7Bi%2B1%7D%2C&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='T_{i+1},' title='T_{i+1},' class='latex' /> and all the theories are sound for <img src='https://s0.wp.com/latex.php?latex=%5CPi_1&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;Pi_1' title='&#92;Pi_1' class='latex' /> sentences&#8212;that is, sentences with only one <img src='https://s0.wp.com/latex.php?latex=%5Cforall&#038;bg=ffffff&#038;fg=000&#038;s=0' alt='&#92;forall' title='&#92;forall' class='latex' /> quantifier outside the rest of the stuff.</p>
<p>The following summaries would take more work to explain:</p>
<p>&bull; Nate Soares, <a href="https://intelligence.org/wp-content/uploads/2013/12/fallensteins-monster.pdf">Fallenstein&#8217;s monster</a>.</p>
<p>&bull; Nisan Stiennon, <a href="https://intelligence.org/wp-content/uploads/2013/12/recursively-defined-theories-are-well-defined.pdf"> Recursively-defined logical theories are well-defined</a>.</p>
<p>&bull; Benja Fallenstein, <a href="https://intelligence.org/wp-content/uploads/2013/12/tiling-agents-5-and-10.pdf">The 5-and-10 problem and the tiling agents formalism</a>.</p>
<p>&bull; Benja Fallenstein, <a href="https://intelligence.org/wp-content/uploads/2013/12/decreasing-strength-parametric-polymorphism.pdf">Decreasing mathematical strength in one formalization of parametric polymorphism.</a></p>
<p>Again: before reading any of these summaries, I urge you to read <a href="http://lesswrong.com/lw/jed/results_from_miris_december_workshop/">Benja Fallenstein&#8217;s post</a>, which will help you understand them!</p>
]]></html><thumbnail_url><![CDATA[https://i1.wp.com/math.ucr.edu/home/baez/mathematical/MIRI_workshop.jpg?fit=440%2C330]]></thumbnail_url><thumbnail_height><![CDATA[139]]></thumbnail_height><thumbnail_width><![CDATA[440]]></thumbnail_width></oembed>