<?xml version="1.0" encoding="UTF-8" standalone="yes"?><oembed><version><![CDATA[1.0]]></version><provider_name><![CDATA[Geometry and the imagination]]></provider_name><provider_url><![CDATA[https://lamington.wordpress.com]]></provider_url><author_name><![CDATA[Danny Calegari]]></author_name><author_url><![CDATA[https://lamington.wordpress.com/author/dannycaltech/]]></author_url><title><![CDATA[Measure theory, topology, and the role of&nbsp;examples]]></title><type><![CDATA[link]]></type><html><![CDATA[<p>Bill Thurston once observed that topology and measure theory are very <em>immiscible</em> (i.e. they don&#8217;t mix easily); this statement has always resonated with me, and I thought I would try to explain some of the (personal, psychological, and mathematical) reasons why. On the face of it, topology and measure theory are very closely related. Both are concerned with spaces equipped with certain algebras of sets (open sets, measurable sets) and classes of functions (continuous functions, measurable functions). Continuous functions (on reasonable spaces) are measurable, and (some) measures can be integrated to define continuous functions. However, in my mind at least, they are very different in a <em>psychological</em> sense, and one of the most important ways in which they differ concerns the role of <em>examples</em>.</p>
<p>At the risk of oversimplifying, one might say that one modern mathematical tradition, perhaps exemplified by the Bourbakists, insists that <em>examples are either irrelevant or misleading</em>. There is a famous story about Grothendieck, retold in <a href="http://www.ams.org/notices/200410/fea-grothendieck-part2.pdf">this article</a> by Allyn Jackson, which goes as follows:</p>
<blockquote><p>One striking characteristic of Grothendieck&#8217;s mode of thinking is that it seemed to rely so little on examples. This can be seen in the legend of the so-called &#8220;Grothendieck prime&#8221;. In a mathematical conversation, someone suggested to Grothendieck that they should consider a particular prime number. &#8220;You mean an actual number?&#8221; Grothendieck asked. The other person replied, yes, an actual prime number. Grothendieck suggested, &#8220;All right, take <img src="https://s0.wp.com/latex.php?latex=57&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="57" title="57" class="latex" />&#8221;.</p></blockquote>
<p>Leaving aside the &#8220;joke&#8221; of Grothendieck&#8217;s (supposed) inability to factor <img src="https://s0.wp.com/latex.php?latex=57&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="57" title="57" class="latex" />, this anecdote has an instructive point. No doubt Grothendieck&#8217;s associate was expecting a <em>small </em>prime number such as <img src="https://s0.wp.com/latex.php?latex=2&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="2" title="2" class="latex" /> or <img src="https://s0.wp.com/latex.php?latex=3&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="3" title="3" class="latex" />. What would have been the reaction if Grothendieck had said &#8220;All right, take <img src="https://s0.wp.com/latex.php?latex=2147483647&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="2147483647" title="2147483647" class="latex" />&#8220;? When one considers examples, one is prone to consider <em>simple</em> examples; of course this is natural, but one must be aware that such examples can be misleading. Morwen Thistlethwaite once made a similar observation about knot theory; from memory he said something like:</p>
<blockquote><p>When someone asks you to think about a knot, you usually imagine a trefoil, or a figure <img src="https://s0.wp.com/latex.php?latex=8&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="8" title="8" class="latex" />, or maybe a torus knot. But the right image to have in your mind is a room entirely filled with a long, tangled piece of string.</p></blockquote>
<p>Note that there is another crucial function of examples, namely their role as <em>counterexamples</em>, which certify the invalidity of a general claim &#8212; such counterexamples should, of course, be as simple as possible (and even Grothendieck was capable of coming up with some); but I am concerned here and in the sequel with the role of &#8220;confirming&#8221; examples, so to speak.</p>
<p>At the other extreme(?), and again at the risk of oversimplifying, one might take the &#8220;<a href="http://www.math.upenn.edu/~wilf/Downld.html"><img src="https://s0.wp.com/latex.php?latex=A%3DB&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="A=B" title="A=B" class="latex" /></a>&#8221; (or Petkovsek-Wilf-Zeilberger) point of view, that <em>sufficiently good/many examples are proofs</em>. They give a beautifully simple but psychologically interesting example (<strong>Theorem 1.4.2</strong> in <img src="https://s0.wp.com/latex.php?latex=A%3DB&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="A=B" title="A=B" class="latex" />): to show that the angle bisectors of a triangle are coincident, it suffices to verify this for a sufficiently large <em>but finite</em> (explicit) number of examples. The reason such a proof is valid is that the co-ordinates of the pairwise intersections of the angle bisectors are rational functions of (certain trigonometric functions of) the angles, of an explicit (and easily determined) degree, and to prove an identity between rational functions, it suffices to prove that it holds for enough values. Another aspect of the <img src="https://s0.wp.com/latex.php?latex=A%3DB&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="A=B" title="A=B" class="latex" /> philosophy is that by the process of <em>abstraction</em>, a theorem in one context can become an example in another. For example, &#8220;even plus odd equals odd&#8221; might be a theorem over <img src="https://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mathbb{Z}" title="&#92;mathbb{Z}" class="latex" />, but an example over <img src="https://s0.wp.com/latex.php?latex=%5Cmathbb%7BZ%7D%2F2%5Cmathbb%7BZ%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mathbb{Z}/2&#92;mathbb{Z}" title="&#92;mathbb{Z}/2&#92;mathbb{Z}" class="latex" />. One might say that the important thing about examples is that they should be <em>sufficiently</em> general that they exhibit all or enough of the complexity of the general case, and that if enough features of an example can be reimagined or abstracted as parameters, an example can become (or be translated into) a theorem.</p>
<p>In some fields of mathematics, one can make the idea of a &#8220;general example&#8221; rigorous. In algebraic geometry, one has the concept of a <em>generic point</em> on a scheme; in differential topology, one considers submanifolds in <em>general position</em>; in ergodic theory, one considers a <em>normal number</em> (or sequence in some fixed alphabet). In fact, it is not so clear whether a &#8220;formal&#8221; generic object in some domain should be thought of as the ultimate example, or as the ultimate rejection of the use of examples! In any case, in practice, when as mathematicians we select examples to test our ideas on, we rarely adhere to a rigorous procedure to ensure that our examples are good ones, and we are therefore susceptible to certain well-known psychological biases. The first is the <a href="http://en.wikipedia.org/wiki/Availability_heuristic"><em>availability heuristic</em></a>, as defined by the psychologists Kahneman and Tversky, which says roughly that people tend to overemphasize the importance of examples that they can think of most easily. Why exactly is this bad? Well, because it interacts with another bias, that it is easier to think of an example which is <em>more specific</em> &#8212; e.g. it is easier to think of a fruit that starts with the letter &#8220;A&#8221; than just to think of a fruit. One might argue that this bias is unavoidable, given the nature of the task &#8220;think of an example of X&#8221; &#8212; e.g. it is much easier to find a <em>unique</em> solution (of a differential equation, of a system of linear equations, etc.) than to find a solution to an underdetermined problem. In fact, finding a unique solution is so much easier than solving an underdetermined problem, that one often tries to solve the underdetermined problem by adding constraints until the solution is unique and can be found (e.g. simplex method in linear programming). Conversely, this bias is also part of the explanation for why examples are so useful: the mind devotes more attention and mental resources to a more specific object. So even if one is interested in finding a rigorous and abstract proof, it is often easier to find a proof for a specific example, and then to &#8220;rewrite&#8221; the proof, replacing the specific example by the general case, and checking that no additional hypotheses are used. The second psychological bias is that of <a href="http://en.wikipedia.org/wiki/Framing_(social_sciences)"><em>framing</em></a>. A frame consists of a collection of schemata and stereotypes that provide a context in which an event is interpreted. Many mathematical concepts or objects can be formulated in many different ways which are all <em>logically</em> equivalent, but which <em>frame</em> the concept or object quite differently. The word &#8220;bird&#8221; suggests (to most people) a schema which involves flight, wings, beaks, etc. The mental image it conjures up will (almost never) resemble a flightless bird like a penguin, or a kiwi, unless extra cues are given, like &#8220;a bird indigenous to New Zealand&#8221;. A statement about covering spaces might be equivalent to a statement in group theory, but the first might bring to mind topological ideas like paths, continuous maps, compact subsets etc. while the second might suggest homomorphisms, exact sequences, extensions etc., and the examples suggested by the frames might be substantially (mathematically) different, sometimes in crucial ways.</p>
<p>Back to measure theory and topology. In topology, one is frequently (always?) interested in a topological space. Here context is very important &#8212; a &#8220;topological space&#8221; could be a finite set, a graph, a solenoid, a manifold, a Cantor set, a sheaf, a CW complex, a lamination, a profinite group, a Banach space, the space of all compact metric spaces (up to isometry) in the Gromov-Hausdorff metric, etc. By contrast, a &#8220;measure space&#8221; is an interval, plus some countable (possibly empty) collection of atoms. Of course, one <em>thinks of</em> a measure space much more concretely, by adding some incidental extra structure which is irrelevant to the measure structure, but crucial to the mathematical (and psychological) interest of the space; hence a &#8220;measure space&#8221; could be the space of infinite sequences in a fixed alphabet, the Sierpinski gasket in the plane, the attractor of an Axiom A diffeomorphism, and so on. In other words, one is typically interested in measure theory as a <em>tool</em> to explore some mathematical object with additional structure, whereas one is frequently interested in topological spaces as objects of intrinsic interest in their own right. Many interesting classes of topological objects can be visualized in great detail &#8212; sometimes so much detail that in practice one generates proofs by examining sufficiently complicated examples, and building up clear and detailed mental models of the kinds of phenomena that can occur in a given context. Visualizing a &#8220;typical&#8221; measurable set (even a subset of <img src="https://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mathbb{R}" title="&#92;mathbb{R}" class="latex" /> or the plane) or map is much more difficult, if it is even possible (or, for that matter, a non-measurable set). In fact, one tends routinely to bump up against important subtleties in mathematical logic (especially set theory) when trying even to define such elusive entities as a &#8220;typical&#8221; measurable subset of the interval. For instance, Solovay&#8217;s <a href="http://www.ams.org/mathscinet-getitem?mr=0265151">famous theorem</a> says (amongst other things) that the statement &#8220;every set of real numbers is Lebesgue measurable&#8221; is consistent with Zermelo-Frankel set theory without the axiom of choice (in fact, Solovay&#8217;s result is relative to the existence of certain large cardinals &#8212; so-called <em>inaccessible cardinals</em>). Solovay explicitly addresses in his paper the issue of explicitly describing a non-Lesbesgue measurable set or reals:</p>
<blockquote><p>Of course the axiom of choice is true, and so there are non-measurable sets. It is natural to ask if one can explicitly describe a non-Lesbegue measurable set.</p></blockquote>
<p>Say that a set of reals <img src="https://s0.wp.com/latex.php?latex=A&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="A" title="A" class="latex" /> is <em>definable</em> from a set <img src="https://s0.wp.com/latex.php?latex=x_0&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="x_0" title="x_0" class="latex" /> if there is a formula <img src="https://s0.wp.com/latex.php?latex=%5CPsi%28x%2Cy%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Psi(x,y)" title="&#92;Psi(x,y)" class="latex" /> having free only the variables <img src="https://s0.wp.com/latex.php?latex=x&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="x" title="x" class="latex" /> and <img src="https://s0.wp.com/latex.php?latex=y&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="y" title="y" class="latex" />, so that <img src="https://s0.wp.com/latex.php?latex=A+%3D+%5Clbrace+y+%3A+%5CPsi%28x_0%2Cy%29%5Crbrace&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="A = &#92;lbrace y : &#92;Psi(x_0,y)&#92;rbrace" title="A = &#92;lbrace y : &#92;Psi(x_0,y)&#92;rbrace" class="latex" />. Solovay shows that (again, assuming the existence of an inaccessible cardinal), even if the axiom of choice is true, every set of reals definable from a countable sequence of ordinals is Lebesgue measurable (interestingly enough, one of the most important concepts introduced by Solovay in his paper is the notion of a <em>random real</em>, namely a real number that is not contained in any of a certain class of Borel sets of measure zero, namely those that are <em>rational</em> (i.e. those that can be encoded in a certain precise way); this resonates somewhat with the &#8220;generic points&#8221; and &#8220;normal numbers&#8221; mentioned earlier).</p>
<p>If imagining &#8220;good&#8221; examples in measure theory is hard, what is the alternative? Evidently, it is to imagine &#8220;bad&#8221; examples, or at least very non-generic ones. Under many circumstances, the &#8220;standard&#8221; mental image of a measurable map is a piecewise-constant map from the unit interval to a countable (even finite) set. This example rests on two approximations: the process of building up an arbitrary Borel set (in <img src="https://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mathbb{R}" title="&#92;mathbb{R}" class="latex" />, say) from half-open intervals by complementing, intersections and unions; and the process of defining an arbitrary measurable function as a limit of a sequence of finite sums of multiples of indicator functions. Such a mental image certainly has its uses, but for my own part I think that if one is going to use such a mental model anyway, one should be aggressive about using one&#8217;s intuition about continuous functions and open sets to make the example as specific, as rich and as &#8220;generic&#8221; as possible, while understanding that the result is not the measurable function or set itself, but only an approximation to it, and one should try to keep in mind a <em>sequence</em> of such maps, with increasing complexity and richness (if possible).</p>
<p>Of course, non-measurable sets <em>do</em> arise in practice. If one wants in the end to prove a theorem whose truth or falsity does not depend on the Axiom of choice, then by Solovay one could do without such sets if necessary. The fact that we do not must mean that the use of non-measurable sets (necessarily constructed using the Axiom of choice) leads to shorter/more findable proofs, or more understandable proofs, or both. Let me mention a few examples of situations in which the Axiom of choice is extremely useful:</p>
<ol>
<li>The Hahn-Banach theorem in functional analysis</li>
<li>The existence of ultralimits of sequences of metric spaces (equivalently, the existence of Stone-Cech compactifications)</li>
<li>A group <img src="https://s0.wp.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="G" title="G" class="latex" /> is said to be left orderable if there is a total order <img src="https://s0.wp.com/latex.php?latex=%3C&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&lt;" title="&lt;" class="latex" /> on <img src="https://s0.wp.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="G" title="G" class="latex" /> such that <img src="https://s0.wp.com/latex.php?latex=g%3Ch&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="g&lt;h" title="g&lt;h" class="latex" /> implies <img src="https://s0.wp.com/latex.php?latex=fg+%3C+fh&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="fg &lt; fh" title="fg &lt; fh" class="latex" /> for all <img src="https://s0.wp.com/latex.php?latex=f%2Cg%2Ch+%5Cin+G&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f,g,h &#92;in G" title="f,g,h &#92;in G" class="latex" />. If <img src="https://s0.wp.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="S" title="S" class="latex" /> is a finite subset of nontrivial elements of <img src="https://s0.wp.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="G" title="G" class="latex" />, the order partitions <img src="https://s0.wp.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="S" title="S" class="latex" /> into <img src="https://s0.wp.com/latex.php?latex=S%5E%2B+%5Ccup+S%5E-&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="S^+ &#92;cup S^-" title="S^+ &#92;cup S^-" class="latex" />, where the superscript denotes the elements that are greater than, respectively less than the identity element. Suppose for some finite set <img src="https://s0.wp.com/latex.php?latex=S&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="S" title="S" class="latex" />, and every partition into <img src="https://s0.wp.com/latex.php?latex=S%5E%2B+%5Ccup+S%5E-&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="S^+ &#92;cup S^-" title="S^+ &#92;cup S^-" class="latex" /> some product of elements of one of the subsets (with repeats allowed) is equal to the identity. Then necessarily <img src="https://s0.wp.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="G" title="G" class="latex" /> is not left orderable. In fact, the converse is also true: if no such &#8220;poison&#8221; subset exists, then <img src="https://s0.wp.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="G" title="G" class="latex" /> is left orderable. This follows from the compactness of the set of partitions of <img src="https://s0.wp.com/latex.php?latex=G-%5Ctext%7Bid%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="G-&#92;text{id}" title="G-&#92;text{id}" class="latex" /> into two subsets <img src="https://s0.wp.com/latex.php?latex=G%5E%2B+%5Ccup+G%5E-&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="G^+ &#92;cup G^-" title="G^+ &#92;cup G^-" class="latex" /> (equivalently, the compactness of the set <img src="https://s0.wp.com/latex.php?latex=%5Clbrace+-1%2C1%5Crbrace%5E%7BG-%5Ctext%7Bid%7D%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;lbrace -1,1&#92;rbrace^{G-&#92;text{id}}" title="&#92;lbrace -1,1&#92;rbrace^{G-&#92;text{id}}" class="latex" />) which follows from Tychonoff&#8217;s theorem.</li>
<li>The existence of Galois automorphisms of <img src="https://s0.wp.com/latex.php?latex=%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mathbb{C}" title="&#92;mathbb{C}" class="latex" /> over <img src="https://s0.wp.com/latex.php?latex=%5Cmathbb%7BQ%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mathbb{Q}" title="&#92;mathbb{Q}" class="latex" /> (other than complex conjugation). Such automorphisms are necessarily non-measurable, and therefore (by Solovay) cannot be constructed without the axiom of choice. In fact, this follows from a theorem of Mackey, that any measurable homomorphism between (sufficiently nice, i.e. locally compact, second countable) topological groups is continuous. We give the sketch of a proof. Suppose <img src="https://s0.wp.com/latex.php?latex=f%3AG+%5Cto+H&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f:G &#92;to H" title="f:G &#92;to H" class="latex" /> is given, and without loss of generality, assume it is surjective. Let <img src="https://s0.wp.com/latex.php?latex=U&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="U" title="U" class="latex" /> be a neighborhood of the identity in <img src="https://s0.wp.com/latex.php?latex=H&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="H" title="H" class="latex" />, and let <img src="https://s0.wp.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="V" title="V" class="latex" /> be a symmetric open neighborhood of the identity with <img src="https://s0.wp.com/latex.php?latex=V%5Ccdot+V+%5Csubset+U&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="V&#92;cdot V &#92;subset U" title="V&#92;cdot V &#92;subset U" class="latex" />. The group <img src="https://s0.wp.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="G" title="G" class="latex" /> is covered by countably many translates of <img src="https://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28V%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f^{-1}(V)" title="f^{-1}(V)" class="latex" />, and therefore the measure of <img src="https://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28V%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f^{-1}(V)" title="f^{-1}(V)" class="latex" /> is positive. Let <img src="https://s0.wp.com/latex.php?latex=X+%5Csubset+f%5E%7B-1%7D%28V%29+%5Csubset+W&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="X &#92;subset f^{-1}(V) &#92;subset W" title="X &#92;subset f^{-1}(V) &#92;subset W" class="latex" /> where <img src="https://s0.wp.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="X" title="X" class="latex" /> is compact, <img src="https://s0.wp.com/latex.php?latex=W&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W" title="W" class="latex" /> is open, and such that the (Haar) measure of <img src="https://s0.wp.com/latex.php?latex=W&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W" title="W" class="latex" /> is less than twice the Haar measure of <img src="https://s0.wp.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="X" title="X" class="latex" /> (the existence of such an open set <img src="https://s0.wp.com/latex.php?latex=W&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W" title="W" class="latex" /> depends on the fact that measure agrees with outer measure for measurable sets). Since <img src="https://s0.wp.com/latex.php?latex=W&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W" title="W" class="latex" /> is open, there is an open neighborhood <img src="https://s0.wp.com/latex.php?latex=T&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="T" title="T" class="latex" /> of the identity in <img src="https://s0.wp.com/latex.php?latex=G&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="G" title="G" class="latex" /> so that <img src="https://s0.wp.com/latex.php?latex=tX+%5Csubset+W&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="tX &#92;subset W" title="tX &#92;subset W" class="latex" /> for all <img src="https://s0.wp.com/latex.php?latex=t+%5Cin+T&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="t &#92;in T" title="t &#92;in T" class="latex" />. But <img src="https://s0.wp.com/latex.php?latex=tX&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="tX" title="tX" class="latex" /> and <img src="https://s0.wp.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="X" title="X" class="latex" /> both have measure more than half the measure of <img src="https://s0.wp.com/latex.php?latex=W&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W" title="W" class="latex" />, so they intersect. Since <img src="https://s0.wp.com/latex.php?latex=V&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="V" title="V" class="latex" /> is symmetric, so is <img src="https://s0.wp.com/latex.php?latex=f%5E%7B-1%7D%28V%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f^{-1}(V)" title="f^{-1}(V)" class="latex" />, and therefore <img src="https://s0.wp.com/latex.php?latex=T+%5Csubset+f%5E%7B-1%7D%28V%5Ccdot+V%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="T &#92;subset f^{-1}(V&#92;cdot V)" title="T &#92;subset f^{-1}(V&#92;cdot V)" class="latex" />. This implies <img src="https://s0.wp.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f" title="f" class="latex" /> is continuous, as claimed. A continuous Galois automorphism of <img src="https://s0.wp.com/latex.php?latex=%5Cmathbb%7BC%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mathbb{C}" title="&#92;mathbb{C}" class="latex" /> is either the identity, or complex conjugation.</li>
</ol>
<p>Personally, I think that one of the most compelling reasons to accept the Axiom of choice is psychological, and is related to the phenomenon of <em>closure</em>. If we see a fragment of a scene or a pattern, our mind fills in the rest of the scene or pattern for us. We have no photoreceptor cells in our eyes where the optic nerve passes through the retina, but instead of noticing this gap, we have an unperceived blind spot in our field of vision. If we can choose an element of a finite set whenever we want to, we feel as though nothing would stop us from making such a choice infinitely often. We are inclined to accept a formula with a &#8220;for all&#8221; quantifier  ranging over an infinite set, if the formula holds every time we check it. We are inclined to see patterns &#8212; even where they don&#8217;t exist. This is the seductive and dangerous (?) side of examples, and maybe a reason to exercise a little caution.</p>
<p>In fact, this discussion barely scratches the surface (and does not really probe into either topology or measure theory in any deep way). I would be very curious to hear contrasting opinions.</p>
<p><strong>Update (6/20):</strong> There are many other things that I could/should have mentioned about the interaction between measure theory and topology, and the difficulty of finding good generic examples in measure theory. For example, I definitely should have mentioned:</p>
<ol>
<li>Lusin&#8217;s theorem, which says that a measurable function is continuous on almost all its domain; e.g. if <img src="https://s0.wp.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f" title="f" class="latex" /> is any measurable function on an interval <img src="https://s0.wp.com/latex.php?latex=%5Ba%2Cb%5D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="[a,b]" title="[a,b]" class="latex" />, then for any positive <img src="https://s0.wp.com/latex.php?latex=%5Cepsilon&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;epsilon" title="&#92;epsilon" class="latex" /> there is a compact subset <img src="https://s0.wp.com/latex.php?latex=E+%5Csubset+%5Ba%2Cb%5D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="E &#92;subset [a,b]" title="E &#92;subset [a,b]" class="latex" /> so that the measure of <img src="https://s0.wp.com/latex.php?latex=%5Ba%2Cb%5D+-+E&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="[a,b] - E" title="[a,b] - E" class="latex" /> is at most <img src="https://s0.wp.com/latex.php?latex=%5Cepsilon&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;epsilon" title="&#92;epsilon" class="latex" />, and <img src="https://s0.wp.com/latex.php?latex=f&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f" title="f" class="latex" /> is continuous on <img src="https://s0.wp.com/latex.php?latex=E&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="E" title="E" class="latex" />.</li>
<li>von Neumann&#8217;s theorem, that a Borel probability measure <img src="https://s0.wp.com/latex.php?latex=%5Cmu&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mu" title="&#92;mu" class="latex" /> on the unit cube <img src="https://s0.wp.com/latex.php?latex=I%5En&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="I^n" title="I^n" class="latex" /> in <img src="https://s0.wp.com/latex.php?latex=%5Cmathbb%7BR%7D%5En&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mathbb{R}^n" title="&#92;mathbb{R}^n" class="latex" /> is equivalent to Lesbesgue measure (on the cube) by a self-homeomorphism of the cube (which can be taken to be the identity on the boundary) if and only if it is nonatomic, gives the boundary measure zero, and is positive on nonempty relatively open subsets.</li>
<li>Pairs of mutually singular measures of full support on simple sets. For example, let <img src="https://s0.wp.com/latex.php?latex=C&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="C" title="C" class="latex" /> be the Cantor set of infinite strings in the alphabet <img src="https://s0.wp.com/latex.php?latex=%5Clbrace+0%2C1%5Crbrace&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;lbrace 0,1&#92;rbrace" title="&#92;lbrace 0,1&#92;rbrace" class="latex" /> with its product topology, and define an infinite string <img src="https://s0.wp.com/latex.php?latex=W_%5Cinfty&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W_&#92;infty" title="W_&#92;infty" class="latex" /> in <img src="https://s0.wp.com/latex.php?latex=C&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="C" title="C" class="latex" /> inductively as follows. For any string <img src="https://s0.wp.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="w" title="w" class="latex" />, define the complement <img src="https://s0.wp.com/latex.php?latex=w%5Ec&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="w^c" title="w^c" class="latex" /> to be the string whose digits are obtained from <img src="https://s0.wp.com/latex.php?latex=w&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="w" title="w" class="latex" /> by interchanging <img src="https://s0.wp.com/latex.php?latex=0&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="0" title="0" class="latex" /> and <img src="https://s0.wp.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="1" title="1" class="latex" />. Then define <img src="https://s0.wp.com/latex.php?latex=W_1&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W_1" title="W_1" class="latex" /> to be the string <img src="https://s0.wp.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="1" title="1" class="latex" />, and inductively define <img src="https://s0.wp.com/latex.php?latex=W_%7Bn%2B1%7D%3D+W_n+W_n%5Ec+W_n%5Ec+%5Ccdots+W_n%5Ec&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W_{n+1}= W_n W_n^c W_n^c &#92;cdots W_n^c" title="W_{n+1}= W_n W_n^c W_n^c &#92;cdots W_n^c" class="latex" /> where there are <img src="https://s0.wp.com/latex.php?latex=f%28n%29-1&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f(n)-1" title="f(n)-1" class="latex" /> copies of <img src="https://s0.wp.com/latex.php?latex=W_n%5Ec&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W_n^c" title="W_n^c" class="latex" />, and <img src="https://s0.wp.com/latex.php?latex=f%28n%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="f(n)" title="f(n)" class="latex" /> is chosen so that <img src="https://s0.wp.com/latex.php?latex=%5Cprod_n+%28f%28n%29-1%29%2Ff%28n%29+%3D+r+%3E+1%2F2&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;prod_n (f(n)-1)/f(n) = r &gt; 1/2" title="&#92;prod_n (f(n)-1)/f(n) = r &gt; 1/2" class="latex" />. Let <img src="https://s0.wp.com/latex.php?latex=%5CLambda&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Lambda" title="&#92;Lambda" class="latex" /> be the set of accumulation points of <img src="https://s0.wp.com/latex.php?latex=W_%5Cinfty&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W_&#92;infty" title="W_&#92;infty" class="latex" /> under the shift map. Any finite string that appears in <img src="https://s0.wp.com/latex.php?latex=W_%5Cinfty&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W_&#92;infty" title="W_&#92;infty" class="latex" /> appears with definite density, so <img src="https://s0.wp.com/latex.php?latex=%5CLambda&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Lambda" title="&#92;Lambda" class="latex" /> is invariant and minimal (i.e. every orbit is dense) under the shift map. However, the proportion of <img src="https://s0.wp.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="1" title="1" class="latex" />&#8216;s in <img src="https://s0.wp.com/latex.php?latex=W_n&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W_n" title="W_n" class="latex" /> is at least <img src="https://s0.wp.com/latex.php?latex=r&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="r" title="r" class="latex" /> for <img src="https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="n" title="n" class="latex" /> odd, and at most <img src="https://s0.wp.com/latex.php?latex=1-r&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="1-r" title="1-r" class="latex" /> for <img src="https://s0.wp.com/latex.php?latex=n&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="n" title="n" class="latex" /> even. Let <img src="https://s0.wp.com/latex.php?latex=d_n&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="d_n" title="d_n" class="latex" /> denote the Dirac measure on the infinite string <img src="https://s0.wp.com/latex.php?latex=W_nW_nW_n%5Ccdots&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="W_nW_nW_n&#92;cdots" title="W_nW_nW_n&#92;cdots" class="latex" />, and let <img src="https://s0.wp.com/latex.php?latex=e_n&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="e_n" title="e_n" class="latex" /> denote the average of <img src="https://s0.wp.com/latex.php?latex=d_n&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="d_n" title="d_n" class="latex" /> over its (finite) orbit under the shift map. Define <img src="https://s0.wp.com/latex.php?latex=%5Cmu_0%5Ei+%3D+%5Cfrac+1+i+%5Csum_%7Bj%5Cle+i%7D+e_%7B2j%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mu_0^i = &#92;frac 1 i &#92;sum_{j&#92;le i} e_{2j}" title="&#92;mu_0^i = &#92;frac 1 i &#92;sum_{j&#92;le i} e_{2j}" class="latex" /> and <img src="https://s0.wp.com/latex.php?latex=%5Cmu_1%5Ei+%3D+%5Cfrac+1+i+%5Csum_%7Bj%5Cle+i%7D+e_%7B2j%2B1%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mu_1^i = &#92;frac 1 i &#92;sum_{j&#92;le i} e_{2j+1}" title="&#92;mu_1^i = &#92;frac 1 i &#92;sum_{j&#92;le i} e_{2j+1}" class="latex" />. These probability measures are shift-invariant, and have shift-invariant weak limits <img src="https://s0.wp.com/latex.php?latex=%5Cmu_0%2C%5Cmu_1&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mu_0,&#92;mu_1" title="&#92;mu_0,&#92;mu_1" class="latex" /> as <img src="https://s0.wp.com/latex.php?latex=i+%5Cto+%5Cinfty&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="i &#92;to &#92;infty" title="i &#92;to &#92;infty" class="latex" /> with support in <img src="https://s0.wp.com/latex.php?latex=%5CLambda&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Lambda" title="&#92;Lambda" class="latex" />. Moreover, if <img src="https://s0.wp.com/latex.php?latex=%5CLambda_j&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Lambda_j" title="&#92;Lambda_j" class="latex" /> denotes the strings in <img src="https://s0.wp.com/latex.php?latex=%5CLambda&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Lambda" title="&#92;Lambda" class="latex" /> that start with <img src="https://s0.wp.com/latex.php?latex=j&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="j" title="j" class="latex" /> for <img src="https://s0.wp.com/latex.php?latex=j%3D0%2C1&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="j=0,1" title="j=0,1" class="latex" />, then <img src="https://s0.wp.com/latex.php?latex=%5Cmu_j%28%5CLambda_j%29+%5Cge+r&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;mu_j(&#92;Lambda_j) &#92;ge r" title="&#92;mu_j(&#92;Lambda_j) &#92;ge r" class="latex" />. In particular, the space of shift invariant probability measures on <img src="https://s0.wp.com/latex.php?latex=%5CLambda&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Lambda" title="&#92;Lambda" class="latex" /> is at least <img src="https://s0.wp.com/latex.php?latex=1&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="1" title="1" class="latex" />-dimensional, and we may therefore obtain distinct mutually singular ergodic shift-invariant probability measures on <img src="https://s0.wp.com/latex.php?latex=%5CLambda&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Lambda" title="&#92;Lambda" class="latex" />. Since <img src="https://s0.wp.com/latex.php?latex=%5CLambda&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Lambda" title="&#92;Lambda" class="latex" /> is minimal, both measures have full support.</li>
<li>Shelah&#8217;s theorem that if one works in (ZF) plus the axiom of dependent choice, if there is an uncountable well-ordered set of reals, then there is a non-(Lebesgue) measurable set, which shows the necessity of Solovay&#8217;s use of inaccessible cardinals. (By Solovay, the axiom of dependent choice is consistent with the statement that every set of reals is Lebesgue-measurable).</li>
</ol>
]]></html></oembed>