<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress.com" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>pigeonhole-principle &amp;laquo; WordPress.com Tag Feed</title>
	<link>http://wordpress.com/tag/pigeonhole-principle/</link>
	<description>Feed of posts on WordPress.com tagged "pigeonhole-principle"</description>
	<pubDate>Mon, 07 Jul 2008 13:13:35 +0000</pubDate>

	<generator>http://wordpress.com/tags/</generator>
	<language>en</language>

<item>
<title><![CDATA[17 orang mendiskusikan tiga topik]]></title>
<link>http://artofmathematics.wordpress.com/?p=555</link>
<pubDate>Tue, 10 Jun 2008 07:44:07 +0000</pubDate>
<dc:creator>Johan</dc:creator>
<guid>http://artofmathematics.wordpress.com/?p=555</guid>
<description><![CDATA[[IMO 1964] Tujuh belas orang berdiskusi satu sama lain. Dalam satu diskusi hanya ada dua orang, dan ]]></description>
<content:encoded><![CDATA[<p>[IMO 1964] Tujuh belas orang berdiskusi satu sama lain. Dalam satu diskusi hanya ada dua orang, dan setiap orang berdiskusi dengan semua orang lainnya. Totalnya ada tiga topik yang dibahas. Buktikan ada tiga orang yang saling mendiskusikan topik yang sama.</p>
<p><!--more Lihat Solusi --></p>
<p>Solusi<br />
Misalkan topik itu adalah $latex T_1,T_2,T_3$, dan ada satu orang disebut $latex A$. Menurut prinsip rumah merpati, ada satu topik yang didiskusikan dengan 6 orang lainnya, anggaplah $latex T_3$. Jika dua dari 6 orang ini mendiskusikan $latex T_3$, kita selesai. Jadi anggap keenam orang ini mendiskusikan $latex T_1,T_2$ satu sama lain, tapi tidak $latex T_3$. Sebutlah satu dari enam orang ini $latex B$. Dengan prinsip rumah merpati, ia mendiskusikan salah satu topik dengan minimum 3 orang lainnya, sebutlah topik $latex T_1$. Jika dua dari tiga orang ini mendiskusikan $latex T_1$, kita selesai. Jika tidak, berarti ketiga orang ini mendiskusikan $latex T_2$, sehingga terbukti.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Reflections of the implications of the Pigeonhole principle in our lives]]></title>
<link>http://f241vc15.wordpress.com/?p=63</link>
<pubDate>Sun, 17 Feb 2008 07:04:25 +0000</pubDate>
<dc:creator>f241vc15</dc:creator>
<guid>http://f241vc15.wordpress.com/?p=63</guid>
<description><![CDATA[Readability test scores for this post are as follows:
Flesch Reading Ease:      70.52
Flesch-Kincaid]]></description>
<content:encoded><![CDATA[<p><i>Readability test scores for this post are as follows:</i></p>
<p><i>Flesch Reading Ease:      70.52<br />
Flesch-Kincaid Grade Level:      7.00<br />
Automated Readability Index:      7.00</i></p>
<p style="font-style:normal;text-decoration:none;"><i>For more info about readability tests, check out my post about those <a href="http://f241vc15.wordpress.com/2008/03/18/how-readable-are-you/">here</a>.<br />
</i></p>
<p>Math is fun in my opinion. It has, for the longest time, including Science, been my favorite subject.</p>
<p>Anyway, this post is about a portion of mathematics which is more popularly known as the pigeon hole principle. The pigeonhole principle states that, given<i>  k+1</i> or more objects (<i>k</i> being any positive integer) which are to be placed on <i>k</i> number of holes, there is at least one hole which will contain two or more of the objects. Or to generalize the definition in a mathematical way:</p>
<blockquote><p>if <i>M </i>objects are placed into <i>j</i> holes, <i>M </i>greater than or equal to <i>j</i>, then there is at least one hole containing at least <i>k</i> number of objects, where <i>k</i> is equal to the <i>ceiling( M / j )</i> .</p></blockquote>
<p>For example, if I have 12 pigeons to be placed into 11 pigeonholes, there will be at least one pigeonhole which will have 2 pigeons inside. This is so because the ceiling value <i>k</i>, the smallest integer  value not less than (<i>M / j) </i>, is 2 since 12 divided by 11 is approximately 1.09 whose ceiling value <i>k</i> is equal to 2</p>
<p><b>"So what?"</b></p>
<p>you may say. What would possibly be in the pigeonhole principle that could interest me? Well, a lot actually, same as a lot of other seemingly 'impractical' mathematical theories for our every day lives.</p>
<p><b>1 </b>If we take the upper limit on the <a href="http://science.howstuffworks.com/hair-coloring1.htm">average number of hair on the human head</a> (nice alliteration) to be about 150,000, and say a given city has a population size of 500,000, there would be at least 4 people with the same number of hair on their heads. This happened because taking the population size (500,000) as the pigeons and the maximum possible number of hair on the human head (150,000) as the pigeonholes, <i>k</i> is found to be equal to 4. Isn't that amazing to know? (okay sorry, I know not everyone will find that interesting).</p>
<p><b>2 </b>Take a pizza party for example: If I were to treat four of my friends so that the five of us will eat a pizza pie of 12 slices, at least one among us will be eating 3 pizzas, while the rest only eat 2. This assumes of course that everyone eats a whole slice of pizza. Now isn't that worth knowing at times like this? (^)__(^)</p>
<p><b>3</b> Another example is when you want to get say, candies from those vending machines which shows you their contents but you can't pick any flavor in particular (i.e. the machine gives the candy to you at random). Say you can see that there are only two candy flavors left: orange and lemon, and there are 12 of each. You will know (using the pigeonhole principle) that you can get at least 2 candies of the same flavor for sure by buying (not 13 candies) at least 3 candies only! This is because in the event that the first 2 candies you get are of different flavors, you know for certain that the third one will be one of the 2 flavors, giving you at least 2 candies of the same flavor!</p>
<p>However, if you want at least 2 of a specific flavor, say, lemon flavor, it becomes a bit different: you need to get at least 14 candies to be sure that you get 2 lemon flavored candies.</p>
<p><b>4</b> Say you have a shop/store with (doesn't have to be the exact number, since using the principle will still give you the correct answer) 50 aisles, 85 horizontal locations on each aisle, and 5 shelves. The products you sell are stored in bins on horizontal locations on the shelves of each aisle. What is the least number of products you would buy to ensure that at least 2 products are stored in the same bin?  That is, based on the above definition, what should be <i>M</i> so that for <i>j</i> = 85 x 50 x 5 = 21250, <i>k</i> = 2 ? Surely enough, <b>M<i> </i></b>would be equal to 21251 since taking the ceiling of 21251 / 21250 would be equal to 2.</p>
<p><b>5</b> If you collect/put together 13 people randomly (or even purposefully), by the pigeonhole principle, there are at least 2 people (pigeons) from that group who were born on the same month, since there are only 12 months (pigeonholes). And, much more obvious, is that among a group of 8 people (randomly/non-randomly chosen), at least two among them were born on the same day.</p>
<p>The <i>important</i> thing to remember here is to figure out which among the objects you're dealing with is the pigeon and which is the pigeonhole.</p>
<p>There you go! There are tons more of examples on the daily applications of the pigeonhole principle, from computer networks, to train stations, to supermarkets, to art classes, etc. Anybody disagrees with me? (^)__(^)</p>
<p>Btw, comments/suggestions/questions/corrections are welcome, as long as they come in a calm and ruly way.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Tiga angka terakhir dari pangkat bilangan]]></title>
<link>http://artofmathematics.wordpress.com/?p=252</link>
<pubDate>Thu, 07 Feb 2008 10:33:09 +0000</pubDate>
<dc:creator>Johan</dc:creator>
<guid>http://artofmathematics.wordpress.com/?p=252</guid>
<description><![CDATA[[Easy as ?] Buktikan bahwa terdapat suatu bilangan asli  sehingga  berakhiran dengan .

Solusi
Karen]]></description>
<content:encoded><![CDATA[<p>[Easy as $latex \pi$?] Buktikan bahwa terdapat suatu bilangan asli $latex n$ sehingga $latex 29^n$ berakhiran dengan $latex 001$.</p>
<p><!--more Lihat Solusi --></p>
<p>Solusi<br />
Karena terdapat 1000 angka dari $latex 000$  sampai $latex 999$, maka di antara $latex 29^1$, $latex 29^2$, $latex 29^3$, $latex \ldots$, $latex 29^{1001}$ terdapat dua bilangan yang tiga angka terakhirnya berbeda (dari prinsip rumah burung). Maka misalkan $latex 29^k$ dan $latex 29^l$ berakhiran dengan tiga angka yang sama, di mana $latex k&#62;l$.</p>
<p>Maka $latex 29^k-29^l$ habis dibagi 1000. Tetapi $latex 29^k-29^l=29^l(29^{k-l}-1)$ harus habis dibagi 1000, sedangkan $latex 29^l$ tidak mungkin habis dibagi 1000, karena 29 dan 1000 relatif prima. Maka $latex 29^{k-l}-1$ habis dibagi 1000, dan berakhiran dengan $latex 000$. Maka $latex 29^{k-l}$ berakhiran dengan $latex 001$. Terbukti.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Lingkaran dalam persegi berisi titik]]></title>
<link>http://artofmathematics.wordpress.com/?p=251</link>
<pubDate>Thu, 07 Feb 2008 10:08:26 +0000</pubDate>
<dc:creator>Johan</dc:creator>
<guid>http://artofmathematics.wordpress.com/?p=251</guid>
<description><![CDATA[[Easy as ?] Di dalam persegi berukuran  terdapat 51 titik. Buktikan bahwa terdapat sebuah lingkaran ]]></description>
<content:encoded><![CDATA[<p>[Easy as $latex \pi$?] Di dalam persegi berukuran $latex 1\times1$ terdapat 51 titik. Buktikan bahwa terdapat sebuah lingkaran dengan radius $latex 1/7$ yang berisi tiga titik di dalam persegi tersebut.</p>
<p><!--more Lihat Solusi --></p>
<p>Solusi<br />
Perhatikan bahwa $latex 2/7&#62;\sqrt2/5$. Tetapi $latex 2/7$ adalah diameter dari suatu lingkaran berjari-jari $latex 1/7$, sedangkan $latex \sqrt2/5$ adalah diagonal dari suatu persegi bersisi $latex 1/5$. Maka lingkaran itu dapat dibuat berada di luar persegi $latex 1/5\times1/5$.</p>
<p>Bagilah persegi $latex 1\times1$ tersebut menjadi 25 persegi yang identik, yaitu dengan memotong 5 bagian secara horizontal dan 5 bagian secara vertikal. Maka ukuran persegi-persegi kecil itu adalah $latex 1/5\times1/5$. Karena terdapat 51 titik, dengan prinsip rumah burung, terdapat setidaknya tiga titik yang berada di salah satu persegi. Tetapi kita dapat membuat lingkaran berjari-jari $latex 1/7$ di luar persegi itu. Jadi, di dalam lingkaran yang dibuat di luar persegi itu terdapat tiga titik.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Sembilan titik dalam persegi]]></title>
<link>http://artofmathematics.wordpress.com/?p=169</link>
<pubDate>Sun, 03 Feb 2008 14:44:01 +0000</pubDate>
<dc:creator>Johan</dc:creator>
<guid>http://artofmathematics.wordpress.com/?p=169</guid>
<description><![CDATA[[JBMO 1997] Di dalam sebuah persegi dengan sisi 1, terdapat sembilan titik. Buktikan bahwa terdapat ]]></description>
<content:encoded><![CDATA[<p>[JBMO 1997] Di dalam sebuah persegi dengan sisi 1, terdapat sembilan titik. Buktikan bahwa terdapat tiga titik yang membentuk segitiga dengan luas $latex \le1/8$.</p>
<p><!--more Lihat Solusi --></p>
<p>Solusi<br />
Bagilah persegi itu menjadi empat persegi kecil lagi, dengan sisi masing-masing $latex 1/2$. Dengan prinsip Dirichlet, atau rumah burung, kita mengetahui bahwa terdapat satu persegi kecil yang memiliki 3 titik di dalamnya. Luas segitiga itu maksimum jika titik-titiknya berada di titik-titik sudut persegi, sehingga luasnya $latex \frac{1}{2}\cdot\frac{1}{2}\cdot\frac{1}{2}=\frac{1}{8}$. Maka, luasnya $latex \le\frac{1}{8}$.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Permainan catur]]></title>
<link>http://artofmathematics.wordpress.com/2007/12/26/permainan-catur/</link>
<pubDate>Wed, 26 Dec 2007 13:21:06 +0000</pubDate>
<dc:creator>Johan</dc:creator>
<guid>http://artofmathematics.wordpress.com/2007/12/26/permainan-catur/</guid>
<description><![CDATA[[USSR Olympiad Problem Book] Seorang pemain catur berlatih selama 11 minggu. Setiap hari ia bermain ]]></description>
<content:encoded><![CDATA[<p>[USSR Olympiad Problem Book] Seorang pemain catur berlatih selama 11 minggu. Setiap hari ia bermain paling sedikit satu kali, tetapi setiap minggu ia bermain tidak lebih dari 12 kali. Buktikan bahwa pada beberapa hari berurutan tertentu pada 11 minggu itu, ia bermain tepat 21 kali.</p>
<p><!--more Lihat Solusi --></p>
<p>Solusi<br />
Misalkan $latex a_1$ adalah banyak permainannya pada hari pertama, $latex a_2$ adalah banyak permainan pada hari pertama dan kedua, dan seterusnya. $latex a_n$ adalah banyak permainan dari hari pertama sampai hari ke-$latex n$. Maka, barisan $latex a_1$, $latex a_2$, $latex a_3$, $latex \ldots$, $latex a_{77}$ terdiri dari bilangan-bilangan berbeda yang nilainya meningkat. Berdasarkan syarat bahwa setiap minggu tidak lebih dari 12 permainan, maka dalam 11 minggu tidak lebih dari $latex 11\cdot12=132$. Maka</p>
<p align="center">$latex 1\le a_1&#60;a_2&#60;a_3&#60;\ldots&#60;a_{77}\le132$,</p>
<p align="left">dan</p>
<div align="center"></div>
<p align="center">$latex 22\le a_1+21&#60;a_2+21&#60;a_3+21&#60;\ldots&#60;a_{77}+21\le153$.</p>
<p>Maka, $latex a_1$, $latex a_2$, $latex a_3$, $latex \ldots$, $latex a_{77}$, $latex a_1+21$, $latex a_2+21$, $latex a_3+21$, $latex \ldots$, $latex a_{77}+21$ adalah 154 bilangan asli dari 1 sampai 153. Berdasarkan prinsip rumah burung, terdapat dua bilangan yang sama. Tetapi 77 bilangan pertama tidak ada yang sama, begitu pula 77 bilangan terakhir. Jadi, satu bilangan adalah dari 77 bilangan pertama, bilangan lainnya dari 77 bilangan terakhir. Misalkan kedua bilangan itu adalah</p>
<p align="center">$latex a_i=a_j+21$,</p>
<p>untuk suatu $latex j&#60;i$, maka</p>
<p align="center">$latex 21=a_i-a_j$.</p>
<p>Jadi terdapat hari dari hari ke-$latex j+1$ sampai hari ke-$latex i$ yang terjadi tepat 21 permainan.</p>
]]></content:encoded>
</item>
<item>
<title><![CDATA[Soft analysis, hard analysis, and the finite convergence principle]]></title>
<link>http://terrytao.wordpress.com/2007/05/23/soft-analysis-hard-analysis-and-the-finite-convergence-principle/</link>
<pubDate>Thu, 24 May 2007 03:34:46 +0000</pubDate>
<dc:creator>Terence Tao</dc:creator>
<guid>http://terrytao.wordpress.com/2007/05/23/soft-analysis-hard-analysis-and-the-finite-convergence-principle/</guid>
<description><![CDATA[In the field of analysis, it is common to make a distinction between &#8220;hard&#8221;, &#8220;quan]]></description>
<content:encoded><![CDATA[<p>In the field of analysis, it is common to make a distinction between "hard", "quantitative", or "finitary" analysis on one hand, and "soft", "qualitative", or "infinitary" analysis on the other. "Hard analysis" is mostly concerned with finite quantities (e.g. the cardinality of finite sets, the measure of bounded sets, the value of convergent integrals, the norm of finite-dimensional vectors, etc.) and their <i>quantitative</i> properties (in particular, upper and lower bounds). "Soft analysis", on the other hand, tends to deal with more infinitary objects (e.g. sequences, measurable sets and functions, <a href="http://en.wikipedia.org/wiki/Sigma_algebra">$latex \sigma$-algebras</a>, Banach spaces, etc.) and their <i>qualitative</i> properties (convergence, boundedness, integrability, completeness, compactness, etc.). To put it more symbolically, hard analysis is the mathematics of $latex \varepsilon$, $latex N$, $latex O()$, and $latex \leq$<sup><a href="#_ftn1" name="_ftnref1" title="_ftnref1">[1]</a></sup>; soft analysis is the mathematics of 0, $latex \infty$, $latex \in$, and $latex \to$.</p>
<p>At first glance, the two types of analysis look very different; they deal with different types of objects, ask different types of questions, and seem to use different techniques in their proofs. They even use<sup><a href="#_ftn2" name="_ftnref2" title="_ftnref2">[2] </a></sup>different axioms of mathematics; the <a href="http://en.wikipedia.org/wiki/Axiom_of_infinity">axiom of infinity</a>, the <a href="http://en.wikipedia.org/wiki/Axiom_of_choice">axiom of choice</a>, and the <a href="http://en.wikipedia.org/wiki/Completeness_%28order_theory%29">Dedekind completeness axiom</a> for the real numbers are often invoked in soft analysis, but rarely in hard analysis. (As a consequence, there are occasionally some finitary results that can be proven easily by soft analysis but are in fact <i>impossible</i> to prove via hard analysis methods; the <a href="http://en.wikipedia.org/wiki/Paris-Harrington_theorem">Paris-Harrington theorem</a> gives a famous example.) Because of all these differences, it is common for analysts to specialise in only one of the two types of analysis. For instance, as a general rule (and with notable exceptions), discrete mathematicians, computer scientists, real-variable harmonic analysts, and analytic number theorists tend to rely on "hard analysis" tools, whereas <strike>functional analysts,</strike> operator algebraists, abstract harmonic analysts, and ergodic theorists tend to rely on "soft analysis" tools. (PDE is an interesting intermediate case in which <i>both</i> types of analysis are popular and useful, though many practitioners of PDE still prefer to primarily use just one of the two types. Another interesting transition occurs on the interface between point-set topology, which largely uses soft analysis, and metric geometry, which largely uses hard analysis.  Also, the ineffective bounds which crop up from time to time in analytic number theory are a sort of hybrid of hard and soft analysis.  Finally, there are examples of evolution of a field from soft analysis to hard (e.g. Banach space geometry) or vice versa (e.g. recent developments in extremal combinatorics, particularly in relation to the regularity lemma).)</p>
<p><!--more--></p>
<p>It is fairly well known that the <i>results </i>obtained by hard and soft analysis respectively can be connected to each other by various "correspondence principles" or "compactness principles". It is however my belief that the relationship between the two types of analysis is in fact much closer<sup><a href="#_ftn3" name="_ftnref3" title="_ftnref3">[3]</a></sup> than just this; in many cases, qualitative analysis can be viewed as a convenient abstraction of quantitative analysis, in which the precise dependencies between various finite quantities has been efficiently concealed from view by use of infinitary notation. Conversely, quantitative analysis can often be viewed as a more precise and detailed refinement of qualitative analysis. Furthermore, a method from hard analysis often has some analogue in soft analysis and vice versa, though the language and notation of the analogue may look completely different from that of the original. I therefore feel that it is often profitable for a practitioner of one type of analysis to learn about the other, as they both offer their own strengths, weaknesses, and intuition, and knowledge of one gives more insight<sup><a href="#_ftn4" name="_ftnref4" title="_ftnref4">[4]</a></sup> into the workings of the other. I wish to illustrate this point here using a simple but not terribly well known result, which I shall call the "finite convergence principle" (thanks to <a href="http://www.dpmms.cam.ac.uk/~bjg23/" class="snap_noshots">Ben Green</a> for suggesting this name; <a href="http://research.microsoft.com/~chayes/" class="snap_noshots">Jennifer Chayes</a> has also suggested the "<a href="http://en.wikipedia.org/wiki/Metastability">metastability</a> principle"). It is the finitary analogue of an utterly trivial infinitary result - namely, that every bounded monotone sequence converges - but sometimes, a careful analysis of a trivial result can be surprisingly revealing, as I hope to demonstrate here.</p>
<p>Before I discuss this principle, let me first present an informal, incomplete, and inaccurate "dictionary" between soft and hard analysis, to try to give a rough idea of the (partial) correspondences between the two:</p>
<table border="1">
<tr>
<td valign="top">Soft analysis</td>
<td valign="top">Hard analysis</td>
</tr>
<tr>
<td valign="top">x finite</td>
<td valign="top">x bounded (e.g. x = O(1))</td>
</tr>
<tr>
<td valign="top">x vanishes</td>
<td valign="top">x small (e.g. $latex &#124;x&#124; \leq \varepsilon$)</td>
</tr>
<tr>
<td valign="top">x infinite</td>
<td valign="top">x large (e.g. $latex &#124;x&#124; \geq N$)</td>
</tr>
<tr>
<td valign="top">$latex x_n \to 0$</td>
<td valign="top">Quantitative decay bound (e.g. $latex x_n = O(n^{-c})$)</td>
</tr>
<tr>
<td valign="top">$latex x_n$ is convergent</td>
<td valign="top">$latex x_n$ is metastable (see below)</td>
</tr>
<tr>
<td valign="top">f uniformly continuous</td>
<td valign="top">Lipschitz or Hölder bound on f (e.g. $latex &#124;f(x)-f(y)&#124; = O(&#124;x-y&#124;)$)</td>
</tr>
<tr>
<td valign="top">$latex f \in X$</td>
<td valign="top">$latex \&#124;f\&#124;_X = O(1)$</td>
</tr>
<tr>
<td valign="top">E compact</td>
<td valign="top">Metric entropy bound on E</td>
</tr>
<tr>
<td valign="top">E is Lebesgue measurable</td>
<td valign="top">E is (quantitatively) approximated by bounded complexity sets</td>
</tr>
<tr>
<td valign="top">V is generated by S</td>
<td valign="top">V is an algorithm initialised by S</td>
</tr>
<tr>
<td valign="top">u locally extremises F(u)</td>
<td valign="top">u has no nearby competitor with significantly better value of F</td>
</tr>
</table>
<p>One can draw two conclusions from this table:</p>
<ol>
<li>Soft analysis statements can often be stated both succinctly and rigorously, by using precisely defined and useful concepts (e.g. compactness, measurability, etc.). In hard analysis, one usually has to sacrifice one or the other: either one is rigorous but verbose (using lots of parameters such as $latex \varepsilon$, N, etc.), or succinct but "fuzzy" (using intuitive but vaguely defined concepts such as "size", "complexity", "nearby", etc.).</li>
<li>A single concept in soft analysis can have multiple hard analysis counterparts; a "naive" translation of a statement in soft analysis into hard analysis may be incorrect. (In particular, one should not use the above table blindly to convert from one to the other.)</li>
</ol>
<p>Anyway, back to the finite convergence principle. The infinite convergence principle is well known, though perhaps not by this name, to anyone who has taken undergraduate analysis:</p>
<blockquote><p><b>Infinite convergence principle.</b> Every bounded monotone sequence $latex x_n$ of real numbers is convergent.</p></blockquote>
<p>This basic principle - essentially equivalent to the Dedekind completeness axiom for the real numbers - is of course fundamental to many parts of infinitary analysis, most obviously in the theory of infinite sequences and series, but it also is implicit in just about any context in which one studies an "infinite" object (e.g. an infinite-dimensional vector space) by expressing it as a monotone limit of "finite" objects. It is undoubtedly an important tool in soft analysis. What, then, is its counterpart in hard analysis?</p>
<p>We will answer this question presently, but let us first make the infinite convergence principle a bit more quantitative. We may as well normalise the bounded sequence $latex x_n$ to lie between 0 and 1. Expanding out the "epsilon-delta" definition of convergence, we obtain</p>
<blockquote><p><b>Infinite convergence principle (again).</b> If $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq 1$, then there exists a real number x such that for every $latex \varepsilon &#62; 0$, there exists an N such that $latex &#124;x_n-x&#124; \leq \varepsilon$ for all $latex n \geq N$.</p></blockquote>
<p>There are quite a lot of quantifiers here. One can cut down the complexity a little bit by replacing the notion of a convergent sequence with that of a <a href="http://en.wikipedia.org/wiki/Cauchy_sequence">Cauchy sequence</a>. This lets us eliminate the need for a limit x, which does not have an obvious finitary counterpart. So we get</p>
<blockquote><p><b>Infinite convergence principle (yet again).</b> If $latex \varepsilon &#62; 0$ and $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq 1$, there exists an N such that $latex &#124;x_n-x_m&#124; \leq \varepsilon$ for all $latex n,m \geq N$.</p></blockquote>
<p>Note now that one does not need the real number system to make this principle both meaningful and non-trivial; the principle already works quite well when restricted to the rationals. (Exercise: prove this principle for the rationals without constructing the real number system.) Informally speaking, this principle asserts that every bounded monotone sequence is eventually stable up to error $latex \varepsilon$.</p>
<p>Now let's try to find the finitary (quantitative) equivalent of this principle. The most naive thing to do is simply to replace the infinite sequence by a finite sequence, thus</p>
<blockquote><p><b>Finite convergence principle (first attempt).</b> If $latex \varepsilon &#62; 0$ and $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq x_M \leq 1$, there exists an N such that $latex &#124;x_n-x_m&#124; \leq \varepsilon$ for all $latex N \leq n,m \leq M$.</p></blockquote>
<p>But this is trivially true; one can simply set N equal to M (or any number larger than M). So one needs to strengthen the claim. What about making N be independent of M, and only dependent on $latex \varepsilon$?</p>
<blockquote><p><b>Finite convergence principle (second attempt).</b> If $latex \varepsilon &#62; 0$ and $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq x_M \leq 1$, there exists an $latex N = N(\varepsilon)$ depending only on $latex \varepsilon$ such that $latex &#124;x_n-x_m&#124; \leq \varepsilon$ for all $latex N \leq n,m \leq M$.</p></blockquote>
<p>But this is trivially false; consider for instance a sequence $latex x_i$ which equals zero except at i=M, at which point we jump up to $latex x_M = 1$. We are not going to get the Cauchy property unless we set N to be as large as M... but we can't do that if we only want N to depend on $latex \varepsilon$.</p>
<p>So, is there anything non-trivial that one can say at all about finite bounded monotone sequences? Well, we have the <a href="http://en.wikipedia.org/wiki/Pigeonhole_principle">pigeonhole principle</a>:</p>
<blockquote><p><b>Pigeonhole principle.</b> If $latex \varepsilon &#62; 0$ and $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq x_M \leq 1$ is such that $latex M \geq 1/\varepsilon + 1$, there exists an $latex 1 \leq N &#60; M$ such that $latex &#124;x_{N+1} - x_N&#124; \leq \varepsilon$.</p></blockquote>
<p>Indeed, if the gaps between each element $latex x_N$ of the sequence and the next $latex x_{N+1}$ were always larger than $latex \varepsilon$, then $latex x_M - x_1$ would exceed $latex (M-1)\epsilon \geq 1$, a contradiction. This principle is true, but it is too weak to be considered a true finitary version of the infinite convergence principle; indeed, we see that the pigeonhole principle easily implies</p>
<blockquote><p><b>Weak infinite convergence principle.</b> If $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq 1$, then $latex \liminf_{n \to \infty} &#124;x_{n+1}-x_n&#124; = 0$.</p></blockquote>
<p>but does not obviously imply the full infinite convergence principle.</p>
<p>The problem is that the pigeonhole principle only establishes <i>instantaneous</i> stability of the sequence at some point n, whereas the infinite convergence principle concludes the <i>permanent</i> stability of the sequence after some point N. To get a better finitary match to the infinite convergence principle, we need to extend the region of stability that the pigeonhole principle offers. Now, one can do some trivial extensions such as</p>
<blockquote><p><b>Pigeonhole principle (second version).</b> If $latex \varepsilon &#62; 0$ and $latex k \geq 1$ and $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq x_M \leq 1$ is such that $latex M \geq k/\varepsilon + 1$, there exists $latex 1 \leq N &#60; N+k \leq M$ such that $latex &#124;x_n - x_m&#124; \leq \varepsilon$ for all $latex N \leq n,m \leq N+k$,</p></blockquote>
<p>which one can quickly deduce from the first pigeonhole principle by considering the sparsified sequence $latex x_k, x_{2k}, x_{3k}, \ldots$. But this is only a little bit better, as it now gives the infinitary statement</p>
<blockquote><p><b>Slightly less weak infinite convergence principle.</b> If $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq 1$, then $latex \liminf_{n \to \infty} &#124;x_{n+k}-x_n&#124; = 0$ for all k,</p></blockquote>
<p>but is still not strong enough to imply the infinite convergence principle in its full strength. Nevertheless, it shows that we can extend the realm of stability offered by the pigeonhole principle. One can for instance sparsify further, replacing n+k with 2n:</p>
<blockquote>
<p align="left"><b>Pigeonhole principle (third version).</b> If $latex \varepsilon &#62; 0$ and $latex k \geq 1$ and $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq x_M \leq 1$ is such that $latex M \geq 2^{1/\varepsilon} + 1$, there exists $latex 1 \leq N &#60; 2N \leq M$ such that $latex &#124;x_n - x_m&#124; \leq \varepsilon$ for all $latex N \leq n,m \leq 2N$.</p>
</blockquote>
<p>This can be proven by applying the first version of the pigeonhole principle to the sparsified sequence $latex x_1, x_2, x_4, x_8, \ldots$. This corresponds to an infinite convergence principle in which the conclusion is that $latex \liminf_{n \to \infty} &#124;x_{2n} - x_n&#124; = 0$.</p>
<p>One can of course keep doing this, achieving various sparsified versions of the pigeonhole principle which each capture part of the infinite convergence principle. To get the full infinite convergence principle, one cannot use any single such sparsified version of the pigeonhole principle, but instead must take <b>all of them at once</b>. This is the full strength of the finite convergence principle:</p>
<blockquote>
<p align="left"><b>Finite convergence principle</b><b>.</b> If $latex \varepsilon &#62; 0$ and $latex F: {\Bbb Z}_+ \to {\Bbb Z}_+$ is a function and $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq x_M \leq 1$ is such that $latex M$ is sufficiently large depending on F and $latex \varepsilon$, then there exists $latex 1 \leq N &#60; N+F(N) \leq M$ such that $latex &#124;x_n - x_m&#124; \leq \varepsilon$ for all $latex N \leq n,m \leq N+F(N)$.</p>
</blockquote>
<p>This principle is easily proven by appealing to the first pigeonhole principle with the sparsified sequence $latex x_{i_1}, x_{i_2}, x_{i_3}, \ldots$ where the indices are defined recursively by $latex i_1 := 1$ and $latex i_{j+1} := i_j + F(i_j)$. This gives an explicit bound on M as $latex M := i_{\lfloor 1/\varepsilon\rfloor + 1}$. Note that the first pigeonhole principle corresponds to the case $latex F(N) \equiv 1$, the second pigeonhole principle to the case $latex F(N) \equiv k$, and the third to the case $latex F(N) \equiv N$. A particularly useful case for applications is when F grows exponentially in N, in which case M grows <a href="http://en.wikipedia.org/wiki/Tetration">tower-exponentially</a> in $latex 1/\varepsilon$.</p>
<p>Informally, the above principle asserts that any sufficiently long (but finite) bounded monotone sequence will experience arbitrarily high-quality amounts of <a href="http://en.wikipedia.org/wiki/Metastability">metastability</a> with a specified error tolerance $latex \varepsilon$, in which the duration F(N) of the metastability exceeds the time N of onset of the metastability by an arbitrary function F which is specified in advance.</p>
<p>Let us now convince ourselves that this is the true finitary version of the infinite convergence principle, by deducing them from each other:</p>
<p><b>The finite convergence principle implies the infinite convergence principle</b>. Suppose for contradiction that the infinite convergence principle failed. Untangling the quantifiers, this asserts that there is an infinite sequence $latex 0 \leq x_1 \leq x_2 \leq \ldots \leq 1$ and an $latex \varepsilon &#62; 0$ with the property that, given any positive integer N, there exists a larger integer $latex N+F(N)$ such that $latex x_{N+F(N)} - x_N &#62; \varepsilon$. But this contradicts the finite convergence principle. $latex \Box$</p>
<p><b>The infinite convergence principle implies the finite convergence principle</b>. Suppose for contradiction that the finite convergence principle failed. Untangling the quantifiers, this asserts that there exists $latex \varepsilon &#62; 0$ and a function F, together with a collection $latex 0 \leq x^{(i)}_1 \leq \ldots \leq x^{(i)}_{M_i} \leq 1$ of bounded monotone sequences whose length $latex M_i$ goes to infinity, such that for each one of these sequences, there does <b>not</b> exist $latex 1 \leq N &#60; N+F(N) \leq M_i$ such that $latex &#124;x^{(i)}_n - x^{(i)}_m&#124; \leq \varepsilon$ for all $latex N \leq n,m \leq N+F(N)$. Let us extend each of the finite bounded sequences to infinite bounded sequences in some arbitrary manner, e.g. defining $latex x^{(i)}_n = 1$ whenever $latex n &#62; M_i$. The space of all bounded sequences is well-known<sup><a href="#_ftn5" name="_ftnref5" title="_ftnref5">[5]</a></sup> to be <a href="http://en.wikipedia.org/wiki/Sequentially_compact">sequentially compact</a> in the <a href="http://en.wikipedia.org/wiki/Product_topology">product topology</a>, thus after refining the i labels to a subsequence if necessary, we can assume that the sequences $latex (x^{(i)}_n)_{n=1}^\infty$ converge in the product topology (i.e. pointwise) to a new limit sequence $latex (x_n)_{n=1}^\infty$. Since each of the original sequences were bounded in the interval [0,1] and monotone, we see that the limit sequence is also. Furthermore, we claim that there does not exist any $latex N \geq 1$ for which $latex &#124;x_n - x_m&#124; &#60; \varepsilon$ for all $latex N \leq n,m \leq N+F(N)$. Indeed, if this were the case, then by pointwise convergence we would also have $latex &#124;x^{(i)}_n - x^{(i)}_m&#124; &#60; \varepsilon$ for all $latex N \leq n,m \leq N+F(N)$ and all sufficiently large i, but this contradicts the construction of the $latex x^{(i)}_n$. But now we see that this infinite bounded monotone sequence $latex (x_n)_{n=1}^\infty$ contradicts the infinite convergence principle. $latex \Box$</p>
<p>One can draw some morals from the above discussion:</p>
<ol>
<li>The finitary version of an infinitary statement can be significantly more verbose and ugly-looking than the infinitary original, and the arrangement of quantifiers becomes crucial.</li>
<li>The "naive" finitisation of an infinitary statement is often not the correct one.</li>
<li>While the finitary version of an infinitary statement is indeed quantitative, the bounds obtained can be quite poor (e.g. tower-exponential or worse).</li>
<li>The deduction of the infinitary statement from the finitary one is quite short, as long as one is willing to work indirectly (arguing by contradiction).</li>
<li>The deduction of the finitary statement from the infinitary one is a bit more complicated, but still straightforward, and relies primarily on compactness.</li>
<li>In particular, the equivalence of the finitary and infinitary formulations requires a non-trivial amount of infinitary mathematics (though in this particular case, we can at least leave the ultrafilters out of it).</li>
</ol>
<p>These morals apply not only to the finite and infinite convergence principle, but to many other pairs of finitary and infinitary statements, for instance <a href="http://en.wikipedia.org/wiki/Szemeredi%27s_theorem">Szemerédi's theorem</a> on one hand and the <i>Furstenberg recurrence theorem</i> on the other, as briefly discussed in <a href="http://terrytao.wordpress.com/2007/04/07/simons-lecture-ii-structure-and-randomness-in-ergodic-theory-and-graph-theory/" class="snap_noshots">my Simons lecture on these topics</a>. (In these contexts, the correspondence between the finitary and infinitary statements is known as the <i>Furstenberg correspondence principle</i>.)</p>
<p align="center">--- Applications ---</p>
<p>So, we've now extracted a quantitative finitary equivalent of the infinitary principle that every bounded monotone sequence converges. But can we actually <i>use</i> this finite convergence principle for some non-trivial finitary application? The answer is a definite <b>yes</b>: the finite convergence principle (implicitly) underlies the famous <a href="http://en.wikipedia.org/wiki/Szemer%C3%A9di_regularity_lemma">Szemerédi regularity lemma</a>, which is a major tool in graph theory, and also underlies some rather less well known regularity lemmas, such as the <a href="http://www.ams.org/mathscinet-getitem?mr=2153903" class="snap_noshots">arithmetic regularity lemma of Green</a>. More generally, this principle seems to often arise in any finitary application in which tower-exponential bounds are inevitably involved.</p>
<p>Before plunging into these applications, let us first establish a Hilbert space version<sup><a href="#_ftn6" name="_ftnref6" title="_ftnref6">[6]</a></sup> of the convergence principle. Given a (closed) subspace X of a Hilbert space H, and a vector $latex v \in H$, let $latex \pi_X v$ be the <a href="http://en.wikipedia.org/wiki/Orthogonal_projection">orthogonal projection</a> from v onto X. If X is finite dimensional, then this projection can be defined in a finitary way, for instance by using the <a href="http://en.wikipedia.org/wiki/Gram%E2%80%93Schmidt_process">Gram-Schmidt orthogonalisation procedure</a> to X. If X is infinite dimensional, then even the existence of the orthogonal projection is not completely trivial, and in fact relies ultimately on the infinite convergence principle. Closely related to the existence of this projection is the following monotone continuity property:</p>
<blockquote><p><b>Hilbert space infinite convergence principle</b>. Let $latex 0 \subset X_1 \subset X_2 \subset \ldots \subset H$ be a nested sequence of subspaces of a Hilbert space H, and let $latex X := \overline{\bigcup_{n=1}^\infty X_n}$ be the monotone closed limit of the $latex X_n$. Then for any vector v, $latex \pi_{X_n} v$ converges strongly in H to $latex \pi_X v$.</p></blockquote>
<p>As with the infinite convergence principle in [0,1], there is a Cauchy sequence version which already captures the bulk of the content:</p>
<blockquote><p><b>Hilbert space infinite convergence principle (again)</b>. Let $latex 0 \subset X_1 \subset X_2 \subset \ldots \subset H$ be a nested sequence of subspaces of a Hilbert space H, and let $latex \varepsilon &#62; 0$. Then for any vector v there exists N such that $latex \&#124; \pi_{X_n} v - \pi_{X_m} v \&#124;_H^2 \leq \varepsilon$ for all $latex n,m \geq N$.</p></blockquote>
<p>One can deduce this principle from the analogous principle in [0,1] by first normalising $latex \&#124;v\&#124;_H = 1$, and then observing from Pythagoras' theorem that $latex \&#124; \pi_{X_n} v \&#124;_H^2$ (which one should view as the <i>energy</i> of $latex X_n$ as measured relative to v) is a bounded monotone sequence from 0 to 1. Applying the infinite convergence principle, followed by Pythagoras' theorem yet again, we obtain the claim. Once one sees this, one immediately concludes that there is also a finitary equivalent:</p>
<blockquote><p><b>Hilbert space finite convergence principle.</b> If $latex \varepsilon &#62; 0$ and $latex F: {\Bbb Z}_+ \to {\Bbb Z}_+$, and $latex 0 \subset X_1 \subset X_2 \subset \ldots X_M \subset H$ is such that M is sufficiently large depending on F and $latex \varepsilon$, then for any vector v with $latex \&#124;v\&#124;_H \leq 1$ there exists $latex 1 \leq N \leq N + F(N) \leq M$ such that $latex \&#124; \pi_{X_n} v - \pi_{X_m} v \&#124;_H^2 \leq \varepsilon$ for all $latex N \leq n,m \leq N+F(N)$.</p></blockquote>
<p>Informally, given a long enough sequence of nested subspaces, and a given bounded vector v, one can find an arbitrarily good region of metastability in the orthogonal projections of v into these subspaces.</p>
<p>From this principle one can then quickly deduce the <a href="http://en.wikipedia.org/wiki/Szemer%C3%A9di_regularity_lemma">Szemerédi regularity lemma</a> as follows. Let G = (V,E) be a graph. One can think of the adjacency matrix $latex 1_E$ of this graph as an element of the (finite-dimensional) Hilbert space $latex L^2(V \times V)$, where the product space $latex V \times V$ is given normalised counting measure (and the discrete sigma-algebra $latex 2^V \times 2^V$). We can construct a nested sequence $latex {\mathcal B}_0 \subset {\mathcal B}_1 \subset {\mathcal B}_2 \subset \ldots$ of $latex \sigma$-algebras in $latex V$ (which one can think of as a sequence of increasingly fine partitions of V), together with the attendant sequence $latex L^2({\mathcal B}_0 \times {\mathcal B}_0) \subset L^2({\mathcal B}_1 \times {\mathcal B}_1) \subset \ldots$ of subspaces (this corresponds to functions on $latex V \times V$ which are constant on any product of pair of cells in the partition), by the following greedy algorithm<sup><a href="#_ftn7" name="_ftnref7" title="_ftnref7">[7]</a></sup>:</p>
<ol>
<li>Initialise $latex {\mathcal B}_0 := \{ \emptyset, V\}$ to be the trivial $latex \sigma$-algebra (i.e. the trivial partition).</li>
<li>Given $latex {\mathcal B}_n$, let $latex f_n := {\mathbb E}( 1_E &#124; {\mathcal B}_n \times {\mathcal B}_n)$ be the orthogonal projection of $latex 1_E$ to the space $latex L^2({\mathcal B}_n \times {\mathcal B}_n)$ (thus the value on any product of cells is just the edge density between that pair of cells), and let $latex g_n := 1_E - f_n$ be the deviation of the graph from its density.</li>
<li>Let $latex A_n, B_n$ be sets in V which maximise the discrepancy $latex \frac{1}{&#124;V&#124;^2} \sum_{a \in A_n} \sum_{b \in B_n} g_n(a,b)$.</li>
<li>Let $latex {\mathcal B}_{n+1}$ be the $latex \sigma$-algebra generated by $latex {\mathcal B}_n$ and $latex A_n, B_n$. Now increment n by n+1 and return to Step 2.</li>
</ol>
<p>Let $latex \varepsilon &#62; 0$ and $latex F: {\Bbb Z}_+ \to {\Bbb Z}_+$ be a function. Applying the Hilbert space finite convergence principle to the above sequence of vector spaces, we obtain some N with some bounded size (depending on $latex \varepsilon$ and F) such that</p>
<p align="center">$latex \&#124; f_n - f_m \&#124;_{L^2(V \times V)} \leq \varepsilon^2$ (*)</p>
<p>for all $latex N \leq n, m \leq N+F(N)$. By a further application of the pigeonhole principle (for Hilbert spaces), one can find $latex N \leq n \leq N+F(N)$ such that</p>
<p align="center">$latex \&#124; f_{n+1} - f_n\&#124;_{L^2(V \times V)} \leq \varepsilon^2/F(N)$.</p>
<p>What this basically means is that the partition $latex {\mathcal B}_n$ is very regular, in that even the greediest way to refine this partition does not significantly capture any more of the fluctuations of the graph G. By choosing F to be a suitable exponentially growing function, one can make the regularity of this partition exceed the number of cells (which is basically $latex 2^{2N}$) in the partition $latex {\mathcal B}_N$, which is "within epsilon" of the partition $latex {\mathcal B}_n$ in the sense of (*). Putting all this together, one can get a strong version of the Szemerédi regularity lemma, which implies the usual formulation by a simple argument; see <a href="http://www.ams.org/mathscinet-getitem?mr=2212136" class="snap_noshots">my paper on this topic</a> for further discussion. The choice of F being exponential is what results in the notorious tower-exponential bounds in this regularity lemma (which are necessary, thanks to a <a href="http://www.ams.org/mathscinet-getitem?mr=1445389" class="snap_noshots">result of Gowers</a>). But one can reduce F to, say, a polynomial, resulting in more civilised bounds but with a weaker regularity conclusion. Such a "weak regularity lemma" was for instance established <a href="http://www.ams.org/mathscinet-getitem?mr=1723039" class="snap_noshots">by Frieze and Kannan</a>, and also underlies the "generalised Koopman von Neumann theorem" which is a key component of <a href="http://front.math.ucdavis.edu/math.NT/0404188" class="snap_noshots">my result with Ben Green</a> establishing long arithmetic progressions in the primes. In the opposite direction, various flavours of "strong regularity lemma" have appeared in the literature, for instance in <a href="http://www.ams.org/mathscinet-getitem?mr=2181610" class="snap_noshots">this paper of Alon and Shapira</a>, and also turn out to be convenient ways to formulate hypergraph versions of the regularity lemma of adequate strength to imply non-trivial theorems (such as Szemerédi's theorem).</p>
<p>Rather than using sets which maximise discrepancy, one can also use sublevel sets of the eigenvectors of the adjacency matrix corresponding to the largest eigenvalues of the matrix to generate the partition; see this <a href="http://www.ams.org/mathscinet-getitem?mr=1674741" class="snap_noshots">paper of Frieze and Kannan</a> for details of a closely related construction.</p>
<p>The appearance of spectral theory (eigenvalues and eigenvectors) into this topic brings one in contact with Fourier analysis, especially if one considers <a href="http://en.wikipedia.org/wiki/Circulant_matrix">circulant matrices</a> (which correspond in graph-theoretic terms to <a href="http://en.wikipedia.org/wiki/Cayley_graph">Cayley graphs</a> on a cyclic group). This leads us towards the <a href="http://www.ams.org/mathscinet-getitem?mr=2153903" class="snap_noshots">arithmetic regularity lemma of Green</a>, which regularises a bounded function f on a finite abelian group G in terms of a partition generated by the sublevel sets (Bohr sets) of a bounded number of characters; the precise formulation is a bit lengthy to state properly, although it simplifies substantially in the "finite field model" case when G is a vector space over a small finite field (e.g. $latex {\Bbb F}_2$). This arithmetic regularity lemma can also be established using the finite convergence principle (in either the numerical form or the Hilbert space form). Indeed, if we let $latex H = L^2(G)$ and let $latex V_n$ be the vector space generated by the characters associated to the n largest Fourier coefficients of f, then by applying the finite convergence principle (with v = f) we can locate a metastable region, where there is not much going on (in an $latex L^2$ sense) between $latex V_N$ and $latex V_{N+F(N)}$ for some (exponentially growing) function F, thus there is a "spectral gap" of sorts between the N largest Fourier coefficients and the coefficients ranked N+F(N) and beyond. The sublevel sets of characters associated to the N largest coefficients can then be used to regularise the original function f. Similar ideas also appear in <a href="http://www.ams.org/mathscinet-getitem?mr=853455" class="snap_noshots">an old paper of Bourgain</a>, and in a paper of <a href="http://www.arxiv.org/pdf/math.CA/0601565" class="snap_noshots">Green and Konyagin</a>.</p>
<p align="center">--- A non-finitisable statement ---</p>
<p>One may think from the above discussion that every infinitary statement concerning, say, the natural numbers, has a finitary analogue which is equivalent to it. It turns out that this is not quite the case (and in fact some subtle issues in logic come in), even for very "natural" and "non-pathological" infinitary statements. In particular, the following innocuous-looking infinitary statement is basically non-finitisable:</p>
<blockquote>
<p align="left"><b>Infinite pigeonhole principle. </b>If the natural numbers are divided into finitely many colour classes, then one of the colour classes is infinite.</p>
</blockquote>
<p align="left">This principle is of course a triviality in the infinitary world, and there are some obvious candidates for finitary versions (e.g. the finite pigeonhole principle), but they are not equivalent to the infinitary principle the same way that the finite convergence principle is equivalent to the infinite convergence principle; there is a failure of compactness here. There is a finitary version (of sorts), but it is somewhat difficult to state. Define a <i>set function </i>to be any function f which takes a finite set A of natural numbers as input, and returns a natural number F(A) as output. Let us say that a set function F is <i>asymptotically stable</i><sup><a href="#_ftn8" name="_ftnref8" title="_ftnref8">[8]</a></sup> if, given any nested sequence $latex A_1 \subset A_2 \subset A_3 \subset \ldots$ of finite sets, the numbers $latex F(A_n)$ are constant for sufficiently large n. For instance, the "least element" set function $latex f(A) = \inf( A )$ (adopting the non-standard convention that the infimum of the empty set is 0) is asymptotically stable, but the "cardinality" set function $latex f(A) = &#124;A&#124;$ is not. Anyway, the correct "finitary" version of the infinite pigeonhole principle is</p>
<blockquote>
<p align="left"><b>"Finitary" infinite pigeonhole principle. </b>Let F be an asymptotically stable set function, and let k be a positive integer. Then there exists a positive integer N with the property that whenever the set {1,...,N} is divided into k colour classes, at least one of the colour classes A has the property that &#124;A&#124; &#62; F(A).</p>
</blockquote>
<p align="left">It is a good exercise to see that this principle is equivalent to the infinite pigeonhole principle. Note that the plain old pigeonhole principle essentially corresponds to the case when F is a constant function - and is thus definitely a very special case. The case when $latex F(A) = f(\inf( A ))$ for some fixed function f is already very interesting - it is the 1-uniform case of a "strong Ramsey theorem"and is barely provable by finitary means (try it, say for k=10 and $latex F(A) := \inf(A) + 10$. What quantitative bound for N do you get?), although the general case of that theorem is not (but does follow easily from the above principle), thanks to the <a href="http://en.wikipedia.org/wiki/Paris-Harrington_theorem">Paris-Harrington theorem</a>. The assumption of asymptotic stability of F is necessary, as one can see by considering the counterexample F(A) := &#124;A&#124;.</p>
<p align="left">I am enclosing "finitary" in quotes because while most of the assertion of this principle is finitary, one part still is not, which is the notion of "asymptotically stable". This is a notion which cannot be precisely formulated in a purely finitary manner, even though the notion of a set function is basically a finitary concept (ignoring for now a subtle issue about what "function" means). If one insists on working in a finitary setting, then one can recast the infinite pigeonhole principle as a <a href="http://en.wikipedia.org/wiki/Axiom_schema">schema</a> of finitary principles, one for each asymptotically stable set function F, but in order to work out exactly which set functions are asymptotically stable or not requires infinitary mathematics. (And for some (constructible, well-defined) set functions, the asymptotic stability is <i>undecidable</i>; this fact is closely related to the undecidability of the <a href="http://en.wikipedia.org/wiki/Halting_problem">halting problem</a> and is left as an exercise to the reader.)</p>
<p align="left">The topic of exactly which statements in infinitary mathematics are "truly infinitary" is a fascinating one, and is basically a question in <a href="http://en.wikipedia.org/wiki/Reverse_mathematics">reverse mathematics</a>, but we will not be able to discuss it here.</p>
<p align="left">(I am indebted to <a href="http://en.wikipedia.org/wiki/Harvey_Friedman">Harvey Friedman</a> for discussions on the Paris-Harrington theorem and the infinite pigeonhole principle.)</p>
<p align="center">--- Footnotes ---</p>
<p>(N.B. Footnotes were created following the suggestions from <a href="http://santoki.wordpress.com/2007/02/12/footnotes-the-long-way/" class="snap_noshots">this post</a> by <a href="http://santoki.wordpress.com/" class="snap_noshots">santoki</a>.)</p>
<p><a href="#_ftnref1" name="_ftn1" title="_ftn1">[1]</a> One can subdivide hard analysis into further subcategories by inspecting what kind of inequalities are used. There is "exact hard analysis" where one really uses $latex \leq$; "quasi-exact hard analysis" in which one is willing to lose absolute constants (and so one sees notation such as O(), $latex \lesssim$, or $latex \ll$); "logarithmically coarse hard analysis" in which one is willing to lose quantities such as $latex \log^{O(1)} N$ which are "logarithmic" in some key parameter N; and "polynomially coarse hard analysis" in which one is willing to lose quantities such as $latex N^{O(1)}$ which are polynomial in key parameters. Finally, there is "<a href="http://en.wikipedia.org/wiki/Coarse_structure">coarse analysis</a>" in which one is willing to lose arbitrary functions of key parameters. The relationships between these flavours of hard analysis are interesting, but will have to wait to be discussed elsewhere.</p>
<p><a href="#_ftnref2" name="_ftn2" title="_ftn2">[2]</a> One can use these axioms to make finer distinctions, for instance "strongly finitary" analysis, in which one is not even willing to use real numbers, but instead only works with finite complexity numbers (e.g. rationals), and "strongly infinitary" analysis, in which one freely uses the axiom of choice (or related concepts such as <a href="http://en.wikipedia.org/wiki/Ultrafilter">ultrafilters</a>). There are also hybrids between finitary and infinitary analysis, such as "pre-infinitary" analysis, in which one takes sequences of increasingly large or complex objects, and uses phrases such as "passing to a subsequence if necessary" a lot, but does not actually "jump to the limit"; we also have "pseudo-finitary" analysis, of which <a href="http://en.wikipedia.org/wiki/Non-standard_analysis">non-standard analysis</a> is the most prominent example, in which infinitary methods are re-expressed using infinitesimals or other pseudo-finitary objects. Again, these distinctions will have to wait to be discussed elsewhere.</p>
<p><a href="#_ftnref3" name="_ftn3" title="_ftn3">[3]</a> There are rigorous results from <a href="http://en.wikipedia.org/wiki/Proof_theory">proof theory</a>, such as <a href="http://en.wikipedia.org/wiki/Herbrand%27s_theorem_%28proof_theory%29">Herbrand's theorem</a>, which can allow one to automatically convert certain types of qualitative arguments into quantitative ones<strike>, but there appears to be only a small amount of literature applying these general results in logic to actual soft analysis arguments</strike>.  (See the comments to this post for some recent developments in this area.)</p>
<p><a href="#_ftnref4" name="_ftn4" title="_ftn4">[4]</a> For instance, in <a href="http://front.math.ucdavis.edu/math.NT/0404188">my result with Ben Green</a> establishing arbitrarily long arithmetic progressions of primes, the argument was (necessarily) finitary in nature, but it was absolutely essential for us to be aware of the infinitary arguments and intuition that had been developed in ergodic theory, as we had to adapt such arguments to the finitary setting in order to conclude our proof, and it would have far less evident how to discover such arguments if we were always restricted to looking at finitary settings. In general, it seems that infinitary methods are good for "long-range" mathematics, as by ignoring all quantitative issues one can move more rapidly to uncover qualitatively new kinds of results, whereas finitary methods are good for "short-range" mathematics, in which existing "soft" results are refined and understood much better via the process of making them increasingly sharp, precise, and quantitative. I feel therefore that these two methods are complementary, and are both important to deepening our understanding of mathematics as a whole.</p>
<p><a href="#_ftnref5" name="_ftn5" title="_ftn5">[5]</a> This looks like <a href="http://en.wikipedia.org/wiki/Tychonoff%27s_theorem">Tychonoff's theorem</a>, but because we require sequential compactness here rather than topological compactness, the result here is in fact much closer in spirit to the <a href="http://en.wikipedia.org/wiki/Arzela-Ascoli_theorem">Arzelà-Ascoli theorem</a>. In particular, the <a href="http://en.wikipedia.org/wiki/Arzela-Ascoli_theorem">axiom of choice</a> is not actually used here, instead one repeatedly uses the <a href="http://en.wikipedia.org/wiki/Bolzano-Weierstrass_theorem">Bolzano-Weierstrass theorem</a> for the interval [0,1] followed by a <a href="http://en.wikipedia.org/wiki/Diagonalization_argument">diagonalisation argument</a>. The astute reader here will observe that the Bolzano-Weierstrass theorem is essentially equivalent to the infinite convergence principle! Fortunately, there is no circularity here, because we are only using this theorem in order to deduce the finite convergence principle from the infinite, and not the other way around.</p>
<p><a href="#_ftnref6" name="_ftn6" title="_ftn6">[6]</a> One could also view this as a "noncommutative" or "quantum" version of the convergence principle, but this is somewhat of an abuse of terminology, despite the presence of the Hilbert space, since we don't actually have any noncommutativity or any other quantum weirdness going on.</p>
<p><a href="#_ftnref7" name="_ftn7" title="_ftn7">[7]</a> One can also replace this greedy algorithm by a random algorithm, in which each $latex {\mathcal B}_{n+1}$ is obtained from $latex {\mathcal B}_n$ by adding a neighbourhood $latex N(v_n)$ of a randomly chosen vertex $latex v_n$ in V. This use of randomisation appears in the infinitary setting in this <a href="http://front.math.ucdavis.edu/math.CO/0602037">paper of myself</a>, and in the finitary setting in this <a href="http://arxiv.org/pdf/math/0612838">paper of Ishigami</a>.</p>
<p><a href="#_ftnref8" name="_ftn8" title="_ftn8">[8]</a> This notation is <i>very</i> roughly equivalent to the notion of a function F(A) defined for all sets of integers (both finite and infinite) which can always be "computed" in "finite time". But one should take this informal definition with a large grain of salt: while there is indeed an algorithm for computing F(A) for any given set A which will eventually give the right answer, you might not be able to tell when the algorithm has finished! A good example is the asymptotically stable function $latex F(A) := \inf(A)$: you can "compute" this function for any set A by initialising the answer to 0, running a counter n from 0 to infinity, and resetting the answer permanently to n the first time n lies in A. As long as A is non-empty, this algorithm terminates in finite time with the correct answer; if A is empty, the algorithm gives the right answer from the beginning, but you can never be sure of this fact! In contrast, the cardinality &#124;A&#124; of a possibly infinite set A cannot be computed even in this rather unsatisfactory sense of having a running "provisional answer" which is guaranteed to eventually be correct.</p>
<p>[<i>Update</i>, May 24: Typos corrected. A distinction drawn between the strong Ramsey theorem (a theorem that can be stated in finitary mathematics), and the Paris-Harrington theorem (which shows that the strong Ramsey theorem is unprovable by finitary means). Acknowledgment to Harvey Friedman added, as well as a link to reverse mathematics.]</p>
<p>[<i>Update</i>, May 25: functional analysis changed to operator algebras.  Footnote [3] modified in view of recent comments.]</p>
<p>[Update, May 31: $latex \inf(A \cup \{0\}) $ corrected to "$latex \inf(A)$ when A is non-empty, and 0 if A is empty".]</p>
]]></content:encoded>
</item>

</channel>
</rss>
