Infinite_Lottery title corr This document has two parts: The original preprint of “How to Build an Infinite Lottery Machine” And Preprint, co-authored with Alexander R. Pruss, of Correction to John D. Norton “How to Build an Infinite Lottery Machine.” It corrects an error in Section 11.3 of the original preprint. 1 September 13, 14; October 21, 27, December 13, 2016; February 14, 2017. How to Build an Infinite Lottery Machine John D. Norton1 Department of History and Philosophy of Science University of Pittsburgh http://www.pitt.edu/~jdnorton An infinite lottery machine is used as a foil for testing the reach of inductive inference, since inferences concerning it require novel extensions of probability. Its use is defensible if there is some sense in which the lottery is physically possible, even if exotic physics is needed. I argue that exotic physics is needed and describe several proposals that fail and at least one that succeeds well enough. 1. Introduction An infinite lottery machine, as I shall construe it here, chooses from a countable infinity of outcomes n = 1, 2, 3, … without favoring any outcome. The machine is immediately problematic if one applies routine probabilistic notions to it. If P(n) is the probability of outcome n, then the probability that one of the outcomes is realized is given by the infinite sum: P(1) + P(2) + … + P(n) + … = 1 (1) It must equal one, as shown, since one of the outcomes 1, 2, 3, … is sure to happen. However each individual outcome must have the same probability ε = P(1) = P(2) = … = P(n) = … (2) If ε>0, then the infinite sum of probabilities in (1) is infinite, contradicting the axiom of probability theory that the probability of the whole event space is unity. If however we set ε=0, then the same sum yields zero. 1 I thank John Earman, Casper Storm Hansen, Bryan Roberts, David Snoke, Teddy Seidenfeld and Porter Williams for helpful discussion. 2 The infinite lottery is almost always conceived in the philosophical literature as a technical problem for probabilistic inference. On the evidence of the design and set up of the machine, what inductive support does each outcome accrue?2 Because of the problem above, a probability seems unable to measure that support. Most attempts to escape adjust the notion of probability. De Finetti (1972; §5.17) discarded “countable additivity,” the capacity to sum the infinitely many outcomes of (1) to yield a probability. Instead, he employs “finite additivity,” which allows only finitely many terms in (1) to be summed. Then we can set ε = 0 in (2) without immediate problems, since the infinite sum of (1) is no longer admissible as a means of computing a probability. There has been considerable critical weighing of the ramifications of discarding countable additivity. See, for example, Bartha (2004), Blackwell and Diaconis (1996), Kadane, Schervish, and Seidenfeld (1986), Kadane and O’Hagan (1995) and Williamson (1999). A more elaborate proposal, sets ε in (2) to an infinitesimal, employing non-standard analysis. See for example Benci, Horsten, and Wenmackers (2013) and Wenmackers and Horsten, (2013). The cogency of this approach has in turn been debated. See for example, Pruss (2014), Williamson (2007) and Weintraub (2008). My primary goal in this paper is not to address these technical problems in probabilistic inference. Rather it is to address a physical problem: is an infinite lottery machine physically possible? If, as some contend, the very idea of the lottery is internally contradictory, then it is a physical impossibility and we should renounce it as a probe of our notions of probabilistic inference. This simple-sounding question of physical possibility will prove to be treacherous. No one, as far as I know, has found a way to build a real, infinite lottery machine. Instead, time and again, obvious proposals for such machines run into fatal difficulties. Section 2 below reviews a few natural proposals to give a sense of how they fail. The repeated failures suggest some principled obstacle. In Section 3, two arguments that try to establish the impossibility of infinite lottery machines are shown to fail. If nothing rules out an infinite lottery machine in principle, but none are constructible in our ordinary world, then in just what sense could one be said to be physically possible? Here we 2 Here I will restrict myself to objective approaches to inductive inference. Analogous problems arise if we ask what our subjective belief should be in each outcome. 3 run headlong into the awkward problem that we have no precise notion of physical possibility. Rather physical possibility comes in a myriad of varieties. Stricter forms allow only minor departures from actuality. More exotic forms skirt contradictions, such as supertask systems, in which an infinity of distinct actions are completed in finite time. Since there is no single, universally correct notion of physical possibility, I propose in Section 4 that we choose a sense of physical possibility fitting to the infinite lottery’s role as a probe or foil for inductive inference. This automatically authorizes a quite liberal notion of physical possibility. For we expect that inductive reasoning, like deductive reasoning, should be possible not just in worlds exactly like ours, but in any world that (a) can be described without logical contradiction (hence non-trivial deductive inference is possible) and (b) is governed by some regularities (so inductive inference is possible). The sections following map out suitable regularities conservatively. This is to minimize exotic possibilities, for the interest of an infinite lottery as a probe increases the closer its world is to one like ours. Section 5 specifies more clearly just what the lottery’s “choosing without favor” must mean. Section 6 makes conservative recommendations over which components may be employed as primitives. Section 7 urges that the infinity of the lottery requires that we admit supertasks, lest the range of possibility be too narrowly restricted and it advises caution in their use, since imprudent use can easily lead to internal contradictions. The remainder of the paper explores various proposals for infinite lotteries. Section 8 rejects a supertask proposal that employs an infinitely accelerated random walk. Section 9 rejects an ingenious proposal by Hansen (2016) that employs a reversed supertask. The concluding sections 10 and 11 describe my best candidates for an infinite lottery machine. The weaker one is in Section 10. It employs measurement on a quantum system comprised of a superposition of infinitely many basis states. Section 11 extracts an infinite lottery from an infinite system of binary randomizers (“coin tosses”). The last example is notable in that its individual outcomes and even whether it is successful at all are nonmeasurable in the natural background probability measure. Readers impatient to get the best answer I have to the “How To …” of the title should jump to these last sections, although they may find their complexities puzzling without the preparation of the earlier sections. 4 2. Simple Machines that Fail It may seem that all this is much ado about very little. Might we not find a simple way to implement an infinite lottery? The trouble is that simple proposals have a maddening habit of falling apart on inspection. Here are three to give a flavor of the problems. 2.1 The Infinitely Large Urn A traditional lottery machine consists of a finite set of identically constituted, numbered balls in an urn. The balls are mixed by tumbling the urn or agitating the contents with stirrers or air jets. One or more balls are then drawn from a recovery point to yield the random outcome. An infinite lottery machine is constituted in the same way, but with infinitely many balls. We have no assurance that it will work. There are two ways to see the problems. First, informally, the dynamics of mixing is qualitatively different in a finite and an infinite urn. No matter how large the finite urn, if it is tumbled through many revolutions, we have a reasonable expectation that every ball has passed back and forth many times and is thus as likely as any other to be in the vicinity of the recovery point. Infinitely many balls of equal size require an urn of infinite volume. If either tumbled or agitated, how do we know that the result is not merely some local rearrangement of the balls? Thorough mixing will require unlimited speeds so that even the most distant balls can cover arbitrarily large distances in a bounded time to reach the recovery point. How do we realize these speeds throughout the urn? How do we realize them so that each ball has an equal chance of selection? Second, there are technical results that fail in the infinite case. In statistical physics, a finite system of hard spheres colliding in a box is a strong candidate for an “ergodic system,” that is, loosely speaking, one that over time visits all possible configurations, near enough. That makes it an excellent randomizing device. A lottery machine consisting of finitely many balls in an urn is close enough to it for us to expect comparable behavior. Once we move to an infinite system, however, the technical apparatus breaks down. We no longer have ergodic expectations. Indeed we can no longer even expect deterministic time development. This last breakdown may seem good. However it is not. For the best we can say is that we just do not know how the infinite system will behave. We need more. We need positive affirmation that the infinite system will behave well as a randomizer. We do not have it. 5 2.2 The Jumping Flea The trouble with the infinite urn is that we are unsure of its dynamics. This problem is avoided in another approach that starts with a finite scheme whose dynamics are more clearly defined and takes a limit to infinity. We can choose without favor among N numbers, 1, 2, 3, …, N as follows. A flea jumps in one direction only along a long tape with cells marked 1, 2, 3, …, N, starting at cell 1. See Figure 1. Figure 1. The Jumping Flea The flea jumps to the next cell only if a randomizer instructs “jump” as opposed to “halt.” The probability P(n,n+1) that the randomizer instructs a jump from n to n+1 is: for 00, 6 That is, the flea vacates each cell with unit probability, so that, if the limit is taken, the flea has come to rest in no cell.3 2.3 A Spin of a Pointer on Dial The last proposal fails since the probability distribution (3) does not behave well in the infinite case. We may try to remedy this deficiency by picking a stochastic system whose probability density is already well-adapted to the infinite case. A uniform probability density over infinitely many outcomes is realized by spinning a pointer over a circular dial with an angular scale etched around its circumference. The pointer is perfectly machined so that no angular orientation is favored and the result is read as an angular position in 0o and 360o inclusive, as indicated by the pointer, when it comes to rest. See Figure 2. Figure 2. A Spinning Pointer and Dial 3 If we accelerate the jumping in a supertask, so infinitely many jumps are completed in finite time, the flea will have jumped “off to infinity,” behaving like what is called a “space evader” in Manchak and Roberts (2016, §1.4). 7 So far, the device chooses randomly among too many outcomes: all of the continuum many, real angles in 0o and 360o. The infinite lottery must choose among only countably many outcomes, since they can be mapped one-one to the naturals 1, 2, 3, …. We adapt the device to this case by selecting a uniformly distributed subset of countably infinite angles: the rational angles.4 We spin the pointer until eventually a rational angle is chosen. It is our outcome. There are two problems. First, the outcome of a rational angle is a measure zero event in the uniform measure. That means that it occurs with probability zero; and the probability of success remains zero among any finite number of repetitions. For all practical purposes, it will not happen, no matter how often we spin. Second, we have no way to read the result. The rational angles are densely distributed among the real angles. That means that infinitely many other rational candidates occupy any neighborhood around the outcome, no matter how small the neighborhood. No finite magnification will be sufficient to discriminate these nearby rational angles from the correct outcome. We might attempt a rescue by choosing a discretely spaced set of countable angles, such as {359o, 358o, 357o, … , 2o, 1o, 1/2o, 1/3o, 1/4o, …}. While any outcome in this set can be discriminated from its neighbors with a magnifier of finite magnification, we have lost the assurance that the outcomes are chosen without favor. They are no longer uniformly distributed over the reals. 3. Against the Possibility of an Infinite Lottery Machine The repeated failure of obvious proposals for infinite lottery machines suggests that the problem may be deeper than a mere failure of our powers of imagination. There may be principled reasons behind the failures. Here are two arguments that purport to establish the impossibility of an infinite lottery machine. I explain why both fail. 4 They are uniform in the sense that any two angular segments of the same size host rational angles that can be mapped one-one onto each other; and the entire set of rational angles is mapped back to itself if we shift all the angles by adding any fixed rational angle to them. 8 3.1 The Argument from Probability Spielman (1977, pp. 254-55) admits that he has “not the slightest idea” for how such a random selection could be made. He illustrates the difficulty with the failure of a simple probabilistic scheme that selects a number by translating sequential coin tosses into a binary number. That is, the outcome of sequential coin tosses heads-heads-tails-heads… is recoded as the binary number 1101… For a natural number to be selected, the tosses must terminate finitely. The mechanism that assures this is another sequence of coin tosses, each of which comes after each toss of the original sequence. We terminate when this second series of tosses shows heads for the first time. The probability that the binary number chosen has n or fewer binary digits is just 1 – 1/2n. Hence with probability 1 – 1/2n, the number chosen is less than or equal to 2n. Thus we are near certain that the number will be less than, say, 2100, even though there are infinitely many more possible numbers. This scheme favors smaller numbers. Experimenting with other examples like this, one finds quickly that any ordinary probabilistic mechanism ends up probabilistically privileging smaller numbers; and that may seem inevitable when we recall that a probability distribution must normalize to unity. It looks like we are trying to use physics to overturn a mathematical fact, rather like the doomed attempts to square the circle and trisect the angle. The thought can be captured in the following argument: Argument from Probability No probability distribution can jointly satisfy the normalization condition (1) and the equality of probabilities (2). (mathematical fact) Therefore, no physical machine can realize the distribution. (physical fact) The weakness of the argument from probability is the first mathematical fact. It is true as long as we consider real-valued, countably additive probability measures. The summation in the normalization condition (1) is undefined as a probability for either of a real, merely finitely additive probability distribution or one with non-standard infinitesimal values. If the probability distribution is one of these, the still true mathematical fact of the argument fails to preclude successful operation of the infinite lottery machine. That is, the inference is a fallacy. 9 3.2 Argument from Two Infinite Lottery Machines5 Consider two lotteries. For any outcome on the first, there are only finitely many smaller numbered outcomes on the second, but infinitely many larger numbered outcomes. Therefore the outcome of the second has, with overwhelming probability, the greater number. The same inference, starting with the second lottery machine, concludes that, with overwhelming probability, the outcome on the first has the greater number. Both cannot be true. Therefore an infinite lottery machine is impossible. The fallacy of this argument lies in set theory, prior to consideration of probabilities. Consider all pairs of natural numbers . For any particular value of m, say M, there are infinitely many n>M but only finitely many n, there are infinitely many pairs with n>m and only finitely many with n” requires us to form the union of the sets {n:n0. This contradicts the presumption that a ball only disappears from the urn if a god removes it. With these default rules, the supertask now fails in the second way indicated in Section 7.3. The presumed behavior is inconsistent with the dynamics governing the system. Another way to see the inconsistency is to imagine three versions of the supertask that differ only in the default rule used: “all,” “even” and “odd,” respectively, where “odd” has the 23 natural definition in analogy to “even.” In each version, the default rule is never activated. Therefore the default rule can make no difference to the outcome. Yet it does if the eliminative reasoning is respected. In the case of “all,” the outcome can be any ball number. In the case of “even,” it can only be an even numbered ball. In case of “odd,” it can only be an odd numbered ball. There is a final problem even if the supertask proceeds as intended. It arises if we assume countable additivity for the probability measures used in the ball selection. (In that case, we can consistently assign no probability to the selection of any individual ball.) The probability that one or more of the lowest numbered balls is removed by the god-collectiveN = {godN, godN+1, … , god2N-1} is one half.15 The urn passed to godN-1 has already been acted on by infinitely many god-collectives: god-collectiveN, god-collective2N, god-collective3N, … With a probability of one half, each god-collective removed one or more of the lowest numbered balls. Hence with probability one, computed by a limiting process authorized by countable additivity, infinitely many of the lowest numbered balls have been removed by them; and the urn is empty by the time godN-1 receives it; and it is so for any N. Since successful operation of the lottery requires that the urn never be empty, we have, with probability one, that the lottery fails to operate successfully. 10. A Quantum Mechanical Infinite Lottery Machine Quantum mechanics holds great promise for proposals of infinite lottery machines. Its standard interpretation is the one place in which practical, modern science has irreducibly random processes. It turns out, however, that quantum infinite lottery machines of simple design face difficulties similar to those we have seen. Quantum randomness derives from one process: quantum measurement. We measure a system that is in a superposition of states and in the process, originally known as “collapse of the wave packet,” it randomly adopts one of the states in the superposition. The collapse can be 15 The probability that none of the gods in the collective remove the lowest numbered ball from its urn is . 24 effected in any of the degrees of freedom of a quantum system. Real life quantum random number generators most commonly use photons and exploit collapse in particle spin degrees of freedom, in particle arrival times, in particle positions and in particle numbers. For a recent survey, see Ma et al. (2016). Most of these schemes do not extend especially well to the infinite case. Spin observables adopt finitely many values only. Arrival time and position are each continuous magnitudes, whose outcome must somehow be reduced to a countable infinity. They face the same problems as the spinning pointer machine of Section 2.3. More promising are systems with infinitely many discrete states. One is the superposition of one-photon, two-photon, three-photon, … states |1>, |2>, |3>, … in a coherent superposition, such as is produced by a laser: (6) where α is a complex field amplitude and µ = |α|2 is the mean photon number. When the particle number n is measured, the coherent state collapses to one of the n-photon states |n>, where n is Poisson distributed. Such randomizers are practical and have been built, as described in Applegate et al. (2015). To adapt this randomizer to an infinite lottery, we replace the normalized superposition (6) by an unnormalizable16 superposition |1> + |2> + …. + |n> + …. (7) of n-particle states, |n>. On measurement of the particle number, this state collapses to one of the infinitely many states |n>, without favor to any. In this regard, it conforms with label independence. There are two problems with this scheme.17 First is technical: because the state of (7) is unnormalizable, it is a vector in a vector space, but not in a Hilbert space, which is the structure 16 For a concern that quantum theory likely precludes such unnormalizable states, see Earman (manuscript). 17 This scheme can also be implemented (but with the same two problems) as an infinite superposition of the discrete energy eigenstates of a particle in an infinitely deep potential well; or a harmonic oscillator. 25 upon which ordinary quantum theory is built. The application of the Born rule for the probability of different outcomes is problematic in precisely the same way as is application of probabilities to the infinite lottery. These problems can be solved by liberalizing the mathematical formalism. For the physical ideas behind quantum theory assure us that state (7) will collapse on measurement to one of its components. The second problem cannot be solved by such liberalization. The energy of the n particle state increases linearly with n. Hence the state (7) is associated with an unbounded energy in the following sense: for any finite energy E, there will be only finitely many states in (7) with energy less than E, but infinitely many with energy greater than E. Unbounded energy makes creation of the state dubious and, if constructed, will destroy any localized measurement device. We can attempt a solution by exploiting systems with infinitely many discrete states, whose energies are bounded from above and below. A simple case is the infinitely many s electron orbitals adopted by an excited electron in a stationary state in a hydrogen atom. The energies En of the ns orbitals for n = 1, 2, 3, … coincide with those of the original Bohr theory of the atom of 1913 (following Sommerfeld, 1923, p. 214): (8) for an electron of charge e, mass m and Planck’s constant h. The energies vary from the ground state energy -(2π2me4)/h2 to approaching arbitrarily close to energy 0 as n grows towards infinity. For the infinite lottery machine, we prepare an equally weighted superposition of the ns orbitals for all n, so that each orbital ns corresponds to |n> in (7). Upon measurement the superposition collapses to one of the states |n> as before. There are, once again, problems. First, we still have the technical problem of an unnormalized state (7). Second, for large n, the energy of the orbital is very close to 0, which is the energy at which an electron is liberated from the atom. Hence, the slightest increase in energy from, for example, a thermal fluctuation, will eject the electron from the atom. Infinitely many states in the superposition are within any arbitrarily small energy of this zero energy level. Hence the superposition is very unstable. Third, because the energy of infinitely many of the states are clustered so closely together, measuring the energy of the collapsed state will once again require the troublesome capacity to make indefinitely fine discriminations. 26 The most direct measurement scheme is to allow the excited electron to drop back to the lowest energy ground state n=1 and measure the frequencies of the resulting emitted photon or photons. That measurement will collapse the superposition to one of the infinitely many orbitals undergoing decay. The energy of that orbital is determined by the measured frequencies of the emitted photon or photons. For the simplest case of a single jump to the n=1 ground state, a single photon is emitted with the frequency ν: For the infinitely many large n states, the frequency ν will be almost exactly (2π2me4)/h3 and arbitrarily fine discrimination in measurement of ν will be needed to separate neighboring values of n, as n grows large. Finally, the size of the electron orbitals grows arbitrarily large. Their Bohr atom radii grow with n as (Sommerfeld, 1923, p. 212):18 So the superposition requires that the universe be empty of other atoms, for otherwise Coulomb repulsion from these other atoms’ electrons would confine the original atom’s electron and alter the energy relation (8), so that the electron energy is no longer bounded from above. 11. The Infinite Array An infinite lottery machine requires exotic processes. The best will employ the minimum of them; and this section will describe my best proposal for such a minimum. In the proposal, the processes of selection of the number are separated from those that communicate the number to us. The processes of selection require no supertasks and, with just a little indulgence, can be mapped onto real systems in our universe. 18 A more modern calculation (Johnson, 2007, p. 34), using the Schrödinger wave for the electron orbitals, affirms that the position expectation for the nth s state grows with n2. 27 11.1 The Selection of the Number The system consists of an infinite, two-dimensional array of independent, two-state randomizers. For simplicity, imagine each randomizer as a coin toss with equal probabilities of heads H and tails T: P(H) = P(T) = 1/2. (9) The coins are all flipped at the same moment. Our interest is a countable subset of the row configurations that encode the natural numbers 1, 2, 3, … A simple coding system encodes the number n as a row whose first n terms are H and all remaining terms are T. The number 5 is encoded uniquely as HHHHHTTTTTTTTTTTTT… The outcome of the infinite lottery machine is the natural number encoded in the array in the first row that successfully encodes a natural number, if there is one, as shown in the marked row in Figure 6. Figure 6. An Infinite Array of Binary Randomizers 11.2 Label Independence Label invariance is easily implemented in the lottery machine for each particular configuration of coin toss outcomes. Since each coin can be relabeled as H or T independently of 28 the others, there will always be a relabeling that maps any particular configuration to any other. Thus each has an equal chance. That is not quite enough to assure that the infinite lottery machine favors no particular number. For each number outcome corresponds to an infinite set of the configurations. Ideally we should like to show that a single relabeling of the randomizers can convert the entire set that returns outcome number N to the entire set that returns outcome number N'. A little reflection shows that no single relabeling can do this. There is something enough close to it. Partition the set of configurations that return a definite number according to the row in which the first encoded number appears. Consider one member of the partition: those for which the first encoded number outcome appears in row M. That partition is further divided according to whether the number outcome is 1, 2, ... The set of outcomes in this partition that returns number N can be mapped onto the set that returns number N': all that is needed is a relabeling of the randomizers in row M alone so that the row now encodes N'. All the other rows are left unchanged. Hence, given that the first encoded number appears in row M, label invariance assures us that any of 1, 2, ... are chosen without favor. This lack of favor is unaffected by which row number M happens to be the first row that encodes a definite number. Hence the lottery as a whole is choosing its outcome without favoring any particular number. 11.3 Nonmeasurable Outcomes The probability measure induced by (9) assigns a probability measure to many of the outcomes. For example, according to it, there is a probability 1 of there being infinitely many H in the array overall. However, outcomes of interest to us are almost all of the character of infinitely many trials of a zero probability process. They turn out to be nonmeasurable in this induced measure. This lottery machine may fail to operate successfully if there is no row among the infinitely many that encodes a finite number. To demonstrate that success is a nonmeasurable outcome, we calculate the probability of success in two ways; and get two different answers. First, we calculate by rows and find that success has zero probability. The probability that the first row has any particular combination of H and T is just 1/2∞ = 0. Since there are only a countable infinity of finite numbers, the probability that the first row encodes any finite number 29 at all is a countably infinite sum of these zeros, that is, zero. Hence, with probability one, the first row does not encode a finite number. The same follows for any nominated row. Since the row outcomes are probabilistically independent, the probability that no row encodes a finite number is just the product of these infinitely many ones, which is one. That is, there is probability zero of a row encoding a finite number in the array. Second, we calculate by columns and find the probability of success is one. Consider the outcome HTTTTTT…. which encodes the number 1; and the truncated rows that appear in the first N columns. If this outcome is present in the array, then its first N terms will appear in at least one of the truncated rows. The probability that some nominated row does not contain the truncated number is (1-1/2N)<1. Since the row contents are probabilistically independent, the probability that the truncated number is in none of the infinitely many truncated rows is (1-1/2N)∞ = 0. That is, the truncated number appears with probability one in at least one of the truncated rows of the first N columns. Now take the limit as N goes to infinity. This unit probability is preserved in the limit. With probability one, the row encoding 1 appears at least once in the array. The lottery machine is successful if any row encodes any finite number. Therefore the probability of success is greater than the probability of any row encoding just the number 1. Therefore the probability of success is one. By analogous reasoning, we find other nonmeasurable outcomes. The presence of a row encoding any particular finite number in the array is nonmeasurable. To see this, consider the rectangle of the first M rows and N columns, where M = 2NK and the constant K fixes the side ratio of the rectangle. The probability that the first N terms of a row that encodes some nominated finite number is not a truncated row in the rectangle is (1-1/2N)M = (1-1/2N)2NK. The limit of this probability as the size of the rectangle grows infinitely large is e-K. Hence the probability of at least one row encoding the nominated finite number is (1- e-K). Since 0