In a previous post, I wrote that complexity and specification never spontaneously arise unassisted from the chaos of a random non-intelligent source. You’d think that would be a reasonable statement that anyone could agree with, but from time to time I encounter people who seem to believe not only that chance is a causal agent rather than merely an attribute of a given scenario (e.g., “The Universe happened by chance”) but also that chance combined with time results in anything we can assume from the standpoint of naturalist philosophy. Consequently, what seems reasonable ends up being dismissed as superstition, and the justification given is often some variation of the Infinite Monkey Theorem, which usually goes unchallenged in casual conversation because most people (myself included) aren’t fluent enough in probability theory to object.
About five years ago, I was browsing through an introduction to applied probability and found the following example in a chapter on repeated independent trials (Roberts, Richard A. “Independence and Repeated Trials.” An Introduction to Applied Probability, Addison-Wesley, 1992, pp. 88–89). While still not succinct enough for a casual conversation, it’s probably concise enough for someone with good math-ninja skills (like John Lennox) to present in a formal debate. It’s very compelling.
Example 3.5.2
Suppose a billion monkeys type on word processors at a rate of 10 symbols per second. Assume that the word processors produce 27 symbols, namely, 26 letters of the English alphabet and a space. These monkeys type for 10 billion years. What is the probability that they can type the first sentence of Lincoln’s “Gettysburg Address”?
Fourscore and seven years ago our fathers brought forth on this continent a new nation conceived in liberty and dedicated to the proposition that all men are created equal.
Solution: If we counted correctly, there are 168 symbols in this sentence. Let N be the number of symbols typed per monkey in  years. A sample space of possible letter sequences typed by each monkey is
 years. A sample space of possible letter sequences typed by each monkey is
 (N times)
   (N times)
where X is the set of 27 symbols. Now define s as a particular sequence that is m symbols long:

Let  be the event that s appears at the
 be the event that s appears at the  index in a sequence of length N:
 index in a sequence of length N:
 = {
 = { (
     ( occurs on index k)}
 occurs on index k)}
In other words,  is the event that a particular sequence, s, which is m symbols long, appears in one monkey’s sequence and ends on index k. The event of interest is A, given by
 is the event that a particular sequence, s, which is m symbols long, appears in one monkey’s sequence and ends on index k. The event of interest is A, given by

In our case, m = 168 and N is the number of symbols typed by one monkey in  years. Thus by Boole’s Inequality,
 years. Thus by Boole’s Inequality,
![P delim{[}{A}{]} = P {[}{bigcup{k=m}{N}{A_{k}}}{]} <= {sum{k=m}{N}{P delim{[}{A_k}{]}}  }. P delim{[}{A}{]} = P {[}{bigcup{k=m}{N}{A_{k}}}{]} <= {sum{k=m}{N}{P delim{[}{A_k}{]}}  }.](http://www.tinymoe.com/bigbookomaths/wp-content/plugins/wpmathpub/phpmathpublisher/img/math_972.5_c63faacedeb368601cea21f2f04cea7a.png)
Now the probability of the event  is the number of outcomes of event
 is the number of outcomes of event  divided by the total number of outcomes. The total number of outcomes is
 divided by the total number of outcomes. The total number of outcomes is  . The number of outcomes that produce event
. The number of outcomes that produce event  is
 is  . The exponent is
. The exponent is  because there are m positions specified in
 because there are m positions specified in  . The remaining
. The remaining  are arbitrary. Thus
 are arbitrary. Thus
![P delim{[}{A_k}{]} = {27^{N-m}}/{27^N} = 1/{27^m}. P delim{[}{A_k}{]} = {27^{N-m}}/{27^N} = 1/{27^m}.](http://www.tinymoe.com/bigbookomaths/wp-content/plugins/wpmathpub/phpmathpublisher/img/math_978_d89158a86bbb4131aa07e70669e6a4cf.png)
And so
![P delim{[}{A}{]} <= {sum{k=m}{N}{(1/{27^m})}} = {N - m + 1}/{27^m} approx N/{27^m} P delim{[}{A}{]} <= {sum{k=m}{N}{(1/{27^m})}} = {N - m + 1}/{27^m} approx N/{27^m}](http://www.tinymoe.com/bigbookomaths/wp-content/plugins/wpmathpub/phpmathpublisher/img/math_961_22373e57c412844da807e684936423b4.png)
since N >> m. Thus ![P delim{[}{A}{]} P delim{[}{A}{]}](http://www.tinymoe.com/bigbookomaths/wp-content/plugins/wpmathpub/phpmathpublisher/img/math_991.5_adab2241900c4aac5f4b32481f14964e.png) is bounded above by
 is bounded above by  ,  where N is the number of trials produced by a single monkey. This is the probability bound for
,  where N is the number of trials produced by a single monkey. This is the probability bound for ![P delim{[}{A}{]} P delim{[}{A}{]}](http://www.tinymoe.com/bigbookomaths/wp-content/plugins/wpmathpub/phpmathpublisher/img/math_991.5_adab2241900c4aac5f4b32481f14964e.png) for one monkey. For
 for one monkey. For  monkeys, we can think of forming one long sequence by concatenating
 monkeys, we can think of forming one long sequence by concatenating  sequences so that N becomes the number of trials produced by
 sequences so that N becomes the number of trials produced by  monkeys. Thus
 monkeys. Thus ![P delim{[}{A}{]} P delim{[}{A}{]}](http://www.tinymoe.com/bigbookomaths/wp-content/plugins/wpmathpub/phpmathpublisher/img/math_991.5_adab2241900c4aac5f4b32481f14964e.png) is bounded above by
 is bounded above by
![P delim{[}{A}{]} P delim{[}{A}{]}](http://www.tinymoe.com/bigbookomaths/wp-content/plugins/wpmathpub/phpmathpublisher/img/math_991.5_adab2241900c4aac5f4b32481f14964e.png) 
 
The probability of producing a specified sequence of English text 168 symbols long is bounded above by  . No reasonable model for the source of the “Gettysburg Address” would propose a team of monkeys as author.
. No reasonable model for the source of the “Gettysburg Address” would propose a team of monkeys as author.
Actually, there are 171 characters (minus the period), and the string should be written all in upper or lower case, but the point of the exercise is still valid. Roberts calculates the probability of the string occurring anywhere (from index position 168 to the last typed character) over the total number of characters a single hypothetical monkey could type in 10 billion years (N characters). He then uses Boole’s Inequality to establish an upper bound for that probability (i.e., the probability is no higher than the right side of that inequality) and then scales the scenario up by  (1 billion monkeys). The denominator in the last approximation is a power of ten to make it more readable, and
 (1 billion monkeys). The denominator in the last approximation is a power of ten to make it more readable, and  (2.95e+240) is for all intents and purposes close enough to
 (2.95e+240) is for all intents and purposes close enough to  (1.0e+240). (Anyone who’s seriously thought about the origin of information has likely already taken note that scenarios like the Infinite Monkey Theorem assume an infinite event space encompassing all probabilities/possibilities.)
 (1.0e+240). (Anyone who’s seriously thought about the origin of information has likely already taken note that scenarios like the Infinite Monkey Theorem assume an infinite event space encompassing all probabilities/possibilities.)
Roberts then adds, “A similar example of repeated independent trials used as a mechanism for obtaining order from disorder is chemical evolution. Because of its philosophical implications, however, the example is more controversial.”
Controversial! Well, I guess I’ll have to reproduce that example, too ….
