The Book of Books

Introduction

The Library of Babel as a self-contained backup memory

THE LIBRARY, as Borges described, has many books.1

Each book has 410 pages, each containing 40 lines, each containing 80 characters. Therefore, each book contains:

410 ppb × 40 lpp × 80 cpl = 1,312,000 cpb

(1)

It’s easy to see that lots of things can be said with this many characters. The Gutenberg Project edition of «Alice’s Adventures in Wonderland» is only about 160-thousand characters, and that includes extraneous text (like Project Gutenberg’s own license, indices, etc.). The numbers closely match rules of thumb for printed text, so it’s reasonable to think of a single tome of THE LIBRARY as being approximately as long as a 410-pages book in regular type.2

In Borges’ research paper / fictional account, there are people who destroy books left and right, but the narrator isn’t fazed: after all, if the axioms about THE LIBRARY3 are true,4 one must remember two things:5

One: The Library is so enormous that all reduction of human origin is infinitesimal. Another: each tome is unique, irreplaceable, but (since the Library is total) there are always several hundreds of thousands of imperfect facsimiles: of works that differ by no more than a single letter or a comma.

Just how many imperfect copies are there, given the working theory of THE LIBRARY?


Let’s imagine any one book that will be called the Original. It’s possible that the last character of the last line of the last page is a period. A «typo» is, then, having any other character in that particular place. There are, then 24 «incorrect» characters for this particular place and therefore, 24 books that differ from the Original only in the last character.

This argument can be extended to and and all of the other 1,311,999 «places» in the book. In other words, there are 24 «incorrect» options for each and every one of the 1,312,000 characters in the book. So, there are 24 almost-perfect books that differ only on the first character, 24 more that differ only on the second, 24 more for the third… Then, the total number of books that differ from the Original in one and only one character, is:6

241312000 ≈ 1.4096... × 101810837 ≈ 10106.257 = T1

(2)

Let’s call this number T1, because it will indicate how many books have exactly 1 typo (compared to the Original). This mighty number, written complete, is about 1,810,000 digits long! In other words, imagine a book from THE LIBRARY that contains only the number 9 repeated in each and every one of its pages.7 That mighty number is still way, way smaller than the number of nearly-perfect copies of the Original. How much smaller? Roughly speaking, adding a single digit to a number generally means it grows by a single order of magnitude. The number of copies is 500 thousand orders of magnitude larger than the largest number that can fit inside a book from THE LIBRARY. In comparison, the known universe’s diameter8 is larger than a Planck unit of length9 by only 50 orders of magnitude, give or take.

But wait! This calculation is inspired by the fact that there are nearly perfect copies of the Original, and whose mistakes are so small as to make them trivial. One typo in a 410-page book is certainly not enough to make it unreadable.

What about two typos? Arguably, having two typos in a book is also not enough to render the book indistinguishable from the Original.10 How many two-typo books are there? What’s the value of T2?

For every 1-typo book, there will be T1 books that differ from it in only one character, but only one of them will be the Original. Then, the number of 2-typo books will be:

T2 = T1 × (T1 − 1) = T12 − T1

(3)

Without even reaching for a calculator, I can say that in the above expression, the T12 dominates. And how much! It’s approximately 1.98 × 103621674. Subtracting T1 is nothing. Given these magnitudes, it may be easier to remember that T2 ≈ T12

The number of 3-typo books is again calculated by reasoning that it comes from making almost all changes to a 2-typo book, except for the one that would «reduce» it to a 1-typo book. There are T1 − 1 such changes, therefore…

T3 = (T1 − 1) × T2 = (T1 − 1) × (T12 − T1) = T1(T1−1)2

(4)

Again, the number with the largest exponent dominates all others (T13 in this case). The exact value for an n-typo book can be expressed like so:

Tn = (T1 − 1) × Tn − 1 = T1(T1−1)n − 1

(5)

We can see that the pattern holds: in general the number of books with n typos can be approximated by T1n and so it grows exponentially. Five typos scattered regularly in a 410-page book are hardly enough to render a book unreadable—or unrecognizable—but the total number of books with only 5 typos easily dwarfs the number of all information humanity has ever produced.11

How many typos are needed to render a text completely unreadable? That question will remain open for now, because a complete answer will require refinement of ideas, concepts and tools beyond the scope of this small fragment. It’s obvious, for instance, that if I were to insert a typo in 50% of all characters in «Hamlet» it would most likely become unreadable, but just using that number and calling it a day seems cheap. For once, it’s not constructive and for twice, it’s completely arbitrary. At 0% typos the Original is readable and at 100% typos it’s unrecognizable. It stands to reason that the «turning point» must be somewhere in between, but this is not a continuous problem, since there’s no way to have, say half a typo. Even defining what that «turning point» is will prove to be very much non-trivial. Therefore, the question of how many typos are needed to render a text unrecognizable from the Original will be left open for future curious minds.

The Library of Babel as repository of tangential truths

But, just how many are true typos?

There exists a book in THE LIBRARY that is a perfect copy of «THE TRAGEDY OF HAMLET, PRINCE OF DENMARK» followed by nothing but blank characters.12 There exists a book that is almost the same except for the fact that every occurrence of the string HAMLET_13 has instead the string OPHELIA. The play reads a bit weird, but it’s understandable nonetheless. Does this book qualify as a typo-riddled version of «Hamlet» or is it a new book in and of itself? I have no good answer.

However, this small exercise speaks to us of another kind of books existing in THE LIBRARY: books14 that are very related to other books through similar concepts. The book I mentioned in the above paragraph is an example of such: a book discussing the tragedy of Ophelia. But there’s more to it.

These books might discuss things that are false, or have never been imagined by any human, ever, but are hinted at from our existing books in this Earth. For instance, there’s the Hundred Years’ War, but there are books on the Ninety-nine Years’ War, the Ninety-eight Years’ War… None of those conflicts have actually happened on this Earth15 but they are written down in a book somewhere in THE LIBRARY in quite the detail.

Another example: the Clay Institute of Mathematics has offered a prize for whoever proves or disproves the Hodge conjecture, but nothing is said of the Aaron conjecture, the Aaronson Conjecture, the Smith Conjecture, the Jones Conjecture or the Williams Conjecture. Maybe THE LIBRARY even contains proof of all of them.

These I will call Tangential Truths: the ideas that don’t exist, but are hinted at through texts and ideas that do exist on this Earth. The Tangential Truths are those that are revealed to us not directly,16 but only through preexisting texts.

The Tangential Truths are not the same as truths that are yet unknown, like a mathematical proof or scientific result. The difference is that the latter are susceptible to be found through other means, and not as a literary epiphany from the Platonic Ideal that is THE LIBRARY.17 The Tangential Truths are what come to mind from imagination, trying to substitute a single word in a title and thinking what that could be. A single, non-ambiguous definition of Tangential Truths will be impossible beyond mere demonstration.

The Book of Books

This Book of Books is merely one in myriads, but it contains a few of these Tangential Truths. The Book of Books is not mine, for I cannot claim to be a Creator that large. Instead, I’m merely a prophet, a head that rearranges characters and words and puts down in paper what he finds. Others might be there, doing the same.

This Book of Books is one of many Books of Books, all describing books of Tangential Truths that are in THE LIBRARY. There exists a book that correctly catalogues all of these Books of Books, but it’s impossible for us to even understand what it is, for it would require an even greater Epiphany. It’s difficult to even comprehend the Tangential Truths, comprehending the Tangential Truths derived from they would require a demiurgic act beyond the abilities of mortal men and women.

Through abstraction of Truths and their Tangential Truths, one becomes closer and closer to the Creator Himself. I don’t know if such a thing should even be attempted. After all, before THE LIBRARY there was THE TOWER, and It was doomed to failure by decree from Above. THE LIBRARY exists and alone should suffice, lest we once again reach what should not be reached..


  1. And here’s my entry for the «Understatement of the Millenium» award.

  2. My printed copy of Herman Hesse’s «The Glass Bead Game» has 520 pages, 38 lines per page and the first line I picked had 60 characters. Although the type has no fixed width, it’s reasonable that a random line picked somewhere in the middle cannot deviate that much from others and it’s more likely than not to lie close to the average.

  3. Summarized, these say that (a) THE LIBRARY exists ab aeterno; and (b) the books are made of only 25 characters. There’s also the observation that no inhabitant of THE LIBRARY has ever seen two identical books.

  4. The terminology is Borges’, not mine. Of course, if they are axioms, they can be presumed to either be evident in themselves and/or impossible to prove. The phrase “if the axioms are true” is regularly useless, because their axiomatic nature and their truth are obviously intertwined.

  5. This translation is mine and 99% sure to be imperfect.

  6. Try it on WolframAlpha

  7. It’s worth noting that the books, as described by Borges, contain only alphabetic characters without digits of any kind, so this particular book is nowhere to be found in THE LIBRARY. It’s also worth noting that there could be some language in which a letter—say, «N»—stands for the number 9 and then a tome containing only the letter «N» represents this gargantuan number.

  8. About 8.8 × 1026 meters.

  9. About 1.6 × 10 − 35 meters

  10. One could imagine a strange language that admits no orthographical/spelling errors anywhere, but I resist such a thought: any beings that evolved such language might be incredibly fragile to unexpected changes and that seems to go against real living beings, characterized among other things by an ability to adapt to their environment.

  11. It’s hard to find hard numbers on this, mostly because «the total amount of information ever produced» is a very tricky thing to even define. However, numbers as «low» in our scale as T5 are already large enough to surpass most comparisons. Most estimates say that the total size of the Internet is better measured in zettabytes, but that is only 8 × 1021 bits of information. Just by looking at the exponents, we know that there’s more books with 5 typos than the total number of books produced by the whole world.

  12. The Project Gutenberg edition is about 192 thousand characters long.

  13. Notice the space at the end

  14. Let’s assume them typo-free for now.

  15. That we know of.

  16. As in, they are not product of scientific research, human ingenuity or other such processes.

  17. Borges also said: «I have always imagined that Paradise will be a kind of library» and THE LIBRARY might be just that. Some might argument that the True Platonic Ideal would be an infinite library and not merely a Very Large one. Others could argue that the True Platonic Ideal would be a single infinite book. I choose neither.