The definition and construction of the surreals is due to John Horton Conway. They were introduced in Donald Knuth's 1974 book Surreal Numbers: How Two Ex-Students Turned on to Pure Mathematics and Found Total Happiness. This book is a mathematical novelette, and is notable as one of the rare cases where a new mathematical idea was first presented in a work of fiction. In his book, which takes the form of a dialogue, Knuth coined the term surreal numbers for what Conway had simply called numbers originally. Conway liked the new name, and later adopted it himself. Conway then described the surreal numbers and used them for analyzing games in his 1976 book On Numbers and Games.
Constructing surreal numbers
A form is a pair of sets, called its left setL and its right setR; it is written { L | R }. (When L and R are given as lists of elements, the braces around them can be omitted.) If there is no combination of x in L and y in R such that x≥y, the form is numeric. Either or both of L and R may be the empty set; without knowing anything about the potential elements of L and R, or about the comparison operation ≥, one can exhibit the numeric form { { } | { } }, also written { | }.
Construction Rule
A form { L | R } is numeric if and only if there is no combination of and such that xR ≤ xL.
The numeric forms are placed in equivalence classes; each such equivalence class is a surreal number. The equivalence class containing { | } is labeled 0; in other words, { | } is a form of the surreal number 0. The elements of L and R are drawn from the universe of the surreal numbers (not of forms, but of their equivalence classes). The recursive definition of surreal numbers is completed by defining comparison and equivalence:
Comparison Rule
Given numeric forms x = { XL | XR } and y = { YL | YR }, x ≤ y if and only if:
there is no such that y ≤ xL, and
there is no such that yR ≤ x.
A comparison y ≤ c between a form y and a surreal number c is performed by choosing a form z from the equivalence class c and evaluating y ≤ z; and likewise for c ≤ x and for comparison b ≤ c between two surreal numbers.
A well-ordering relationship must have the property that x = y (i. e., x ≤ y and y ≤ x are both true) only when x and y are the same object. This is not the case for surreal number forms, but is true by construction for surreal numbers.
Equivalence Rule
Two surreal number forms x and y are forms of the same number (lie in the same equivalence class) if and only if both x ≤ y and y ≤ x.
This group of definitions is recursive, and requires some form of mathematical induction to define the universe of objects (forms and numbers) that occur in them. The only surreal numbers reachable via finite induction are the dyadic fractions; a wider universe is reachable given some form of transfinite induction.
Induction Rule
There is a generation S0 = { 0 }, in which 0 consists of the single form { | }.
Given any ordinal numbern, the generation Sn is the set of all surreal numbers that are generated by the construction rule from subsets of .
The base case is actually a special case of the induction rule, with 0 taken as a label for the "least ordinal". Since there exists no Si with i < 0, the expression is the empty set; the only subset of the empty set is the empty set, and therefore S0 consists of a single surreal form { | } lying in a single equivalence class 0.
The first iteration of the induction rule produces the three numeric forms { | 0 } < { | } < { 0 | } (the form { 0 | 0 } is non-numeric because 0≤0). The equivalence class containing { 0 | } is labeled 1 and the equivalence class containing { | 0 } is labeled −1. These three labels have a special significance in the axioms that define a ring; they are the additive identity (0), the multiplicative identity (1), and the additive inverse of 1 (−1). The arithmetic operations defined below are consistent with these labels.
For every i < n, since every valid form in Si is also a valid form in Sn, all of the numbers in Si also appear in Sn (as supersets of their representation in Si). (The set union expression appears in our construction rule, rather than the simpler form Sn-1, so that the definition also makes sense when n is a limit ordinal.) Numbers in Sn that are a superset of some number in Si are said to have been inherited from generation i. The smallest value of α for which a given surreal number appears in Sα is called its birthday. For example, the birthday of 0 is 0, and the birthday of −1 is 1.
A second iteration of the construction rule yields the following ordering of equivalence classes:
Comparison of these equivalence classes is consistent irrespective of the choice of form. Three observations follow:
S2 contains four new surreal numbers. Two contain extremal forms: { | −1, 0, 1 } contains all numbers from previous generations in its right set, and { −1, 0, 1 | } contains all numbers from previous generations in its left set. The others have a form that partitions all numbers from previous generations into two non-empty sets.
Every surreal number x that existed in the previous "generation" exists also in this generation, and includes at least one new form: a partition of all numbers other thanx from previous generations into a left set (all numbers less than x) and a right set (all numbers greater than x).
The equivalence class of a number depends only on the maximal element of its left set and the minimal element of the right set.
The informal interpretations of { 1 | } and { | −1 } are "the number just after 1" and "the number just before −1" respectively; their equivalence classes are labeled 2 and −2. The informal interpretations of { 0 | 1 } and { −1 | 0 } are "the number halfway between 0 and 1" and "the number halfway between −1 and 0" respectively; their equivalence classes are labeled 1/2 and −1/2. These labels will also be justified by the rules for surreal addition and multiplication below.
The equivalence classes at each stage n of induction may be characterized by their n-complete forms (each containing as many elements as possible of previous generations in its left and right sets). Either this complete form contains every number from previous generations in its left or right set, in which case this is the first generation in which this number occurs; or it contains all numbers from previous generations but one, in which case it is a new form of this one number. We retain the labels from the previous generation for these "old" numbers, and write the ordering above using the old and new labels:
−2 < −1 < −1/2 < 0 < 1/2 < 1 < 2.
The third observation extends to all surreal numbers with finite left and right sets. (For infinite left or right sets, this is valid in an altered form, since infinite sets might not contain a maximal or minimal element.) The number { 1, 2 | 5, 8 } is therefore equivalent to { 2 | 5 }; one can establish that these are forms of 3 by using the birthday property, which is a consequence of the rules above.
Birthday property
A form x occurring in generation n lies in a number inherited from an earlier generation i < n if and only if there is some number in Si that is greater than all elements of the left set of x and less than all elements of the right set of x. If x is inherited from any generation earlier than n, there is a least such i, and exactly one number c with this least birthday i lies between the left and right sets of x. x is a form of this c, i. e., it lies in the equivalence class in Sn that is a superset of the representation of c in generation i.
Surreal arithmetic
The addition, negation (additive inverse), and multiplication of surreal number formsx = { XL | XR } and y = { YL | YR } are defined by three recursive formulas. Negation is simplest:
Negation
where .
This formula involves the negation of the surreal numbers appearing in the left and right sets of x, which is to be understood as the result of choosing a form of the number, evaluating the negation of this form, and taking the equivalence class of the resulting form. This only makes sense if the result is the same irrespective of the choice of form of the operand. This can be proven inductively using the fact that the numbers occurring in XL and XR are drawn from generations earlier than that in which the form x first occurs, and observing the special case:
-0 = - { | } = { | } = 0.
The definition of addition is also a recursive formula:
Addition
where .
This formula involves sums of one of the original operands and a surreal number drawn from the left or right set of the other. These are to be understood as the result of choosing a form of the numeric operand, performing the sum of the two forms, and taking the equivalence class of the resulting form. This only makes sense if the result is the same irrespective of the choice of form of the numeric operand. This can also be proven inductively with the special cases:
0 + 0 = { | } + { | } = { | } = 0
x + 0 = x + { | } = { XL + 0 | XR + 0 } = { XL | XR } = x
0 + y = { | } + y = { 0 + YL | 0 + YR } = { YL | YR } = y
(The latter two cases are of course themselves proven inductively.)
The recursive formula for multiplication contains arithmetic expressions involving the operands and their left and right sets, such as the expression that appears in the left set of the product of x and y. This is to be understood as the set of surreal numbers resulting from choosing one number from each set that appears in the expression and evaluating the expression on these numbers. (In each individual evaluation of the expression, only one number is chosen from each set, and is substituted in each place where that set appears in the expression.)
This depends, in turn, on the ability to (a) multiply pairs of surreal numbers drawn from the left and right sets of x and y to get a surreal number, and negate the result; (b) multiply the surreal number formx or y and a surreal number drawn from the left or right set of the other operand to get a surreal number; and (c) add the resulting surreal numbers. This again involves special cases, this time containing 0 = { | }, the multiplicative identity 1 = { 0 | }, and its additive inverse -1 = { | 0 }.
Multiplication
xy & = { X_L | X_R } { Y_L | Y_R }
& = left{ X_L y + x Y_L - X_L Y_L, X_R y + x Y_R - X_R Y_R | X_L y + x Y_R - X_L Y_R, x Y_L + X_R y - X_R Y_L right}
end{align}
It can be shown that these formulas are consistent, in the sense that:
addition and negation are defined recursively in terms of "simpler" addition and negation steps, so that operations on numbers with birthday n will eventually be expressed entirely in terms of operations on numbers with birthdays less than n;
multiplication is defined recursively in terms of additions, negations, and "simpler" multiplication steps, so that the product of numbers with birthday n will eventually be expressed entirely in terms of sums and differences of products of numbers with birthdays less than n;
as long as the operands are well-defined surreal number forms (each element of the left set is less than each element of the right set), the results are again well-defined surreal number forms;
when forms are gathered into equivalence classes using the "birthday rule", the result of negating x or adding or multiplying x and y does not depend on the choice of form of x and y; and
these operations obey the associativity, commutativity, additive inverse, and distributivity axioms in the definition of a field, with additive identity 0 = { | } and multiplicative identity 1 = { 0 | }.
With these rules one can now verify that the numbers found in the first few generations were properly labeled. The construction rule is repeated to obtain more generations of surreals:
where a and b are integers and 0 ≤ b < n.
The class of all surreal numbers that are generated in some Sn for finite n may be denoted as S* = . Since every Sn is well-ordered, so is S*; in particular, one may form the three classes S0 = { 0 }, S+ = , and S- = , and state that S* is the union of these three classes. No individual Sn is closed under addition and multiplication (except S0), but S* is; it is the subring of the rationals consisting of all dyadic fractions.
At an appropriate stage of transfinite induction, the surreal numbers may be expected to form a category on which the addition and multiplication operations (as well as the surreal construction step) are closed, and in which the multiplicative inverse of every nonzero number can be found. Assuming that one can find such a class, the surreal numbers, with their ordering and these algebraic operations, constitute an ordered field, with the caveat that they do not form a set but a proper class. In fact, it is a very special ordered field: the biggest one. Every other ordered field can be embedded in the surreals. (See also the definition of rational numbers and real numbers.)
"To Infinity ..."
Let there be an ordinal ω greater than the natural numbers, and define Sω as the set of all surreal numbers generated by the construction rule from subsets of S*. (This is the same inductive step as before, since the ordinal number ω is the smallest ordinal that is larger than all natural numbers; however, the set union appearing in the inductive step is now an infinite union of finite sets, and so this step can only be performed in a set theory that allows such a union.) A unique infinitely large positive number occurs in Sω:
ω = { S* | } = { 1, 2, 3, 4, ... | }.
Sω also contains objects that can be identified as the rational numbers. For example, the ω-complete form of the fraction 1/3 is given by:
1/3 = { y in S*: 3y < 1 | y in S*: 3y > 1 }.
The product of this form of 1/3 with any form of 3 is a form whose left set contains only numbers less than 1 and whose right set contains only numbers greater than 1; the birthday property implies that this product is a form of 1.
Not only do all the rest of the rational numbers appear in Sω; the remaining finite real numbers do too. For example
The only infinities in Sω are ω and -ω; but there are other non-real numbers in Sω among the reals. Consider the smallest positive number in Sω:
.
This number is larger than zero but less than all positive dyadic fractions, and therefore an infinitesimal number; it is often labeled ε. The only infinitesimals in Sω are ε and -ε, whose ω-complete forms differ from the ω-complete form of 0 only in that 0 is included in their left set and right set respectively. More generally, for every dyadic fraction x in S*, there are distinct equivalence classes x, x+ε, and x-ε in Sω; the reader may verify, using the rule for surreal addition, that x+ε is an accurate label for the element of Sω given by a form with x in its left set and all dyadic fractions greater than x in its right set.
One can determine the relationship between ω and ε by multiplying them to obtain:
ω · ε = { ε · S+ | ω · S+ + S* + ε · S* }.
This expression is only well-defined in a set theory which permits transfinite induction up to . In such a system, one can demonstrate that all the elements of the left set of ω · ε are positive infinitesimals and all the elements of the right set are positive infinities, and therefore ω · ε is the oldest positive finite number, i. e., 1. Consequently,
1/ε = ω.
Some authors systematically use ω−1 in place of the symbol ε.
Contents of Sω
Given any x = { L | R } in Sω, either:
L and R are both empty, in which case x = 0; or
R is empty and there exists some integer n≥0 which is greater than every element of L, in which case x equals the smallest such integer n; or
R is empty and there exists no integer n which is greater than every element of L, in which case x equals +ω; or
L is empty and there exists some integer n≤0 which is less than every element of R, in which case x equals the largest such integer n; or
L is empty and there exists no integer n which is less than every element of R, in which case x equals -ω; or
L and R are both non-empty, and:
there exists some dyadic fraction y that is "strictly between" L and R (greater than all elements of L and less than all elements of R), in which case x equals the oldest such dyadic fraction y; or
there exists no y strictly between L and R, but there is some dyadic fraction that is greater than or equal to all elements of L and less than all elements of R, in which case x equals y+ε; or
there exists no y strictly between L and R, but there is some dyadic fraction that is greater than all elements of L and less than or equal to all elements of R, in which case x equals y-ε; or
every dyadic fraction is either greater than some element of R or less than some element of L, in which case x is some real number that has no representation as a dyadic fraction.
This construction of the real numbers differs from the Dedekind cuts of standard analysis in that it starts from dyadic fractions rather than general rationals and naturally identifies each dyadic fraction in Sω with its forms in previous generations. The rationals are not an identifiable stage in the the surreal construction; they are merely the subset Q of Sω containing all elements x such that xb = a for some a and some nonzero b, both drawn from S*. By demonstrating that Q is closed under individual repetitions of the surreal arithmetic operations, one can show that it is a field; and by showing that every element of Q is reachable from S* by a finite series (no more than two, actually) of arithmetic operations including multiplicative inversion, one can show that Q is strictly smaller than the subset of Sω identified with the reals.
"... and Beyond."
Continuing to perform transfinite induction beyond Sω produces more ordinal numbers α, each represented as the largest surreal number having birthday α. (This is essentially a definition of the ordinal numbers resulting from transfinite induction.) The first such ordinal is ω+1 = { ω | }. There is another positive infinite number in generation ω+1:
ω−1 = { 1, 2, 3, 4, ... | ω }.
It is important to observe that the surreal number ω−1 is not an ordinal; the ordinal ω is not the successor of any ordinal. This is a surreal number with birthday ω+1, which is labeled ω−1 on the basis that it coincides with the sum of ω = { 1, 2, 3, 4, ... | } and −1 = { | 0 }. Similarly, there are two new infinitesimal numbers in generation ω+1:
At a later stage of transfinite induction, there is a number larger than ω+k for all natural numbers k:
2ω = ω + ω = { ω+1, ω+2, ω+3, ω+4, ... | }
This number may be labeled ω + ω both because its birthday is ω + ω (the first ordinal number not reachable from ω by the successor operation) and because it coincides with the surreal sum of ω and ω; it may also be labeled 2ω because it coincides with the product of ω = { 1, 2, 3, 4, ... | } and 2 = { 1 | }. It is the second limit ordinal; reaching it from ω via the construction step requires a transfinite induction on . This involves an infinite union of infinite sets, which is a "stronger" set theoretic operation than the previous transfinite induction required.
Note that the conventional addition and multiplication of ordinals does not always coincide with these operations on their surreal representations. The sum of ordinals 1 + ω equals ω, but the surreal sum is commutative and produces 1 + ω = ω + 1 > ω. The addition and multiplication of the surreal numbers associated with ordinals coincides with the natural sum and natural product of ordinals.
Just as 2ω is bigger than ω+n for any natural number n, there is a surreal number ω/2 that is infinite but smaller than ω−n for any natural number n.
ω/2 = { S* | ω − S* }
where x − Y = { x − y | y in Y }. It can be identified as the product of ω and the form { 0 | 1 } of 1/2. The birthday of ω/2 is the limit ordinal 2ω.
Labels for the surreal numbers that arise in later stages of transfinite induction are determined by reference to the rules of ordinal arithmetic and of surreal exponentiation.
Base 2 Exponentiation
The base 2 exponential of a surreal number x, denoted 2x, is defined as follows:
* 2x+y = 2x · 2y. This leads to implicit definitions for exponentials of negative and dyadic fractional surreals:
** 2−x = (2x)−1 (where (y)−1 is the unique solution z of y·z=1 restricted to y,z > 0).
** (where is the unique solution z of z·z=y restricted to y,z > 0).
The existence and uniqueness of these solutions is challenging to prove, and provides the connection between the surreal number construction and the real-closed fields of model theory.
* When the birthday α of the exponent x is a limit ordinal, the value of 2x is determined by reference to its α-complete form, and is computed differently depending on whether the left set or right set of this form has an extremal member. (They cannot both have extremal members; if they did, some number with birthday < α would lie between them. It is possible for neither the left set nor the right set to have an extremal member, as in the case where α=ω and x is an irrational number.)
** If the right set of the α-complete form of x has a least member x0, then . (The value of can be determined, since x0 has an earlier birthday.) The quantity z=x0−x is a positive infinitesimal; its exponetial 2−z is calculated by using its Taylor series, which alternates between negative and positive terms. Multiply by 2−z to get 2x.
** If the left set of the α-complete form of x has a greatest member x0, then . This time z=x−x0 is a positive infinitesimal; multiply by the multiplicative inverse of 2−z to get 2x.
** If neither the left set nor the right set of the α-complete form of x has an extremal member, then for any form { XL | XR } of x. (This formula applies to the case where x is the limit ordinal α or its negation, as well as to numbers resulting from "Conway cuts".)
The use of the Taylor series of 2−z = exp(−(ln 2)·z) ensures that 2ε · 2−ε = 1 and so forth. The definition is phrased in terms of 2−z for positive infinitesimal z only because it results in a Taylor series whose terms alternate between positive and negative (and are of strictly decreasing magnitude because zk+1 < czk for all real c). Truncating the series at order zk always produces an overestimate for even k and an underestimate for odd k. This results in a direct construction of left and right sets of the "limit" of the series, without use of deep results about limits and convergence.
The exponential map x → 2x is of great interest in proving facts about the surreals. It has the fixed point ω; this is its only fixed point among numbers with birthday less than ε0 (epsilon nought). If one starts from the first ordinal greater than ω and applies the exponential map repeatedly, one obtains a series of larger ordinals:
ω+1 → ω·2 → ω2 → ωω → → ... →
(see Knuth's up-arrow notation). Starting instead from the non-ordinal ω−1, one obtains a similar pattern of exponentiation for infinite non-ordinals:
ω−1 → ω·1/2 → (→ ωε → ) → → → ...
Each step of this sequence produces the smallest infinite number whose birthday is the ordinal in the same step of the previous sequence, with the exception of the two steps in parentheses; they are the smallest infinite numbers in generations ω3 and ω4 respectively.
In a sufficiently strong axiomatization of set theory, one can form generations of surreals labeled by a countable series of countable ordinals:
, , , , ...
Other fields of interest occur as subsets of these generations; for instance, the rational expressions in ω, which form a classical example of a non-Archimedean field, occur as a subset of generation ωω, much as the rational numbers occur as a subset of generation ω. But for every k≥2, generation contains infinitesimal elements much smaller than the multiplicative inverse of its ordinal. Consequently, terminating transfinite induction before any particular results in a ring that is also closed under the construction step, but not in a field.
However, the countable union of the sets for natural numbers k, which may be denoted , is closed under both the surreal construction step and the arithmetic operations, including the multiplicative inverse. (The existence of the multiplicative inverse of any nonzero number in Σ is provable using the fact that the base 2 exponential maps Σ invertibly onto its positive half Σ+. This results in the formula