The surreal numbers

John Horton Conway's book On Numbers and Games introduces a formulation of the notion of number which falls naturally out of the approach he took to game theory. This arises more naturally than the classical real numbers and extends them gracefully to embrace diverse infinities and infinitesimals. Wikipedia has an entry for the surreal numbers which provides a nice clear description of what you get out of Conway's definitions.

Preamble

It is possible to describe the surreals as mappings from ordinals to a two-member set, to be thought of as {+,−}. To interpret a mapping ({+,−}: f |n) with n ordinal as a surreal, first identify the maximal ordinal m subsumed by n for which ({+}:f|m) or ({−}:f|m), i.e. the first m entries in f are identical. We then interpret (:f:m) as m if ({+}:f|m) or as −m if ({−}:f|m); if m is 0 these both interpret (:f:m) as 0. If n > m, we know f(m) is the member of {+,−} that isn't in (|f:m); and interpret f as (:f:m) + sum(:f(i+m)/power(i,2) ←i:), subject to a suitable extension of power to transfinite ordinals. But I prefer to start from Conway's formulation, albeit transposed from his notation (which is optimized for discussion of games) to mine (which, for all its deficiencies, is at least consistent across everything I write).

Conway's formulation

Conway discovered the surreals in the course of studying the theory of deterministic two-player games: the surreals are a particular class of game. Actually, it'd more closely match normal understanding to refer to game positions: in any position of a game, for each of the players, we can consider the moves open to that player and, consequently, the set of positions that may result from that player's move. The game position may then be characterized by the two sets of positions that may result from the respective players' moves. If either player's set is empty, and it is that player's turn to move, then that player has lost the game. Otherwise, one can evaluate the positions open to the player whose turn it is; this player ideally chooses the best of these positions to play and it becomes the other player's turn.

Given only the characterization of positions in terms of the sets of positions to which they can lead, it's possible to determine everything about the strategy of a game. Now, of course, one can really start a game in any of the positions to which it can lead, not just its orthodox starting position; so the distinction between a position and a game becomes unimportant. So, formally, Conway defines a game to be a pair of sets of games; one member of the pair is termed its left set and the other its right set. If the game's name is x then the denotation x^R stands for any member of the right set of x; likewise x^L is an arbitrary member of its left set.

Conway then goes on to define a partial ordering on games: x≤y iff there is no y^R that's ≤ x and no x^L that's ≥ y, where ≥ is reverse(≤). The intersection of ≥ and ≤ is an equivalence for which, if you replace one game with another equivalent to it, the dynamics may be radically different but the outcome is the same (at least when both players play optimally). Conway then establishes surreals as a sub-type of games, where each surreal is equivalent to some surreal which is a pair of sets of surreals (as distinct from more general games); and the union of ≤ and ≥ is the all-relation on surreals.

Notational conversion

I want to transpose that into my notation and ways of formulating mathematics, not because of any deficiency of Conway's but simply so that everything on this site uses one consistent approach. Conway's formulation requires a game to be a pair of sets; which I could implement as a list ({sets}:|2) but prefer to implement as an atom W←X; and it's natural in my formalism to have W and X be collections rather than sets, or indeed to consider what relations I could use in their places. Since I'm only interested in the surreals themselves, each of which is expressible as a pair of sets of surreals, I'll skip the games entirely. So a surreal is a pair W←X in which W and X are collections of surreals.

The mutually reverse relations ≤ and ≥, each ({surreals}::{surreals}), are essential to the discussion; for W←X to be a number, we require that no member of X is ≤ any member of W. Each surreal is then a pair W←X of collections of surreals for which (W:≥:X) is empty. (Note that, when X and W are non-empty, (W:≤:X) determines X and W entirely; but we can't use this as the representation of W←X because it's empty when either X or W is empty, losing all information about the other.) Given two such values, W←X and Y←Z, we have Y←Z ≥ W←X precisely if both ({W←X}:≥:Z) and (W:≥:{Y←Z}) are empty. Notice that left values only appear as left constraints and right values as right constraints, when constraining ≥. Once we've established that ≤ and ≥ are transitive and reflexive, we can intersect them to get an equivalence relation.

For a surreal W←X, we don't really need W and X to be collections: we can replace W with any relation which has the same right values as it and X with any relation which has the same left values as it. Any right value of W can be ignored if it's ≤ some other right value of W; likewise for any left value of X that's ≥ some other – and, conversely, we can add arbitrary surreals that shall be ignored to either side. Then again, as long as ≥ and ≤ are reflexive, each subsumes every collection of its values. So, in fact, instead of collections of surreals, we could use sub-relations of ≤ or of ≥ for X and Y. Still, since only the right values of W and left values of X are relevant, it's merely inconvenient to get a notionally distinct surreal when replacing W with some other relation having the same right values, or likewise X for left; so I'll restrict to collections.

The surreals and their ordering

We can define ≤ = reverse(≥). The members of the left value of a surreal number are known as its left options; the members of the right value of a surreal number are known as its right options; collectively, these are known as options of the surreal.

Once I've established that the intersection of ≤ and ≥ is an equivalence relation, I'll use that as the proper notion of equality for surreals, which = shall then denote. When it is necessary to distinguish is the same pair of collections of surreals (implicitly meaning they have the same members in this same sense) from equality (as mediated by our equivalence), I shall follow Conway in writing ≡ for the same pair comparison.

Inductive principle

A proposition about the surreals takes the form of a statement with names, quantified over by the proposition, standing for surreals: as such, it is an encoding of a family of statements obtained from it by consistently replacing each name with one surreal (and removing the wording that says to quantify over that name). Refer to each such statement as an instance of the proposition; and describe an instance as established if has been proven true. Likewise: any specification of a way to obtain a value from one or several surreals, whose values are arbitrary, uses names for these last and is, again, an encoding of a family of instances, each of which replaces each name consistently with one surreal. Describe an instance of a specification as established if the value of all expressions it implicates are fully and properly specified.

For the purposes of this section, I define a simplifying step as a transformation of such an instance in which the surreals it acts on (i.e uses in place of the names in the proposition or specification) are optionally permuted and one of them is replaced by one of its options. I'll describe one instance of a proposition or specification as simpler than another if the former may be arrived at from the latter by some sequence of simplifying steps.

The inductive principle used in specifying functions on the surreals and proving theorems about them is that, in establishing each instance, one may inductively presume that all simpler instances are established. The rationale for this is that there is no way to construct a surreal except in terms of simpler ones.

Justification

To map this principle formally onto the classical inductive approach, classify surreals into ordinal-indexed generations as follows: for any ordinal n, generation n of the surreals comprises all surreals whose left and right values are subsumed by the union, over ordinals m earlier than n, of generation m. (Notice that each ordinal's generation is subsumed by that of any later ordinal.) We can then associate each surreal with the intersection of all ordinals for which the surreal is in that ordinal's generation (i.e. the first such ordinal); describe this as the generation of the surreal. The tacit presumption contained in the inductive principle is that every surreal is in some ordinal's generation.

Consider a simple proposition or definition, H, conforming to our inductive principle and only implicating one surreal in each instance. We can prove H orthodoxly by induction on the generation of this surreal, since every simpler surreal has earlier generation. For more complex propositions and definitions, orthodoxy requires more complex multiple induction: but the added complexity is merely an artifact of the limitations of orthodox induction. The interested reader is welcome to convert the proofs and specifications below into the form required for this approach, but Conway's given inductive principle makes the proofs and specifications vastly more intelligible without any actual loss of rigour.

Reflexivity and transitivity

By way of illustrating how to use this inductive principle, let us now establish that ≥ does actually have two of the properties normally associated with operators using its name: it's reflexive (i.e. every surreal is ≥ itself) and transitive (i.e. u≥v and v≥w implies u≥w).

Theorem: ≥ is reflexive

Proof: given a surreal W←X, we inductively suppose that every option of W←X is a fixed point of ≥. We need to show W←X ≤ W←X, i.e. ({W←X}:≥:X) = {} = (W:≥:{W←X}); and we know (W:≥:X) = {}. For the left half of this, there must be no x in X with W←X ≥ x, for which it suffices that each x in X has non-empty ({x}:≥:X), for example.

Every right option x of W←X is a member of X and a fixed point of ≥ hence a fixed point of ({x}:≥:X) which is consequently non-empty, so ({W←X}:≥:X) = {}. Every left option w of W←X is likewise a member of W and a fixed point of ≥ hence a fixed point of (W:≥:{w}), so (W:≥:{W←X}) = {}. We have thus proven that W←X ≥ W←X; thus any surreal whose options are fixed points of ≥ is itself a fixed point of ≥; hence every surreal is in fact a fixed point of ≥, i.e. ≥ is reflexive.

Theorem: ≥ is transitive

given u≥v and v≥w, we need to show u≥w. Let u ≡ U←X, v ≡ V←Y and w ≡ W←Z; we are given that, among other restrictions of ≥, ({v}:≥:X) and (W:≥:{v}) are empty. For any option s of w, our inductive principle allows us to presume that s≥u (and u≥v) implies s≥v, whence {s in W: s≥v} subsumes {s in W: s≥u}. However, (W:≥:{v}) is given to be empty, hence so is {s in W: s≥v}, hence so is its sub-collection {s in W: s≥u}, hence so is (W:≥:{u}). Likewise, from our inductive principle, for any option s of u, (v≥w and) w≥s implies v≥s, whence {} = {s in X: v≥s} subsumes {s in X: w≥s}, whence ({w}:≥:X) is also empty. But (W:≥:{u}) = {} = ({w}:≥:X) iff u≥w – QED.

The simplest surreal

The constructibility requirement, implied by the rationale for our inductive principle, obliges us to consider how we may construct some actual surreals, for which we'll need some collections of surreals. We inescapably have {} as a collection of surreals and it fatuously satisfies ({}:≥:{}) = {}, giving us {}←{} as a surreal. This lets us construct another collection of surreals, which we can use in constructing further surreals with which to construct further collections of surreals and so on.

From the definitions, for any collection X of surreals, (X:≥:{}) = {} = ({}:≥:X) fatuously, so X←{} and {}←X are surreals. Likewise, ({{}←{}}:≥:{}), ({}:≥:{X←{}}), ({}:≥:{{}←{}}) and ({{}←X}:≥:{}) are all empty so X←{} ≥ {}←{} ≥ {}←X.

Further basic properties

Thus ≥ is reflexive and transitive, as are its reverse ≤ and their intersection, which is necessarily an equivalence relation. Since every surreal is a fixed point of ≥, hence also of ≤, every surreal is a fixed point of this equivalence, i.e. equivalent to itself. Two surreals are deemed equal precisely if this equivalence relates them to one another: x = y iff x≤y and y≤x, for surreal x and y. Define relations < correspondingly by x<y iff y≥x and ({x}:≥:{y}) is empty (those more relaxed than me about reductio can read not(x≥y) where I say ({x}:≥:{y}) is empty) and let > be its reverse.

Now, transitivity of ≥ implies that, for given collections X, Y, Z of surreals, (X:≥:Z) subsumes (X:≥:Y)&on;(Y:≥:Z), since the latter relates a right value x of X to a left value z of Z precisely if ≥ relates x to some y in Y which ≥ relates to z, i.e. x≥y≥z, which implies x≥z. Further, if (X:≥:Z) is empty and either (X:≥|Y) or (Y|≥:Z) then (respectively) (Y:≥:Z) or (X:≥:Y) must be empty. For the first case: {} = (X:≥:Z) subsumes (X:≥|Y)&on;(Y:≥:Z); for any y in Y, there's an x in X with x≥y and any surreal v with y≥v also has x≥v, whence v is not in Z; so (Y:≥:Z) is empty. The second case, {} = (X:≥:Z) with (Y|≥:Z) implies (X:≥:Y) = {}, is similar. I shall use this result extensively below.

we have x≥x, so ({x}:≥:Y) = {} = (W:≥:{x}). For any w in W or y in Y we thus have ({x}:≥:{y}) = {} = ({w}:≥:{x}) as required for y>x>w; it remains to show y≥x≥w. Since x is surreal, (W:≥:Y) = {} so ({w}:≥:Y) = {} = (W:≥:{y}). Suppose w ≡ U←V; our inductive principle lets us assert that (U|<:{w}), whence ({w}:≥|U); and {} = (W:≥:{x}) subsumes ({w}:≥{x}) subsumes ({w}:≥|U)&on;(U:≥:{x}), whence (U:≥:{x}) is empty; taken with ({w}:≥:Y) = {}, this implies x≥w. Likewise with y = U←V we get ({y}:<|V) so (V|≥:{y}); and {} = ({x}:≥:Y) subsumes ({x}:≥:{y}) subsumes ({x}:≥:V)&on;(V|&on;:{y}), so ({x}:≥:V) is empty, as was (W:≥:{y}), so y≥x – QED.

let x ≡ W←X and y ≡ Y←Z. From the definition, x≥y iff ({y}:≥:X) = {} = (Y:≥:{x}) so we have three possibilities: x≥y or; there is some v in Y with v≥x or; there is some u in X with y≥u. Now v in Y implies v<y and u in X implies x<u so our three cases reduce to: x≥y or y>v≥x or y≥u>x and the last two both imply y≥x – QED.

Thus our comparisons constitute a total order on surreals. Note that the proof actually gives us x≥y or y>x directly; but, in any case, the theorem itself implies this since x≥y or y≥x implies x≥y or y≥x and not(x≥y) which is exactly the statement x≥y or y>x from the definition of >.

It remains, for this section, to prove that when two surreals are equivalent, replacing one with the other as an option of a third surreal yields a surreal equal to this third – that is, our equivalence is good for what we want of it. I shall deal with this by proving a stronger result.

Theorem: given surreal W←Y and collections V and Z of surreals; (W|≤:V), (V|≤:W), (Y|≥:Z) and (Z|≥:Y) implies V←Z is surreal and W←Y = V←Z.

W←Y is surreal implies {} = (W:≥:Y), which subsumes (W:≥:Z)&on;(Z|≥:Y) whence {} = (W:≥:Z) which subsumes (W:≥|V)&on;(V:≥Z) so (V:≥:Z) is empty and V←Z is surreal.