Skip to article frontmatterSkip to article content

It will be important for us to extend the usual counting process beyond the natural numbers. To give an example, let us return for a moment to perfect subsets of the reals. To show that every uncountable closed subset of R\mathbb{R} contains a perfect subset, we considered the condensation points of the set. There is another, more constructive way, to arrive at a perfect subset. When Cantor studied convergence of Fourier series, he introduced the derivative of a set:

A={xA ⁣:x is a limit point of A}A' = \{ x \in A \colon x \text{ is a limit point of } A\}

We can iterate the derivative and consider A,A,A,A^\prime, A^{\prime\prime}, A^{\prime\prime\prime}, \dots. This yields a descending sequence of sets

AAAA(n)A^\prime \supseteq A^{\prime\prime} \supseteq A^{\prime\prime\prime} \supseteq \dots \supseteq A^{(n)} \supseteq \dots

As the sequence is nested, we can take a “limit”:

A=nA(n)A^{\infty} = \bigcap_n A^{(n)}

But the process does not necessarily stop here. AA^\infty may have isolated points again, so that A(A)A^\infty \supsetneq (A^\infty)^\prime.

Let us introduce ω{\omega} as a new number to be used in place of \infty above. We can continue the counting process:

1,2,3,,ω,ω+1,ω+2,,ω+ω,ω+ω+1,,ω+ω+ω,,ωω,1,2,3, \dots, \omega, \omega+1, \omega+2, \dots, \omega + \omega, \omega + \omega +1, \dots, \omega + \omega + \omega, \dots, \omega\cdot \omega, \dots

We can then define, for example, Aω+1:=(Aω)A^{\omega+1} := (A^{\omega})'. As intuitively clear from above, the new transfinite numbers come with a natural ordering, so we can put, for example, Aω+ω:=α<ω+ωAαA^{\omega+\omega} := \bigcap_{\alpha < \omega+\omega} A^{\alpha}.

To obtain a perfect subset, one can show that the above process of taking derivatives, eventually reaches a fixed point: some stage α\alpha for which (Aα)=Aα(A^{\alpha})' = A^\alpha. If AA is uncountable, AαA^\alpha will be a perfect subset, since one can show that at each derivative step we remove an at most countable set and it will take at most countably many steps to reach the fixed point. But what does “countably many steps” exactly mean if we have to count past ω\omega? The formal notions of ordinal and cardinal numbers will answer this question.

The fundamental concept on which the ordinal numbers are based if that of a well-order.

Orders and well-orders

With any reflexive partial order \leq we can associate an irreflexive or strict partial order by letting

a<b:    ab    ab.a < b \quad :\iff \quad a \leq b \; \wedge \; a \neq b.

Likewise, we can obtain a reflexive order from a strict one by defining

ab:    a<b    a=b.a \leq b \quad :\iff \quad a < b \; \vee \; a = b.

We will use \leq or similar symbols to indicate the order is reflexive, while <<, \prec, etc. are reserved for strict orders.

We can enumerate the natural numbers one after another, but for the other standard ordered number domains this is not possible: We cannot find a place to begin counting (as in the case of Z\mathbb{Z}) or there is no “next bigger” element (as in the case of Q\mathbb{Q} or R\mathbb{R}).

To enumerate these domains in the form {a0,a1,,an,an+1,}\{a_0, a_1,\ldots, a_n, a_{n+1}, \ldots \} we have to reorder them in a way that

  • we can start with a smallest element,

and once we have arrived at an element aa,

  • we know with which element we continue the enumeration (i.e. there is an immediate successor to aa),
  • we can continue the enumeration even if we have already enumerated infinitely many elements before aa (but elements of the domain still remain).

These requirements can be combined into a single property: every non-empty subset (i.e. the elements not enumerated yet) has a least element (to be enumerated next).

Orders themselves can be compared using embeddings.

Of course every order is isomorphic to itself (automorphic) via the identity. But many orders allow automorphisms other than the identity (e.g. Z\mathbb{Z} or R\mathbb{R} with zz+1z \mapsto z+1). As we will see, well-orders are very rigid in this regard.

We start with a simple observation.

Proof

If the set {xA ⁣:f(x)<x}\{x \in A \colon f(x) < x\} is non-empty, it has a minimal element zz. But since ff is increasing, this would imply f(f(z))<f(z)f(f(z)) < f(z), contradicting the minimality of zz.

We immediately obtain

An initial segment of an order (A,<)(A,<) is given by all elements that are smaller than a given element bb. We denote this initial segment by AbA\mid_b.

Proof

Suppose f:AAbf: A \to A\mid_b is an isomorphism. Then ran(f)=Ab\operatorname{ran}(f) = A\mid_b and f(x)<bf(x) < b for all xAx \in A. In particular, f(b)<bf(b) < b, contradicting Proposition 1.

Ordinal numbers

Cantor defined ordinal numbers (or ordinals) as isomorphism classes of well-orders. Later, von Neumann suggested a system of representatives particularly suitable for set theoretic considerations. The idea is to define the order << through the \in-relation on a set. For example,

{,{},{,{}}}\{ \emptyset, \{\emptyset\}, \{\emptyset, \{\emptyset\}\}\}

represents a 3-element well-order.

We will impose some conditions on the sets we allow as ordinals. Given a set AA, we use A\in_A to denote the \in-relation on AA:

A={(x,y) ⁣:x,yA    xy}\in_A = \{ (x,y) \colon x,y \in A \; \wedge \; x \in y \}

In other words, transitive sets cannot “hide” elements in subsets.

It is customary to use lower case Greek letters α,β,γ,\alpha, \beta, \gamma, \dots to denote ordinals.

If we exclude certain pathological sets from the beginning, we can further simplify this definition.

Sets which contain themselves (AAA \in A) are not well-founded- {A}\{A\} would be a subset without a \in-minimal element. Similarly, well-founded sets cannot have cycles like abcaa \in b \in c \in a.

If we write out the formulas in full, we see the characterization given in Proposition 2 is much simpler than the original one. Most notably, in Proposition 2 we only use only bounded quantifiers (of the form ya\forall y \in a), whereas in the original form we have to quantify over arbitrary subsets of aa. This is an important difference whose impact will become clear later on.

Proof

Let α{\alpha} be an ordinal, and assume bαb \in \alpha. Any subset of a linear order is again a linear order under the induced order relation. It remains to show that (b,b)(b, \in_b) is transitive (as a set). Let xcbx \in c \in b. We claim xbx \in b. Since α{\alpha} is transitive, bαb \subseteq \alpha and hence cαc \in \alpha. By transitivity of α{\alpha} again, xαx \in \alpha. Thus x,bαx,b \in \alpha, and since α\in_\alpha linearly orders α{\alpha}, we must have

xb    x=b    bx.x \in b \; \vee \; x = b \; \vee \; b \in x.

If x=bx = b, we get bcbb \in c \in b, contradicting well-foundedness. Similar for bxb \in x. Therefore, xbx \in b.

The well-ordering of ordinals

Proposition 3 suggests we can order ordinals by letting

α<β  :      αβ.\alpha < \beta \; :\iff \; \alpha \in \beta.

By Proposition 3, an ordinal then contains precisely the ordinals smaller than it:

α={β:β<α}.\alpha = \{ \beta : \beta < \alpha \}.

\in defines a partial order on all ordinals: As all sets are well-founded, irreflexivity holds, and since ordinals are transitive sets, << is a transitive relation.

Proof

The \Rightarrow direction follows directly from the transitivity of ordinals. For \Leftarrow, we show something more general, namely that any transitive proper subset of an ordinal is itself an ordinal and is an element of the superset ordinal:

trans(a)    aβOrd(a)    aβ.\Op{trans}(a) \; \wedge \; a \subset \beta \quad \Rightarrow \quad \Op{Ord}(a) \; \wedge \; a \in \beta.

If aβa \subset \beta, aa is linearly ordered by \in (as a subset of β{}\beta). Further, if aa is transitive, aa is an ordinal.

It remains to show aβa \in \beta. Since aa is a proper subset of β{}\beta, by well-foundedness there exists a \in-minimal element of γβa\gamma \in \beta \setminus a. We claim a=γa = \gamma. By \in-minimality of γ{}\gamma, every element of γ{}\gamma cannot be in βa\beta\setminus a and therefore has to be in aa. Hence γa\gamma \subseteq a. On the other hand, if xax \in a, then, by assumption xβx \in \beta, and since \in linearly orders β{}\beta,

xγ    x=γ    γx.x \in \gamma \; \vee \; x = \gamma \; \vee \; \gamma \in x.

The latter two are impossible due to γa\gamma \notin a. Hence xγx \in \gamma and therefore aγa \subseteq \gamma, yielding a=γa =\gamma.

Proof

We first show << is a linear order. Irreflexivity follows from well-foundedness of \in. Transitivity of << follow from the transitivity of ordinals as sets. To show

α<β    α=β    β<α,\alpha < \beta \; \vee \; \alpha = \beta \; \vee \; \beta < \alpha,

observe that the intersection of two ordinals is an ordinal, the minimum of the two ordinals. Let γ=αβ\gamma = \alpha \cap \beta. Then γα\gamma \subseteq \alpha, so by Proposition 4, γα\gamma \in \alpha or γ=α\gamma = \alpha and similarly γβ\gamma \in \beta or γ=β\gamma = \beta. But in the case γα,γβ\gamma \in \alpha, \gamma \in \beta we would have γαβ=γ\gamma \in \alpha \cap \beta = \gamma, contradicting well-foundedness.

Finally, if AA is a non-empty set of ordinals, the well-ordering condition on << spells out as

αA  βA    βα.\exists \alpha \in A \; \forall \beta \in A \; \; \beta \notin \alpha.

But this holds since we assume all sets are well-founded.

Basic properties of ordinals

Using the results obtained so far, we can now deduce some basic facts about the structure of ordinals:

  • 0=0 = \emptyset is the smallest ordinal.

  • Every ordinal α{\alpha} has an immediate successor under the ordering <<:

    α=α+1=α{α}.\alpha' = \alpha+1 = \alpha \cup \{\alpha\}.

    Clearly α<α+1\alpha < \alpha+1. If α<β\alpha < \beta, then by Proposition 4, αβ\alpha \subset \beta and αβ\alpha \in \beta. Hence α+1β\alpha+1 \subseteq \beta and therefore α+1β\alpha+1 \leq \beta.

  • The finite ordinals are exactly the natural numbers (“set theoretic version”):

0=,1=0+1={}={},2=1+1={,{}},0 = \emptyset, \quad 1 = 0 + 1 = \emptyset \cup \{\emptyset\} = \{\emptyset\}, \quad 2 = 1+1 = \{\emptyset, \{\emptyset\} \}, \dots
  • The set of all natural numbers is transitive and well-ordered by \in and thus itself an ordinal, the first infinite ordinal ω{\omega}:

    ω={,{},{,{}},}\omega = \{\emptyset, \{\emptyset\}, \{\emptyset, \{\emptyset\}\}, \dots \}
  • ω{\omega} is also the first instance of a limit ordinal: A successor ordinal is any ordinal of the form α+1\alpha+1. Any ordinal λ{\lambda} that is not a successor is called a limit ordinal. Being limit is equivalent to the following property:

    λ0α<λ  (α+1<λ).\lambda \neq 0 \: \wedge \: \forall \alpha < \lambda \; (\alpha+1 < \lambda).

    This shows immediately that ω{\omega} is limit.

  • More generally, if AA is a set of ordinals, supA=αAα\sup A = \bigcup_{\alpha \in A} \alpha is an ordinal and is the least upper bound for AA.

  • The first limit ordinal ω{\omega} is followed by a number of successor ordinals as well as their limits as limit ordinals:

ω,ω+1,ω+2,ω+ω,ω+ω+1,ω+ω+2,ω+ω+ω,ω+ω+ω+1,ω+ω+ω+2,ωω,ωω+1,,ωωωωω\begin{gather*} \omega, \omega+1, \omega+2, \ldots \omega+\omega, \omega+\omega+1, \omega + \omega+2, \ldots \omega+\omega+ \omega, \qquad \quad \\ \omega + \omega+ \omega+1, \omega + \omega+ \omega+2, \ldots \omega \cdot \omega, \omega \cdot \omega +1,\ldots, \omega^{\omega} \ldots \omega^{\omega^{\omega}} \ldots \end{gather*}

All of the ordinals listed here are still countable (as sets). The supremum of the set of all countable ordinals is denoted by ω1\omega_1, the first uncountable ordinal. After ω1\omega_1, we have again successors, limits, and so on.

Metamathematical issues

Is there a set Ord\Ord of all ordinals? If so, it would be well-ordered by \in and also transitive (since, by Proposition 3, every element of an ordinal is an ordinal) and therefore an ordinal. Hence we would have OrdOrd\Ord \in \Ord. But no ordinal can be an element of itself. This holds even if we do not assume every set is well-founded, since ordinals are by definition strictly ordered by \in.

This is known as the Anomaly of Burali-Forti. It tells us that somehow the collection of all ordinals is too big to form a set. It also warns us that if we handle the intuitive concept of a set too carelessly, it might lead to contradictions and inconsistencies.

Later on we will develop an axiomatic approach to sets which aims to exclude antinomies like this. In this framework, we will be able to formally show that Ord\Ord is not a set. It forms what we will call a proper class.

Representing well-orders as ordinals

We introduced ordinals with the goal to have a specific representation for any well-order.

The ordinal α{\alpha} is called the order type of (A,<)(A,<). We will delay the proof of this theorem for a while, until we learn how to extend induction and recursion into the transfinite. This will, in particular, give us the following generalization of the usual induction principle for the natural numbers.

Of course, informally one would prove this as follows: If PP does not hold for some ordinal, since the ordinals are well-ordered, there has to be a least ordinal α{}\alpha for which PP does not hold. But then each of the three cases (0, successor, or limit ordinal) would lead to a contradiction based on the assumption.

However, we have seen that there is no set of all ordinals, so we have to tread a little carefully here. We will take this up again once we have developed an axiomatic foundation for sets.