Skip to article frontmatterSkip to article content

In the previous chapters, we have repeatedly brought up a metamathematical context, such as the use of the Axiom of Choice, or Solovay’s model in which every set of reals is measurable. But we have not really distinguished between results in a formal theory and in the metatheory, mostly because we did not really establish a formal theory to begin with. We have treated descriptive set theory like most other mathematical theories in that we defined our basic objects (Polish spaces, Borel sets, etc.) and then started proving facts about them “the usual way”, as we would prove facts about commutative rings or locally compact topological spaces. But in order to make better sense of the metamathematical issues, we have to “pause” and talk a bit about the axioms of set theory.

Comprehension and Russell’s Antinomy

To develop set theory formally as a theory of first order logic, we first need to fix the language. We want to consider the notion of a set as foundational (with the intention to develop everything else from it), we provide only one binary relation symbol, \in. We will denote this as the language of set theory, L\mathcal{L}_\in.

How can we axiomatize the concept of set?

In his famous 1895 paper “Beiträge zu Begründung der transfiniten MengenlehreCantor (1895), Georg Cantor writes

Unter einer “Menge” verstehen wir jede Zusammenfassung MM von bestimmten wohlunterschiedenen Objekten unserer Anschauung oder unseres Denkens (welche die “Elemente” von MM genannt werden) zu einem Ganzen.

This can be translated approximately as: A set is any collection of certain distinct objects of our intuition or our thought into a whole.

We can try to make this more precise as follows:

For every property P(x)P(x) exists a set MM of all objects xx having property PP: M={x:P(x)}M =\{x: P(x) \}.

This can be formalized in the language of set theory as an axiom scheme: For every L\mathcal{L}_\in-formula φ(x)\varphi(x),

(Comprehension)φ_\varphi yx  (xyφ(x))\exists y \, \forall x \; ( x \in y \leftrightarrow \varphi(x))

This axiom, however, is contradictory.

We obtain similar contradictions if we choose as φ(x)\varphi(x) the formula

x=xantinomy of the set of all sets (Cantor)x  is a cardinalCantor, around 1899, published 1932x  is an ordinalantinomy of Burali-Forti\begin{align*} x = x \qquad & \text{antinomy of the set of all sets (Cantor)} \\ x\;\text{is a cardinal} \qquad & \text{Cantor, around 1899, published 1932}\\ x \; \text{is an ordinal} \qquad & \text{antinomy of Burali-Forti} \end{align*}

Regarding the existence of sets, we have to distinguish between

  • classes, which we will denote by capital letters A,B,A,B,\dots (for some specific, important classes we will also use boldface) and
  • sets, denoted in this context by lower case letters a,b,,x,ya,b,\ldots,x,y \ldots.

An arbitrary property φ(x)\varphi(x) will define a corresponding class

A={x ⁣:φ(x)}A = \{x \colon \varphi(x)\}

As we saw above, classes are not necessarily sets: some are “too large” to be a set, as for example the class of all sets,

V={x ⁣:x=x}\mathbf{V} = \{ x \colon x = x \}

Classes that are not sets are called proper classes.

You should keep in mind that, in the formal theory ZF\ZF, we do not have variables for classes, so the definition above is informal. Any class variable, as well as expressions like aAa \in A, should be seen as abbreviations for a formal expression using the underlying formula. There are formal systems (such as Bernays-Gödel set theory) that use classes explicitly, but they are used less frequently.

The Axioms of ZFC\mathsf{ZFC}

We begin with the Axiom of Extensionality, which is essential for the equality relation between sets.

Consequently, two sets coincide if they have the same elements.

The basic idea of the Zermelo-Fraenkel axiom system ZF\ZF is that we avoid introducing sets that are “too large” (and hence would lead to contradictions) by allowing new sets only if they can be “generated” from a given set by a number of fixed, well-behaved operations.

So let us postulate that at least one set exists:

This axiom is not strictly necessary, as the existence of a set also follows from other axioms in ZF\ZF (or usually even from the underlying axioms of first-order logic). But it is good to have it as a starting point here for emphasis.

We have seen we cannot use full comprehension for sets. In its place we introduce the scheme of Separation. Let φ(x,z,w1,,wn)\varphi(x, z, w_1, \dots, w_n) be a L\mathcal{L}_\in-formula in which yy does not occur as a free variable.

By Extensionality, the set yy is unique. Note that there is one axiom for every formula φ{}\varphi, so Separation is an axiom scheme.

Separation allows us to select from any class {x ⁣:φ(x,)}\{x \colon \varphi(x,\ldots)\} those elements that are in a given set aa and collect them in a set

{xa ⁣:φ(x,)}\{x \in a \colon \varphi(x,\ldots)\}

Next, we have

This axioms allows to form pairs of sets, specifically

{a,b}:={x ⁣:x=a    x=b}{a}:={x ⁣:x=a}singleton set(a,b):={{a},{a,b}}ordered pair\begin{align*} \{a,b\} & : = \{x\colon x=a \; \vee \; x=b \} & \\ \{a\} &: = \{x\colon x=a \} & \text{singleton set}\\ (a,b) &: = \{\{a\},\{a,b\}\} & \text{ordered pair} \end{align*}

While for the pair set {a,b}={b,a}\{a,b\} = \{b,a\} the order is not important, we have for the ordered pair

(a,b)=(c,d)a=cb=d.(a,b) = (c,d) \leftrightarrow a = c \wedge b = d.

Hence, we can introduce binary relations as classes of ordered pairs

Rel(R):uR  x,y  (u=(x,y)).\begin{gather*} \Op{Rel}(R) :\leftrightarrow \forall u \in R \; \exists x,y \; (u =(x,y)). \end{gather*}

As usual, by identifying functions with their graphs, we can introduce functions as a special kind of relation:

F:AB:xA  !yB  (x,y)F.\begin{gather*} F: A \to B :\leftrightarrow \forall x \in A \; \exists ! y \in B \; (x,y) \in F. \end{gather*}

We write Fun(a)\Op{Fun}(a) to denote that fact that aa is a function.

Further elementary axioms:

In words, if φ{}\varphi defines a function, then the image of any set under that function is a set. Note that, as in the case of Separation, φ{}\varphi is allowed to have parameters, which we dropped here to improve readability.

It follows that, for a given set aa, the following classes are sets:

a=xax:={z ⁣:xa  zx}unionF[a]={F(x)xa}:={y ⁣:xa  y=F(x)}image setP(a):={x ⁣:xa}power set\begin{align*} \bigcup \, a = \bigcup_{x \in a} x & : = \{z \colon \exists x \in a \; z \in x \} & \qquad \text{union}\\ F[a] = \{F(x)|x \in a\} &: = \{y\colon \exists x \in a \; y = F(x) \} & \qquad \text{image set}\\ \mathcal{P}(a) &: = \{x\colon x \subseteq a\} & \qquad \text{power set} \end{align*}

The axiom of Infinity is a “pure” set existence axiom, that is, it does not depend on another set already existing. It therefore renders the axiom of Set Existence above redundant. It also implies the existence of the set ω{}\omega (the set theoretic version of the natural numbers), along with the operation ++.

Using N\Nat, we can introduce the other basic number sets:

  • Z=(N×N)/Z\Integer = (\Nat\times\Nat)/ \sim_\Integer, where (x,y)Z(u,v):x+v=y+u(x,y) \sim_\Integer (u,v) :\Leftrightarrow x+v = y+u. Multiplication on Z\Integer can be defined inductively (see below).
  • Q=(Z×Z)/Q\Rat = (\Integer\times\Integer)/\sim_\Rat, where (x,y)Q(u,v):xv=yu(x,y) \sim_\Rat (u,v) :\Leftrightarrow xv = yu.
  • We can extend the linear order << of N\Nat to Z\Integer and then to Q\Rat in the usual way. Then we can introduce the real numbers as the set of Dedekind cuts:
R={xP(Q) ⁣:xxQzxyQy<zyx}.\Real = \{ x \in \mathcal{P}(\Rat) \colon x \neq \emptyset \: \wedge \: x \neq \Rat \: \wedge \: \forall z \in x \forall y \in \Rat \: y < z \to y \in x \}.

We complete the list of ZF\ZF axioms with the Axiom of Foundation:

As we discussed before, Foundation rules out, for example, that a set can be an element of itself. More precisely, the axiom states that \in-relation is well-founded on any set.

We can also formalize the Axiom of Choice in the language of set theory:

We denote the axiom system ZF+AC\ZF + \AC as ZFC\ZFC, Zermelo-Fraenkel with Choice.

References
  1. Cantor, G. (1895). Beiträge zur Begründung der transfiniten Mengenlehre. Mathematische Annalen, 46(4), 481–512.