You may have noticed that, when introducing the axioms of , we never really answered the question “What is a set?” Instead, we developed a formal theory of axioms for a binary relation that somehow describe “how sets work”, that is, how we can obtain sets from given ones using well-known operations like power set and union.
We have then seen how we can develop a lot of standard mathematical objects (like , ) and techniques (like induction and definition by recursion) inside this formal system. In fact, most of mathematics can be developed formally inside this system. Almost all proofs you find in any standard math book are proofs that can be formalized in . It is very tedious to do this for us humans, but there is little doubt it can be done, and in fact, looking at the recent work on proof assistants (like Coq or Lean), many parts of mathematics have been formalized (albeit not directly in ).
This expressiveness gives its foundational importance, but it is also the cause for much confusion for someone who first studies set theory.
From a pedagogical point of view, in what follows it is helpful to assume a “Platonist” perspective of mathematics, and set theory in particular, namely that sets and the relations between exist independently (and outside) of the axioms. The set of real numbers exists, and our development of inside is just a formal way to describe them. From this perspective, the axioms of ( is a little different) are just obvious truths about sets, just like the Peano axioms are obvious truths about natural numbers.
Among other things, this perspective allows us to treat just like any other mathematical theory, like group theory or the theory of algebraically closed fields. In particular, we can think about models of set theory the way we would think about models of group theory, in the sense of model theory.
A model would simply be a set together with a binary relation on such that
that is, all axioms hold when interpreted in . Note that we use “set” in this context not in the formal sense, but in the “meta”-sense (the Platonist world of sets).
Working in the meta-theory (“that what is mathematically true”), we know by Gödel’s completeness theorem that
if is consistent, then it has a model.
This model should be seen as a set-theoretic universe: Its elements can be seen as sets, and the interpretation of the -symbol will tell us how these sets are connected via the element-relation.
Note that does not have to be the actual element relation on a set (of sets), but just some binary relation so that the axioms are satisfied.
In the meta-world, there are, of course, sets other than , but that does not matter here, since all we are interested in is giving some universe in which our axioms hold. (Timothy Chow has suggested that set theory should rather be called "universe theory. He is right in the sense that what axiomatic set theory does is to define such universes of sets, rather than what a set is.)
In the meta-theory, we can then follow the usual techniques to show provability or non-provability results.
If we want to prove that is consistent with (assuming is consistent), we need to find a model in which both hold.
One difficulty in working with models of set theory is that they can look very different depending on whether you look at a model “from the inside” or “from the outside”.
To illustrate this, assume is consistent. Then, by the Löwenheim-Skolem theorem, there exists a countable model for . Yet it is a theorem of that there exists an uncountable set. This is often referred to as Skolem’s paradox, although it is not really an antinomy.
If we break this down a bit, we see that the apparent paradox is really just a matter of perspective (inside or outside). Assume is a countable model of . Then there exists such that there is no injection from to the natural numbers. Since is countable, can have at most countably many elements. So why is this not a contradiction? We should really read the statement above as
there is no injection in from to ’s version of the natural numbers.
In other words, even though is countable from the outside, appears uncountable inside since a mapping witnessing its countability does not exist in .
This is a first warning sign that models of can behave in very unexpected ways. For another example, recall the axiom of Foundation asserts that the -relation is well-founded. But again, this means only “from the inside”.
Proof
Introduce new constant symbols and add the formulas to the axioms of . It is not hard to show, using the compactness theorem, that has a model , for which the set is ill-founded.
Since, as mentioned above, the model satisfies Foundation, the set is actually not in the model (and neither can be any other set with an infinite descending -chain).
Mostowski collapse¶
If we restrict ourselves to models on which the -relation is actually well-founded (i.e. from the outside), then interestingly these models look, in a way, “natural”: They can be assumed to be the -relation on a set. Such models are also called standard.
Given a set theoretic structure (not necessarily a model of ), for each let
If behaves “set-like”, then it will respect the axiom of Extensionality, i.e. two sets are identical if and only if they have the same elements. Therefore we say that is extensional if
Furthermore, as stated above, we want to exclude infinite descending -chains. Recall we say that is well-founded if
every non-empty set has an -minimal element.
Proof
We construct and by recursion on , which is possible since it is well-founded.
For each , let
and set .
The injectivity of follows from the extensionality of by induction along : Suppose we have shown
and we have to show that it holds for . Assume for some . Then
Similarly, we get , hence as desired due to the extensionality of . Finally we have
Thus is an isomorphism.
To see the uniqueness of and , assume , are such that the statement of the theorem is satisfied. Then is an isomorphism between and . Now apply the following lemma.
Proof
By induction on the well-founded relation . Assume that for all and let .
We have because if , then .
We also have : Let . Since , there is with . Since and , we have , and thus .
Hence , and this also implies .
Proof
Every linear order is extensional. Hence we can apply the Mostowski Collapse. It is easy to see that the resulting set is an ordinal.
Absoluteness and transitive models¶
Even if we consider well-founded standard models, interpreting set-theoretic statements in them can still lead to very different results, even for very simple formulas.
In the example above, the set is not transitive, which allowed it to “hide” its elements. It turns out that if we require our model to be transitive, the truth of simple formulas cannot vary between the “inside” and the “outside” perspective. We call such properties absolute.
Given a formula and some class , we can relativize with respect to essentially by restricting all quantifiers in to range over , i.e. becomes and becomes . Note that classes are defined by formulas, so the resulting formula, which we denote by , is still a formula of set theory.
Proof (Sketch)
Clearly and are absolute for any . It is also not hard to see that if and are absolute for , then so are and . Hence all quantifier free formulas are absolute.
Finally, if is absolute for , so is : If holds for , then we have , i.e., . Since if and only if , it follows that , i.e. .
Conversely, if for , , then since is transitive, belongs to , and since if and only , we have and so .
We leave the proof as an exercise.
We can extend the previous absoluteness result to a slightly larger family of formulas.
The absoluteness property of “simple” formulas can be put to use to identify many “almost” models of .
Proof (Sketch)
We verify a few axioms and leave the rest as an exercise.
Extensionality: The relativized version of this axiom is
This simply states that the -relation (as a binary relation) is extensional on (as defined above). It is easy to see that for every transitive set, the -relation is extensional.
Power Set: We have seen that is absolute for transitive classes. Therefore, the relativized version of the Power Set axiom becomes
This means we only need to have those subsets in the power set that are in . In other words, we need to verify
If , then for some , since is limit. Since is transitive, and thus , which in turn yields
If , then . Therefore, . It follows that , as desired.
Infinity: We have seen that is absolute for transitive classes. Hence will satisfy the Axiom of Infinity if . But this is true since we assume .
What is the problem with Replacement? The axiom says that if defines a function, then the image of any set under this function is a set. Relativized to some , this means we need to find a set in that contains the image. Here we run into a cardinality problem. The cardinality of can be much bigger than the cardinality of . For example, but is countable. We could have a function that maps to sets of rank cofinal in , which implies that the image cannot be in .
For to be a model, we would need to be “unreachable” by such mappings. This gives rise to the notion of inaccessible cardinals, which we will introduce in the next section.