Skip to article frontmatterSkip to article content

Relative consistency proofs

In this section, we are going to show that if ZF\ZF is consistent, so are ZF+AC\ZF + \AC and ZF+GCH\ZF + \GCH. The usual way to do this is to exhibit a model ZF\ZF in which the additional axioms holds, too, assuming a model of ZF\ZF exists. The universe of a model is supposed to be a set, and we will work with such set models when we will construct a model of ZF\ZF in which CH\CH does not hold.

In this section, we will work with class models instead, in particular, LL. The satisfaction relation is not formalizable for arbitrary classes, so we have to argue syntactically.

In the previous section, we showed that LL is an inner model for ZF\ZF. What the “model” part here means is simply that we can prove in ZF\ZF that every axiom of ZF\ZF holds relative to LL, or, using the standard notation for provability,

ZFσLfor all axioms σZF.\ZF \vdash \sigma^L \qquad \text{for all axioms } \sigma \in \ZF.

In this section, we will also show that

ZFτL\ZF \vdash \tau^L

for τ=AC\tau = \AC and τ=GCH\tau = \GCH. We claim that this yields

If ZF\ZF is consistent, then ZF+τ\ZF + \tau is consistent.

For suppose ZF+τ\ZF+\tau is inconsistent. Then there exists a proof of θ¬θ\theta \wedge \neg \theta from ZF+τ\ZF + \tau, for some formula θ\theta. Every formal proof uses only finitely many steps, so there exists a finitely many σ1,,σnZF+τ\sigma_1, \dots, \sigma_n \in \ZF + \tau such that

σ1σn    θ¬θ.\sigma_1\wedge \dots \wedge \sigma_n \; \vdash \; \theta \wedge \neg \theta.

By the Deduction Theorem of first-order logic, we have

(σ1σn)    (θ¬θ).\vdash (\sigma_1\wedge \dots \wedge \sigma_n) \; \to \; (\theta \wedge \neg \theta).

This means (σ1σn)    (θ¬θ)(\sigma_1\wedge \dots \wedge \sigma_n) \; \to \; (\theta \wedge \neg \theta) is a validity and derivable by purely logical arguments (not assuming any additional axioms). But any such validity will remain valid when relativized (recall that classes are always defined via a formula φ{}\varphi):

(σ1σn)L    (θ¬θ)L.\vdash (\sigma_1\wedge \dots \wedge \sigma_n)^L \; \to \; (\theta \wedge \neg \theta)^L.

By assumption, ZF(σ1σn)L\ZF \vdash (\sigma_1\wedge \dots \wedge \sigma_n)^L, hence

ZF(θ¬θ)L.\ZF \vdash (\theta \wedge \neg \theta)^L.

By the definition of relativization, the right-hand side is equivalent to θL¬θL\theta^L \wedge \neg \theta^L, which implies ZF\ZF is inconsistent - contradiction!

The Axiom V=L\VL

We can add to ZF\ZF the axiom that all sets are constructible, i.e.

(V=L)xy(y is an ordinal   xLy).(\VL) \qquad \forall x \exists y \: (y \text{ is an ordinal } \wedge \; x \in L_y).

This axiom is usually denoted by V=L\VL. We may be tempted to think that LL is then trivially a model of ZF+V=L\ZF + \VL. But this is not at all clear, since this has to hold relative to LL, i.e. (V=L)L(\VL)^L.

This means that

xLyL(y is an ordinal   (xLy)L).\forall x \in L \: \exists y \in L \: (y \text{ is an ordinal } \wedge \; (x \in L_y)^L).

To verify this, we need to make sure that inside LL, LL “means the same as” LL. This is, of course, an absoluteness property, and we therefore revisit the complexity of the formulas defining the constructible universe.

We have seen that the map aPDef(a)a \mapsto \mathcal{P}_{\Op{Def}}(a) is Σ1\Sigma_1. This has important implications for the map αLα\alpha \mapsto L_\alpha.

Proof

We first show that the mapping is Σ1\Sigma_1. The mapping is obtained by ordinal recursion over the functions aPDef(a)a \mapsto \mathcal{P}_{\Op{Def}}(a) and aaa \mapsto \bigcup a.

In general, if a function G:VVG: \V \to \V is Σ1\Sigma_1 and F:OrdVF: \Ord \to \V is obtained by recursion from GG, i.e. F(α)=G(Fα)F(\alpha) = G(F\Rest{\alpha}), then FF is also Σ1\Sigma_1. This is because

y=F(α)    αOrdf(f function dom(f)=αβ<α(f(β)=G(fβ))y=G(f)).\begin{align*} y= F(\alpha) \; \leftrightarrow \; \alpha \in \Ord \: \wedge \: \exists f \: & ( f \text{ function } \wedge \Op{dom}(f) = \alpha \\ & \quad \wedge \forall \beta < \alpha (f(\beta) = G(f \Rest{\beta})) \wedge y = G(f)). \end{align*}

Applying some of the various prefix transformations for Σ1\Sigma_1-formulas, and using that being an ordinal, being an function, being the domain of a function, etc., are all Δ0\Delta_0 properties, the above formula can be shown to be Σ1\Sigma_1, too.

In our case, GG is a function that applies either PDef\mathcal{P}_{\Op{Def}} or \bigcup (both at most Δ1\Delta_1), depending on whether the input is a function defined on a successor ordinal or a limit ordinal (or applies the identity if neither is the case). Fortunately, this case distinction is also Δ0\Delta_0, and hence we obtain that F:αLαF: \alpha \mapsto L_\alpha is Σ1\Sigma_1.

Finally, as in Theorem 2, observe that if FF is a Σ1\Sigma_1 function with a Δ1\Delta_1 domain (Ord\Ord), then FF is actually Δ1\Delta_1, since we have

F(x)y    xdom(F)z(F(x)=zyz)F(x) \neq y \; \Leftrightarrow \; x \notin \Op{dom}(F) \: \vee \: \exists z (F(x)=z \: \wedge \: y \neq z)

so the complement of the graph of FF is Σ1\Sigma_1-definable, too.

We can relativize the definition of LL to other classes MM. If MM is is an inner model, then the development of LL can be done relative to MM. Since MM is a ZF\ZF-model, it has to contain all the sets LαML_\alpha^M (as we developed definability and proved facts about it inside ZF\ZF). As MM is transitive, the mapping F:αLαF: \alpha \mapsto L_\alpha is absolute for MM and we obtain, for all α{}\alpha,

LαM=Lα.L_\alpha^M = L_\alpha.
Proof

(1) follows immediately from the fact that for such MM, LαM=LαL_\alpha^M = L_\alpha.

(2) We have

(V=L)LxLyL(y is an ordinal   xLy)LxLα(xLα)L(OrdL and absolute)xLα(xLα)(by (1))\begin{align*} (\VL)^L & \leftrightarrow \: \forall x\in L \exists y \in L \: (y \text{ is an ordinal } \wedge \; x \in L_y)^L & \\ & \leftrightarrow \: \forall x\in L \exists \alpha \: (x \in L_\alpha)^L & \qquad \text{($\Ord \subset L$ and absolute)}\\ & \leftrightarrow \: \forall x\in L \exists \alpha \: (x \in L_\alpha) & \qquad \text{(by (1))} \end{align*}

The last statement is true since L=αLαL = \bigcup_{\alpha} L_\alpha.

Constructibility and the Axiom of Choice

Every well-ordering on a transitive set XX can be extended to a well-ordering of PDef(X)\mathcal{P}_{\Op{Def}}(X).

Note that every element of PDef(X)\mathcal{P}_{\Op{Def}}(X) is determined by a pair (ψ,a)(\psi, \vec{a}), where ψ{}\psi is a set-theoretic formula, and a=(a1,,an)X<ω\vec{a} = (a_1, \dots, a_n) \in X^{<\omega} is a finite sequence of parameters.

For each zPDef(X)z \in \mathcal{P}_{\Op{Def}}(X) there may exist more than one such pair (i.e. zz can have more than one definition), but by well-ordering the pairs (ψ,a)(\psi, \vec{a}), we can assign each zPDef(X)z \in \mathcal{P}_{\Op{Def}}(X) its least definition, and subsequently order PDef(X)\mathcal{P}_{\Op{Def}}(X) by comparing least definitions. Elements already in XX will form an initial segment.

Such an order on the pairs (ψ,a)(\psi, \vec{a}) can be obtained in a definable way: First use the order on XX to order X<ωX^{<\omega} length-lexicographically, order the formulas by their Gödel numbers, and finally put

(ψ,a)<(φ,b) iff ψ<φ or (ψ=φ and a<b).(\psi,\vec{a}) < (\varphi, \vec{b}) \quad \text{ iff } \quad \psi < \varphi \text { or } (\psi = \varphi \text { and } \vec{a} < \vec{b}).

Based on this, we can define an order <L<_L all levels of LL so that the following hold:

  • (1) \quad <LVω<_L \Rest{V_\omega} is a standard well-ordering of VωV_\omega (as for example given by a canonical isomorphism (Vω,)(N,E)(V_\omega, \in) \leftrightarrow (\Nat, E), see Ackermann (1937))
  • (2) \quad <LLα+1<_L\Rest{L_{\alpha+1}} is the order on PDef(Lα)\mathcal{P}_{\Op{Def}}(L_\alpha) induced by <LLα<_L|L_\alpha.
  • (3) \quad <LLλ=α<λ<LLα<_L\Rest{L_\lambda} = \bigcup_{\alpha < \lambda} <_L \Rest{L_\alpha} for a limit ordinal λ>ω\lambda > \omega.

It is straightforward to verify that this is indeed a well-ordering on LL. Moreover, the relation <L<_L is Δ1\Delta_1. (To verify this, we have to spell out all the details of the above definition. This is a little involved, so we skip this here and refer to Jech (2003).)

Since LL is a model of ZF+V=L\ZF+\VL, we obtain

Condensation and the Continuum Hypothesis

We now show that V=L\VL implies the Continuum Hypothesis. The proof works by showing that under V=L\VL, every subset of a cardinal κ{}\kappa will be constructed by stage κ+\kappa^+. This is made possible by a “condensation” argument: If any subset xx of κ{}\kappa is in LL, then it must show up at some stage LλL_\lambda. κ{}\kappa and xx generate an elementary substructure MM of LλL_\lambda of cardinality κ{}\kappa. If we could show that this MM itself must be an LβL_\beta, we can use the fact following fact. Essentially, it tells us that the cardinality of the LαL_\alpha evolves “tamely” along the ordinals (as opposed to (Vα)(V_\alpha)).

Proof

We know that αLα\alpha \subseteq L_\alpha. Hence αLα|\alpha| \leq |L_\alpha|. To show αLα|\alpha| \geq |L_\alpha|, we work by induction on α{}\alpha.

If α=β+1\alpha = \beta +1, then by Proposition 1(4), Lα=Lβ=βα|L_\alpha| = |L_\beta| = |\beta| \leq |\alpha|.

If α{}\alpha is limit, then LαL_\alpha is a union of α|\alpha| many sets of cardinality α\leq |\alpha| (by inductive hypothesis), hence of cardinality α\leq |\alpha|.

But why would an elementary substructure of an LλL_\lambda have to be itself an LβL_\beta? This is where the absoluteness of the construction of LL strikes yet again!

Proof

Let TT be the axioms of ZF\ZF (including Pairing, Union, Set Existence) used to prove that all the theorems leading up to the fact that for all α{}\alpha, LαL_\alpha exists and that αLα\alpha \mapsto L_\alpha is Δ1\Delta_1 (and hence absolute). Any proof is finite, so we have used only finitely many (instances of) axioms of ZF\ZF to prove these facts. In particular, TT is finite. Let φV=L\varphi_{\VL} be the sentence we obtain by taking the conjunction (\wedge) of all axioms in TT together with the axiom V=L\VL.

Suppose for a transitive set MM, MφV=LM\models \varphi_{\VL}. Let λ{}\lambda be the least ordinal not in MM. We must have that OrdM=λ\Ord^M = \lambda, by the absoluteness of ordinals.  Moreover, λ{}\lambda must be a limit ordinal since for each αM\alpha \in M, α{α}\alpha \cup \{\alpha\} is in MM since MM satisfies Pairing and Union.

We have that

MxαOrd(xLα),M\models \forall x \exists \alpha\in \Ord (x \in L_\alpha),

thus

xMα<λ(xLαM).\forall x \in M \exists \alpha < \lambda (x \in L^M_\alpha).

By absoluteness of αLα\alpha \mapsto L_\alpha, we have LαM=LαL^M_\alpha = L_\alpha and therefore

MαMLα=α<λLα=Lλ.M \subseteq \bigcup_{\alpha \in M} L_\alpha = \bigcup_{\alpha < \lambda} L_\alpha = L_\lambda.

On the other hand, for each α<λ\alpha < \lambda, LαML_\alpha^M exists in MM (since TT is strong enough to prove this), and by absoluteness

Lλ=α<λLα=αMLαMM.L_\lambda = \bigcup_{\alpha < \lambda} L_\alpha = \bigcup_{\alpha \in M} L^M_\alpha \subseteq M.

We now put condensation to use as described above.

Proof

As we assume V=L\VL, by the Reflection Theorem, there exists limit λ>κ\lambda > \kappa such that xLλx \in L_\lambda and such that LλφV=LL_\lambda \models \varphi_{\VL}. Let X=κ{x}X = \kappa \cup \{x\}. By choice of λ{}\lambda, XLλX \subseteq L_\lambda.

By the Löwenheim-Skolem Theorem, there exists an elementary substructure NLλN \preceq L_\lambda such that

XNLλ and N=X.()\tag{$*$} X \subseteq N \subseteq L_\lambda \quad \text{ and } \quad |N| = |X|.

NN is not necessarily transitive, but since it is well-founded we can take its Mostowski collapse (Theorem 1) and obtain a transitive set MM together with an isomorphism π:(N,)(M,)\pi: (N,\in) \to (M,\in).

Since κ{}\kappa is contained in both MM and NN, and is already transitive, it is straightforward to show via induction that π(α)=α\pi(\alpha) = \alpha for all ακ\alpha \in \kappa. Since xκx \subseteq \kappa, this also yields π(x)=x\pi(x) = x. This implies in turn that xMx \in M.

As (M,)(M,\in) is isomorphic to (N,)(N,\in) and NLλN \preceq L_\lambda, MM satisfies the same sentences as (Lλ,)(L_\lambda, \in). In particular, MφV=LM \models \varphi_{\VL}. By the condensation lemma, M=LβM = L_\beta for some β{}\beta.

This implies, by Proposition 2,

β=Lβ=M=N=X=κ<κ+.|\beta| = |L_\beta| = |M| = |N| = |X| = \kappa < \kappa^+.

Since xLβx \in L_\beta and β<κ+\beta < \kappa^+, it follows that xLκ+x \in L_{\kappa^+}, as desired.

Proof

If V=L\VL, then by Lemma 2, P(κ)Lκ+\mathcal{P}(\kappa) \subseteq L_{\kappa^+}. With Proposition 2, we obtain

2κ=P(κ)Lκ+=κ+.2^\kappa = |\mathcal{P}(\kappa)| \leq |L_{\kappa^+}| = \kappa^+.
References
  1. Ackermann, W. (1937). Die Widerspruchsfreiheit der allgemeinen Mengenlehre. Mathematische Annalen, 114(1), 305–315.
  2. Jech, T. (2003). Set Theory. Springer-Verlag.
  3. Devlin, K. J. (1984). Constructibility. Springer-Verlag.
  4. Mathias, A. R. (2006). Weak systems of Gandy, Jensen and Devlin. In Set Theory (pp. 149–224). Springer.