Skip to article frontmatterSkip to article content

In this chapter, we take a further look at Borel subsets of NN\Baire. As is common in this setting, we call the elements of NN\Baire reals. This is motivated by the fact that, via the continued fration expansion, NN\Baire is homeomorphic to the set of irrational real numbers. Suppose UNNU \subseteq \Baire is open. Then there exists a set WN<NW \subseteq \Nstr such that

U=σWNσ.U = \bigcup_{\sigma \in W} \Cyl{\sigma}.

Using a standard (effective) coding procedure, we can identify a finite sequence of natural numbers with a natural number, and thus can see WW as a subset of N\Nat.

If we provide a Turing machine with oracle WW, we can semi-effectively test for membership in UU as follows. Assume we want to determine whether some αNN\alpha \in \Baire is in UU. Write α{}\alpha on another oracle tape, and start scanning the WW oracle. If we retrieve a σ{}\sigma that coincides with an initial segment of α{}\alpha, we know αU\alpha \in U. On the other hand, if αU\alpha \in U, then we will eventually find some αn\alpha\Rest{n} in WW. If α∉U\alpha \not\in U, then the search will run forever. In other words, given WW, UU is semi-decidable, or, extending terminology from subsets of N\Nat to subsets of NN\Baire, UU is recusively enumerable (r.e.) relative to WW.

Similarly, we can identify a closed set FF with the code for the tree

TF={αn ⁣:αF,nN}.T_F = \{\alpha\Rest{n} \colon \alpha\in F,\, n\in \Nat \}.

Then determining whether αF\alpha \in F is co-r.e. in (the code of) TFT_F. If α∉F\alpha \not\in F we will learn so after a finite amount of time.

These simple observations suggest the following general approach to Borel sets.

  • Borel sets can be coded by a single infinite sequence in NN\Baire (or 2N\Cant).
  • Given the code, we can recover the Borel set effectively, by means of oracle computations.
  • The connection between degrees of unsolvability and definability results in a close correspondence between arithmetical sets (Σn0\Sigma^0_n) and Borel sets of finite order (Σn0\bSigma^0_n).

In this lecture we will fully develop this correspondence. Later, we will see that it even extends beyond the finite level.

Some notation for reals, strings, and numbers

We fix a computable bijection π:NN<N\pi: \Nat \to \Nstr. In general, we will often use string and their images under π{}\pi interchangeably, that is, for example, if ANA \subset \Nat, we will write σA\sigma \in A to denote π(σ)A\pi(\sigma) \in A. We will also freely identify infinite binary sequences with the set of natural numbers they represent as their characteristic function.

Furthermore, let .,.\Tup{.,.} be the standard coding function for pairs,

x,y=(x+y)(x+y+1)2+y.\Tup{x,y} = \frac{(x+y)(x+y+1)}{2}+y.

Finally, let us define the following operations on elements of Baire (or Cantor) space: Given βNN\beta\in \Baire,

  • Let β\beta' be the real defined by β(n)=β(n+1)\beta'(n) = \beta(n+1). (We cut the first entry.)
  • On the other hand, if kNk \in \Nat and βNN\beta \in \Baire, we obtain a new element of NN\Baire, which we denote by (k,β)(k, \beta), by concatenating kk and β{}\beta.
  • For m0m \geq 0, let (β)m(\beta)_m be the mm-th column of β{}\beta, (β)m(n)=β(m,n)(\beta)_m(n) = \beta(\Tup{m,n}).

Borel codes of finite order

Borel codes are defined inductively.

The first position in each code indicates the kind of set it codes - an open set, a complement, or a union.

Note that the definition of Borel code actually assigns codes to representations of sets. A Borel set can have (and has) multiple codes, just as it has multiple representations. We can, for example, represent an open set by different sets WW of initial segments.

Moreover, every Σ10\bSigma^0_1 set is also Σ20\bSigma^0_2, and thus a set has codes which reflect the more complicated definition of the Σ10\bSigma^0_1 set as a union of closed sets. It is useful to keep this distinction between a Borel set and its Borel representation in mind.

The following is a straightforward induction.

Computing with Borel codes

Suppose γ{}\gamma is a computable code for an FσF_\sigma set BB. We may assume γ{}\gamma is of the form (3,γ)(3,\gamma'), with each column (γ)m(\gamma')_m being of the form (2,1,(α)m)(2,1,(\alpha)_m), coding a closed set FmF_m.

With this, we can express membership in BB as follows:

βBm[β is in the set coded by (γ)m]mn[βn is not in the set coded by (α)m].mn[(α)m(βn)0].\begin{align*} \beta \in B \quad & \Leftrightarrow \quad \exists m \: [\text{$\beta$ is in the set coded by $(\gamma')_m$}] \\ & \Leftrightarrow \quad \exists m \forall n \: [\beta\Rest{n} \text{ is not in the set coded by } (\alpha)_m]. \\ & \Leftrightarrow \quad \exists m \forall n \: [(\alpha)_m(\beta\Rest{n}) \neq 0 ]. \end{align*}

Note that, since we assume γ{}\gamma to be computable, the inner predicate R(m,σ)R(m,\sigma) given by

R(m,σ):    (α)m(σ)0R(m,\sigma) :\iff (\alpha)_m(\sigma) \neq 0

is decidable, that is, it can be decided by a Turing machine.

Hence any Σ20\bSigma^0_2 set BB with a computable code can be represented in the following form:

There exists a decidable predicate R(m,σ)R(m,\sigma) such that

βB    mn  R(m,βn).\beta \in B \quad \iff \quad \exists m \: \forall n \; R(m, \beta\Rest{n}).

Conversely, if R(m,σ)R(m,\sigma) is a (decidable) predicate, let

Fm={β ⁣:nR(m,βn)}.F_m = \{ \beta \colon \forall n \: R(m,\beta\Rest{n}) \}.

We claim that FmF_m is closed: Define a tree TmT_m by letting

σTm:    τσR(m,τ).\sigma \in T_m : \iff \forall \tau \Sleq \sigma \: R(m, \tau).

Then [Tm]=Fm[T_m] = F_m. Moreover,

βmFm    mnR(m,βn)\beta \in \bigcup_m F_m \iff \exists m \forall n \: R(m,\beta\Rest{n})

Thus, there seems to be a close connection between FσF_\sigma sets with computable Borel codes and sets definable by Σ20\Sigma^0_2 formulas over computable predicates. Given that we introduced the notation Σ20\bSigma^0_2 for FσF_\sigma sets earlier, this is perhaps not very surprising.

In this analysis, there seems to be nothing specific about the FσF_\sigma set used in the example. Indeed, it can be extended to Borel sets of finite order, which we will do next.

We will introduce the lightface Borel hierarchy and show that it corresponds to Borel sets of finite order with recursive codes. Using relativization, we then obtain a complete characterization of Borel sets of finite order: They are precisely those sets definable by arithmetical formulas, relative to a real parameter.

But before we do that, we observe a basic fact about how we can compute with codes.

The effective Borel hierarchy

The following is an easy induction.

The following result is at the heart of the effective theory.

Proof

(\Rightarrow) We proceed by induction on the Borel complexity.

Suppose AA is Σ10\Sigma^0_1. Let RR be computable such that A={α ⁣:nR(αn)}A = \{ \alpha \colon \exists n \: R(\alpha\Rest{n})\}. Let

W={σN<N ⁣:R(σ)}.W = \{ \sigma \in \Nstr \colon R(\sigma)\}.

We have αA\alpha \in A if and only if ασWNσ\alpha \in \bigcup_{\sigma \in W} \Cyl{\sigma}. Since RR is decidable, WW is computable and γNN\gamma \in \Baire given by

γ(n)={1n=0,0n1&π(n1)W,17n1&π(n1)W,\gamma(n) = \begin{cases} 1 & n = 0, \\ 0 & n \geq 1 \: \& \: \pi(n-1) \in W,\\ 17 & n \geq 1 \: \& \: \pi(n-1) \notin W, \end{cases}

is a computable Σ10\bSigma^0_1 code for AA.

If AA is Πn0\Pi^0_n, then A=¬BA = \Co{B} for some Σn0\Sigma^0_n BB. By inductive hypothesis, BB has a computable Σn0\bSigma^0_n code γ{}\gamma. Then (2,γ)(2,\gamma) is a computable Πn0\bPi^0_n code for AA.

Finally, assume that AA is Σn+10\Sigma^0_{n+1}. Let PP be Πn0\Pi^0_n such that αA    n  (n,α)P\alpha \in A \iff \exists n \; (n,\alpha) \in P.

By inductive hypothesis, PP has a computable Πn0\bPi^0_n code γ{}\gamma. If we let Pm={β ⁣:(m,β)P}P_m = \{\beta \colon (m,\beta) \in P\}, then A=PmA = \bigcup P_m. Thus, it suffices to show that we can uniformly obtain codes for PmP_m.

We leave the proof as an exercise. Proceed by induction on the Borel complexity of γ{}\gamma.

(\Leftarrow) We proceed by induction on the complexity of the code γ{}\gamma.

If γ{}\gamma is of the form (1,α)(1,\alpha), with α{}\alpha coding an open set UU. Then

βU    nα(βn)=0.\beta \in U \iff \exists n \: \alpha(\beta\Rest{n})= 0.

Since γ{}\gamma is assumed to be computable, the computable relation

R(σ):    α(σ)=0R(\sigma) :\iff \alpha(\sigma) = 0

witnesses that UU is Π10\Pi^0_1.

If γ=(2,α)\gamma = (2, \alpha) is a Πn0\bPi^0_n code, then α{}\alpha is a Σn0\bSigma^0_n code. By inductive hypothesis, the set coded by α{}\alpha is Σn0\Sigma^0_n, so by definition of the effective hierarchy and the Borel codes, γ{}\gamma codes a Πn0\Pi^0_n set.

Finally, assume γ=(3,α)\gamma = (3,\alpha) is a Σn+10\bSigma^0_{n+1} code for a set BB. Then each (α)m(\alpha)_m is a Πn0\bPi^0_n code for a set AmA_m.

The proof is similar to Lemma 1

By inductive hypothesis, the set AA as in the Lemma is Πn0\Pi^0_n and we have

βB    m(m,β)A,\beta \in B \iff \exists m (m, \beta )\in A,

which implies BB is Σn+10\Sigma^0_{n+1}.

Relativization

Using relativized computations via oracles, we can define a relativized version of the effective Borel hierarchy. This way we can capture all Borel sets of finite order, not just the ones with computable codes.

A straightforward relativization gives the following analogue of Proposition 3.

We can now present the fundamental theorem of effective descriptive set theory.

Proof

If AA is Σn0\bSigma^0_n, then by Proposition 1 it has a Σn0\bSigma^0_n-code γ{}\gamma, and by Proposition 4, AA is Σn0(γ)\Sigma^0_n(\gamma). The other direction follows immediately from Proposition 4.

The argument for Πn0\bPi^0_n is analogous.

The theorem facilitates working with Borel sets considerably. As an example, consider the set

A={α ⁣:α eventually constant}.A = \{ \alpha \colon \text{${}\alpha$ eventually constant} \}.

We have

αA    nm[mnα(n)=α(m)]\alpha \in A \iff \exists n \forall m [ m \geq n \: \Rightarrow \: \alpha(n) = \alpha(m) ]

The predicate in the square brackets is computable and depends only on αm\alpha\Rest{m}. Therefore, AA is Σ20\Sigma^0_2 Borel set.