Coding Borel Sets - Descriptive Set Theory

In this chapter, we take a further look at Borel subsets of $\Baire$ . As is common in this setting, we call the elements of $\Baire$ reals. This is motivated by the fact that, via the continued fraction expansion, $\Baire$ is homeomorphic to the set of irrational real numbers. Suppose $U \subseteq \Baire$ is open. Then there exists a set $W \subseteq \Nstr$ such that

U = \bigcup_{\sigma \in W} \Cyl{\sigma}.

(1)

Using a standard (effective) coding procedure, we can identify a finite sequence of natural numbers with a natural number, and thus can see $W$ as a subset of $\Nat$ .

If we provide a Turing machine with oracle $W$ , we can semi-effectively test for membership in $U$ as follows. Assume we want to determine whether some $\alpha \in \Baire$ is in $U$ . Write ${}\alpha$ on another oracle tape, and start scanning the $W$ oracle. If we retrieve a ${}\sigma$ that coincides with an initial segment of ${}\alpha$ , we know $\alpha \in U$ . On the other hand, if $\alpha \in U$ , then we will eventually find some $\alpha\Rest{n}$ in $W$ . If $\alpha \not\in U$ , then the search will run forever. In other words, given $W$ , $U$ is semi-decidable, or, extending terminology from subsets of $\Nat$ to subsets of $\Baire$ , $U$ is recursively enumerable (r.e.) relative to $W$ .
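The search just described can be sketched in Python. This is only an illustration under stated assumptions: a real is modelled as a function from indices to natural numbers, $W$ is given as an enumeration of tuples, and the names `semi_decide_open` and `max_steps` are hypothetical. A true semi-decision procedure would search without bound; the cutoff is there only so that the sketch terminates.

```python
from typing import Callable, Iterable, Optional, Tuple

Real = Callable[[int], int]   # alpha(n) is the n-th entry of the real
String = Tuple[int, ...]      # a finite sequence of natural numbers


def semi_decide_open(alpha: Real, W: Iterable[String], max_steps: int) -> Optional[String]:
    """Scan an enumeration of W and return the first sigma that is an
    initial segment of alpha. Return None if no such sigma appears among
    the first max_steps elements (which does NOT prove alpha is outside U)."""
    for step, sigma in enumerate(W):
        if step >= max_steps:
            break
        if all(alpha(i) == entry for i, entry in enumerate(sigma)):
            return sigma
    return None


# Hypothetical example: U consists of the reals that start with 0 or with 1, 2.
W_example = [(0,), (1, 2)]
witness = semi_decide_open(lambda n: n + 1, W_example, max_steps=10)
```

Here `witness` is the string in `W_example` that certifies membership of the real $1, 2, 3, \dots$ in $U$. A return value of `None` carries no information about non-membership, which mirrors the asymmetry of semi-decidability.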

Similarly, we can identify a closed set $F$ with the code for the tree

T_F = \{\alpha\Rest{n} \colon \alpha\in F,\, n\in \Nat \}.

(2)

Then determining whether $\alpha \in F$ is co-r.e. in (the code of) $T_F$ . If $\alpha \not\in F$ we will learn so after a finite amount of time.
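Dually, non-membership in a closed set can be confirmed by a finite search through the tree. The following Python sketch assumes the tree is given as a decidable predicate `in_tree` on tuples; that name, `refute_closed`, and the bound `max_n` are illustrative choices, not part of the text.

```python
from typing import Callable, Optional, Tuple

Real = Callable[[int], int]
String = Tuple[int, ...]


def refute_closed(alpha: Real, in_tree: Callable[[String], bool], max_n: int) -> Optional[int]:
    """Look for the least n <= max_n such that the initial segment of alpha
    of length n is not in the tree. Such an n certifies that alpha is not
    in F. None means no refutation was found within the bound."""
    for n in range(max_n + 1):
        prefix = tuple(alpha(i) for i in range(n))
        if not in_tree(prefix):
            return n
    return None


# Hypothetical example: F is the set of reals with all entries in {0, 1},
# so T_F is the tree of finite binary strings.
def binary_tree(sigma: String) -> bool:
    return all(entry <= 1 for entry in sigma)


# alpha = 0, 1, 0, 3, 0, 1, ... leaves the tree once the entry 3 is read.
alpha = lambda i: 3 if i == 3 else i % 2
first_bad_length = refute_closed(alpha, binary_tree, max_n=20)
```

If $\alpha \in F$ the unbounded version of this search never halts, which is exactly what co-r.e. means here.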

These simple observations suggest the following general approach to Borel sets.

Borel sets can be coded by a single infinite sequence in $\Baire$ (or $\Cant$ ).
Given the code, we can recover the Borel set effectively, by means of oracle computations.
The connection between degrees of unsolvability and definability results in a close correspondence between arithmetical sets ( $\Sigma^0_n$ ) and Borel sets of finite order ( $\bSigma^0_n$ ).

In this lecture we will fully develop this correspondence. Later, we will see that it even extends beyond the finite level.

Some notation for reals, strings, and numbers

We fix a computable bijection $\pi: \Nat \to \Nstr$ . We will often use strings and their preimages under ${}\pi$ interchangeably: for example, if $A \subset \Nat$ , we will write $\sigma \in A$ to denote $\pi^{-1}(\sigma) \in A$ . We will also freely identify infinite binary sequences with the sets of natural numbers whose characteristic functions they are.

Furthermore, let $\Tup{.,.}$ be the standard coding function for pairs,

\Tup{x,y} = \frac{(x+y)(x+y+1)}{2}+y.

(3)
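As a quick sanity check, the pairing function (3) and its inverse can be written in Python as follows. The function names `pair` and `unpair` are our own, and the inverse uses the standard integer square root computation.

```python
from math import isqrt
from typing import Tuple


def pair(x: int, y: int) -> int:
    """Pairing function from (3): <x, y> = (x + y)(x + y + 1)/2 + y."""
    return (x + y) * (x + y + 1) // 2 + y


def unpair(z: int) -> Tuple[int, int]:
    """Inverse of pair: recover (x, y) from z = <x, y>."""
    w = (isqrt(8 * z + 1) - 1) // 2   # w = x + y, the index of the diagonal
    y = z - w * (w + 1) // 2          # position of z along that diagonal
    return w - y, y
```

Since `unpair(pair(x, y))` returns `(x, y)` and `pair(*unpair(z))` returns `z`, the function is a bijection between $\Nat \times \Nat$ and $\Nat$.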

Finally, let us define the following operations on elements of Baire (or Cantor) space: Given $\beta\in \Baire$ ,

Let $\beta'$ be the real defined by $\beta'(n) = \beta(n+1)$ . (We cut the first entry.)
On the other hand, if $k \in \Nat$ and $\beta \in \Baire$ , we obtain a new element of $\Baire$ , which we denote by $(k, \beta)$ , by concatenating $k$ and ${}\beta$ .
For $m \geq 0$ , let $(\beta)_m$ be the $m$ -th column of ${}\beta$ , $(\beta)_m(n) = \beta(\Tup{m,n})$ .
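The three operations can be sketched in Python if we model a real as a function from indices to natural numbers. The names `shift`, `prepend`, and `column` are hypothetical labels for $\beta'$, $(k,\beta)$, and $(\beta)_m$ respectively.

```python
from typing import Callable

Real = Callable[[int], int]


def pair(x: int, y: int) -> int:
    """Pairing function <x, y> from (3)."""
    return (x + y) * (x + y + 1) // 2 + y


def shift(beta: Real) -> Real:
    """beta': drop the first entry, so beta'(n) = beta(n + 1)."""
    return lambda n: beta(n + 1)


def prepend(k: int, beta: Real) -> Real:
    """(k, beta): the real whose first entry is k, followed by beta."""
    return lambda n: k if n == 0 else beta(n - 1)


def column(beta: Real, m: int) -> Real:
    """(beta)_m: the m-th column, (beta)_m(n) = beta(<m, n>)."""
    return lambda n: beta(pair(m, n))
```

Note that `shift(prepend(k, beta))` agrees with `beta` everywhere, which is the sense in which the first two operations undo each other.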

Borel codes of finite order

Borel codes are defined inductively.

The first position in each code indicates the kind of set it codes - an open set, a complement, or a union.

Note that the definition of Borel code actually assigns codes to representations of sets. A Borel set can have (and has) multiple codes, just as it has multiple representations. We can, for example, represent an open set by different sets $W$ of initial segments.

Moreover, every $\bSigma^0_1$ set is also $\bSigma^0_2$ , and thus a set has codes which reflect the more complicated definition of the $\bSigma^0_1$ set as a union of closed sets. It is useful to keep this distinction between a Borel set and its Borel representation in mind.

The following is a straightforward induction.

Computing with Borel codes

Suppose ${}\gamma$ is a computable code for an $F_\sigma$ set $B$ . We may assume ${}\gamma$ is of the form $(3,\gamma')$ , with each column $(\gamma')_m$ being of the form $(2,1,(\alpha)_m)$ , coding a closed set $F_m$ .

With this, we can express membership in $B$ as follows:

\begin{align*} \beta \in B \quad & \Leftrightarrow \quad \exists m \: [\text{$\beta$ is in the set coded by $(\gamma')_m$}] \\ & \Leftrightarrow \quad \exists m \forall n \: [\beta\Rest{n} \text{ is not in the set coded by } (\alpha)_m] \\ & \Leftrightarrow \quad \exists m \forall n \: [(\alpha)_m(\beta\Rest{n}) \neq 0 ]. \end{align*}

(4)

Note that, since we assume ${}\gamma$ to be computable, the inner predicate $R(m,\sigma)$ given by

R(m,\sigma) :\iff (\alpha)_m(\sigma) \neq 0

(5)

is decidable, that is, it can be decided by a Turing machine.

Hence any $\bSigma^0_2$ set $B$ with a computable code can be represented in the following form:

There exists a decidable predicate $R(m,\sigma)$ such that
$\beta \in B \quad \iff \quad \exists m \: \forall n \; R(m, \beta\Rest{n}).$
(6)

Conversely, if $R(m,\sigma)$ is a (decidable) predicate, let

F_m = \{ \beta \colon \forall n \: R(m,\beta\Rest{n}) \}.

(7)

We claim that $F_m$ is closed: Define a tree $T_m$ by letting

\sigma \in T_m : \iff \forall \tau \Sleq \sigma \: R(m, \tau).

(8)

Then $[T_m] = F_m$ . Moreover,

\beta \in \bigcup_m F_m \iff \exists m \forall n \: R(m,\beta\Rest{n})

(9)
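The definition of $T_m$ in (8) translates directly into code. The sketch below assumes $R$ is given as a Python predicate on a number and a tuple; the example predicates are hypothetical and chosen only to show why (8) quantifies over all initial segments.

```python
from typing import Callable, Tuple

String = Tuple[int, ...]
Predicate = Callable[[int, String], bool]


def in_tree(R: Predicate, m: int, sigma: String) -> bool:
    """sigma is in T_m iff R(m, tau) holds for every initial segment tau
    of sigma, including the empty string and sigma itself."""
    return all(R(m, sigma[:k]) for k in range(len(sigma) + 1))


# Hypothetical example 1: R(m, sigma) says every entry of sigma is at most m.
# Then F_m is the set of reals bounded by m.
def bounded(m: int, sigma: String) -> bool:
    return all(entry <= m for entry in sigma)


# Hypothetical example 2: a predicate that is not closed under initial
# segments. It holds of every string except those of length 2.
def not_length_two(m: int, sigma: String) -> bool:
    return len(sigma) != 2
```

With `not_length_two`, a string of length 3 satisfies $R$ but is not in $T_m$, because its initial segment of length 2 fails $R$. Quantifying over all $\tau \Sleq \sigma$ is what guarantees that $T_m$ is closed under initial segments, and hence a tree.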

Thus, there seems to be a close connection between $F_\sigma$ sets with computable Borel codes and sets definable by $\Sigma^0_2$ formulas over computable predicates. Given that we introduced the notation $\bSigma^0_2$ for $F_\sigma$ sets earlier, this is perhaps not very surprising.

In this analysis, there seems to be nothing specific about the $F_\sigma$ set used in the example. Indeed, it can be extended to Borel sets of finite order, which we will do next.

We will introduce the lightface Borel hierarchy and show that it corresponds to Borel sets of finite order with recursive codes. Using relativization, we then obtain a complete characterization of Borel sets of finite order: They are precisely those sets definable by arithmetical formulas, relative to a real parameter.

But before we do that, we observe a basic fact about how we can compute with codes.

The effective Borel hierarchy

The following is an easy induction.

Proposition 2

A set $A \subseteq \Baire$ is $\Sigma^0_n$ if and only if there exists a computable relation $R \subseteq \Nat^n$ such that

\alpha \in A \; \iff \; \exists k_1 \forall k_2 \exists k_3 \dots Q k_n \; R(k_1, k_2, \dots, k_{n-1}, \alpha\Rest{k_n}),

(10)

where $Q$ is $\exists$ if $n$ is odd and $\forall$ if $n$ is even.

Similarly, a set $A \subseteq \Baire$ is $\Pi^0_n$ if and only if there exists a computable relation $R \subseteq \Nat^n$ such that

\alpha \in A \; \iff \; \forall k_1 \exists k_2 \forall k_3 \dots Q k_n \; R(k_1, k_2, \dots, k_{n-1}, \alpha\Rest{k_n}),

(11)

where $Q$ is $\forall$ if $n$ is odd and $\exists$ if $n$ is even.

The following result is at the heart of the effective theory.

Proof

( $\Rightarrow$ ) We proceed by induction on the Borel complexity.

Suppose $A$ is $\Sigma^0_1$ . Let $R$ be computable such that $A = \{ \alpha \colon \exists n \: R(\alpha\Rest{n})\}$ . Let

W = \{ \sigma \in \Nstr \colon R(\sigma)\}.

(12)

We have $\alpha \in A$ if and only if $\alpha \in \bigcup_{\sigma \in W} \Cyl{\sigma}$ . Since $R$ is decidable, $W$ is computable and $\gamma \in \Baire$ given by

\gamma(n) = \begin{cases} 1 & n = 0, \\ 0 & n \geq 1 \: \& \: \pi(n-1) \in W,\\ 17 & n \geq 1 \: \& \: \pi(n-1) \notin W, \end{cases}

(13)

is a computable $\bSigma^0_1$ code for $A$ .
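The construction of the code in (13) can be made concrete. The text only assumes that some computable bijection ${}\pi$ is fixed; the particular `pi` below (built from the pairing function by coding a nonempty string as head and coded tail) is one hypothetical choice, and all function names are our own.

```python
from math import isqrt
from typing import Callable, Optional, Tuple

String = Tuple[int, ...]
Real = Callable[[int], int]


def pair(x: int, y: int) -> int:
    return (x + y) * (x + y + 1) // 2 + y


def unpair(z: int) -> Tuple[int, int]:
    w = (isqrt(8 * z + 1) - 1) // 2
    y = z - w * (w + 1) // 2
    return w - y, y


def pi(n: int) -> String:
    """One possible bijection from numbers to strings (an assumption):
    0 codes the empty string, <head, code(tail)> + 1 codes head + tail."""
    sigma = []
    while n > 0:
        head, n = unpair(n - 1)
        sigma.append(head)
    return tuple(sigma)


def pi_inverse(sigma: String) -> int:
    n = 0
    for head in reversed(sigma):
        n = pair(head, n) + 1
    return n


def open_code(R: Callable[[String], bool]) -> Real:
    """The code gamma from (13): gamma(0) = 1, and for n >= 1,
    gamma(n) = 0 if R(pi(n - 1)) holds and 17 otherwise."""
    def gamma(n: int) -> int:
        if n == 0:
            return 1
        return 0 if R(pi(n - 1)) else 17
    return gamma


def semi_decide_from_code(gamma: Real, beta: Real, max_n: int) -> Optional[int]:
    """Bounded search for n with gamma'(beta restricted to n) = 0.
    None means nothing was found up to max_n, not that beta is outside."""
    for n in range(max_n + 1):
        prefix = tuple(beta(i) for i in range(n))
        if gamma(pi_inverse(prefix) + 1) == 0:
            return n
    return None


# Hypothetical example: R holds of the nonempty strings whose first entry is 0.
gamma = open_code(lambda sigma: len(sigma) >= 1 and sigma[0] == 0)
```

The value 17 plays no special role: any nonzero value marks a string as not belonging to $W$.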

If $A$ is $\Pi^0_n$ , then $A = \Co{B}$ for some $\Sigma^0_n$ $B$ . By inductive hypothesis, $B$ has a computable $\bSigma^0_n$ code ${}\gamma$ . Then $(2,\gamma)$ is a computable $\bPi^0_n$ code for $A$ .

Finally, assume that $A$ is $\Sigma^0_{n+1}$ . Let $P$ be $\Pi^0_n$ such that $\alpha \in A \iff \exists m \; (m,\alpha) \in P$ .

By inductive hypothesis, $P$ has a computable $\bPi^0_n$ code ${}\gamma$ . If we let $P_m = \{\beta \colon (m,\beta) \in P\}$ , then $A = \bigcup_m P_m$ . Thus, it suffices to show that we can uniformly obtain codes for $P_m$ .

We leave the proof as an exercise. Proceed by induction on the Borel complexity of ${}\gamma$ .

( $\Leftarrow$ ) We proceed by induction on the complexity of the code ${}\gamma$ .

Suppose ${}\gamma$ is of the form $(1,\alpha)$ , with ${}\alpha$ coding an open set $U$ . Then

\beta \in U \iff \exists n \: \alpha(\beta\Rest{n})= 0.

(15)

Since ${}\gamma$ is assumed to be computable, the computable relation

R(\sigma) :\iff \alpha(\sigma) = 0

(16)

witnesses that $U$ is $\Sigma^0_1$ .

If $\gamma = (2, \alpha)$ is a $\bPi^0_n$ code, then ${}\alpha$ is a $\bSigma^0_n$ code. By inductive hypothesis, the set coded by ${}\alpha$ is $\Sigma^0_n$ , so by definition of the effective hierarchy and the Borel codes, ${}\gamma$ codes a $\Pi^0_n$ set.

Finally, assume $\gamma = (3,\alpha)$ is a $\bSigma^0_{n+1}$ code for a set $B$ . Then each $(\alpha)_m$ is a $\bPi^0_n$ code for a set $A_m$ .

The proof is similar to that of Lemma 1.

By inductive hypothesis, the set $A$ as in the Lemma is $\Pi^0_n$ and we have

\beta \in B \iff \exists m (m, \beta )\in A,

(18)

which implies $B$ is $\Sigma^0_{n+1}$ .

Relativization

Using relativized computations via oracles, we can define a relativized version of the effective Borel hierarchy. This way we can capture all Borel sets of finite order, not just the ones with computable codes.

A straightforward relativization gives the following analogue of Proposition 3.

We can now present the fundamental theorem of effective descriptive set theory.

Proof

If $A$ is $\bSigma^0_n$ , then by Proposition 1 it has a $\bSigma^0_n$ -code ${}\gamma$ , and by Proposition 4, $A$ is $\Sigma^0_n(\gamma)$ . The other direction follows immediately from Proposition 4.

The argument for $\bPi^0_n$ is analogous.

The theorem facilitates working with Borel sets considerably. As an example, consider the set

A = \{ \alpha \colon \text{${}\alpha$ eventually constant} \}.

(19)

We have

\alpha \in A \iff \exists n \forall m [ m \geq n \: \Rightarrow \: \alpha(n) = \alpha(m) ]

(20)

The predicate in the square brackets is computable and depends only on the finite initial segment $\alpha\Rest{m+1}$ . Therefore, $A$ is a $\Sigma^0_2$ set, and in particular a $\bSigma^0_2$ Borel set.
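To see the quantifier structure of (20) at work, here is a Python sketch that evaluates a bounded version of the formula. A $\Sigma^0_2$ condition is in general not decidable, so this only searches for a candidate witness $n$ below `n_bound` and tests it against finitely many $m$; the function name and both bounds are our own illustrative choices.

```python
from typing import Callable, Optional

Real = Callable[[int], int]


def eventually_constant_witness(alpha: Real, n_bound: int, m_bound: int) -> Optional[int]:
    """Return the least n < n_bound such that alpha(m) == alpha(n) for all
    m with n <= m < m_bound. This is a finite approximation of
    'exists n, for all m >= n, alpha(n) = alpha(m)': a returned n is only
    a candidate, since alpha could still change beyond m_bound."""
    for n in range(n_bound):
        if all(alpha(m) == alpha(n) for m in range(n, m_bound)):
            return n
    return None


# Hypothetical examples: 0, 1, 2, 3, 3, 3, ... and the identity sequence.
stabilizing = lambda k: k if k < 3 else 3
identity = lambda k: k
```

For `stabilizing` the search returns the point from which the sequence is constant, while for `identity` every candidate is refuted by the next entry. The inner universal quantifier can only ever be refuted, never confirmed, by a finite computation, which is why the set sits at the $\Sigma^0_2$ level rather than $\Sigma^0_1$.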
