[Book] Untyped Systems (TAPL 3-7)

Types and Programming Languages by Benjamin C. Pierce

Untyped Systems

Chapter 3: Untyped Arithmetic Expressions

Syntax

There are three ways to define the syntax for our basic untyped arithmetic language:

Inductively: The set of terms is the smallest set such that: 1. \(\{ \textbf{true}, \textbf{false}, 0 \} \subseteq \mathcal{T}\) 2. If \(t_1 \in \mathcal{T}\), then \(\{ \text{succ } t_1, \text{pred } t_1, \text{iszero } t_1 \} \subseteq \mathcal{T}\) 3. If \(t_1 \in \mathcal{T}\), \(t_2 \in \mathcal{T}\), and \(t_3 \in \mathcal{T}\), then \(\text{if } t_1 \text{ then } t_2 \text{ else } t_3 \in \mathcal{T}\)

The word “smallest” here just means that \(\mathcal{T}\) has no elements besides the ones required to satisfy these three clauses. Here, since you only build elements that are, by definition, in the set – this is equivalent to saying that there are no duplicates (?).

This is an infinite set!

By inference rules: The set of terms \(T\) is defined by the following rules:

\[ \text{true} \in T \quad \text{false} \in T \quad 0 \in T \]

\[ \frac{t_1 \in T}{\text{succ } t_1 \in T} \qquad \frac{t_1 \in T}{\text{pred } t_1 \in T} \qquad \frac{t_1 \in T}{\text{iszero } t_1 \in T} \]

\[ \frac{t_1 \in T \quad t_2 \in T \quad t_3 \in T} {\text{if } t_1 \text{ then } t_2 \text{ else } t_3 \in T} \]

These are read as: “If premise is true (stuff above the line), then we can derive the conclusion (stuff below the line)”.

The rules with no premises are known as “axioms”.

What we are calling inference rules here are actually “rule schemas” – since their premises and conclusions may include metavariables. Each schema represents an infinite set of concrete rules that can be obtained by replacing the metavariables with all their appropriate values.

Concretely: For each natural number \(i\), define a set \(S_i\) as follows:

\[ S_0 = \varnothing \]

\[ \begin{aligned} S_{i+1} =\;& \{ \text{true}, \text{false}, 0 \} \\ &\cup \{ \text{succ } t_1, \text{ pred } t_1, \text{ iszero } t_1 \mid t_1 \in S_i \} \\ &\cup \{ \text{if } t_1 \text{ then } t_2 \text{ else } t_3 \mid t_1, t_2, t_3 \in S_i \} \end{aligned} \]

Finally, let

\[ S = \bigcup_i S_i. \]

3.2.4 How many elements does \(S_3\) have?

A general formula for the number of elements in each set is given by \(|S_{i+1}| = 3 + 3 \times S_i + |S_i|^3\). We have that \(|S_0| = 0\), so \(|S_3| = 59,439\).

3.2.5 Show that the sets \(S_i\) are cumulative–that is, that for each \(i\), we have \(S_i \subseteq S_{i+1}\).

Proof. This can be shown by a simple inductive proof.

Base Case: For \(i=0\), we have that \(S_0 = \emptyset\), so it follows trivially that \(S_0 \subseteq S_1\)

Inductive Hypothesis: Assume that for some \(j \geq 0\), we have that for all \(i < j\) \(S_i \subseteq S_{i+1}\).

Inductive Step: We need to show that \(S_i \subseteq S_j\). We do a case-by-case analysis on the types of terms:

It follows by the definition of \(S_j\) that all the constants are in \(S_j\).
For some term of the type \(\text{succ }t\), \(\text{pred }t\), or \(\text{iszero }t\), it follows that \(t \in S_{i-1}\), and therefore, by the inductive hypothesis, \(t \in S_i\). Then, by the construction of \(S_j\), we can see that \(\text{succ }t\), \(\text{pred }t\), and \(\text{iszero }t\) are all in \(S_j\).
Follows similarly to 2.

The first two definitions for defining the set of possible terms in the language “simply characterize the set as the smallest set satisfying certain closure properties”, while the concrete definition shows you how to actually construct the set as a limit of a sequence.”
We can prove that these two definitions are equivalent by showing that \(\mathcal{T} = \mathcal{S}\) by showing that \(\mathcal{S}\) satisfies the conditions satisfied by \(\mathcal{T}\) and by showing that if any set \(\mathcal{S}'\) satisfies these conditions, then \(\mathcal{S} \subseteq \mathcal{S}'\)

Induction is a prominent theme in working with programming language semantics

How terms are evaluated in a language is known as the “semantics” of the language

There are 3 main approaches to formalizing semantics: - Operational Semantics - Denotational Semantics - Axiomatic Semantics

This book deals exclusively with operational semantics.

Evaluation

An evaluation relation is a binary relation between terms \(t \rightarrow t'\) (pronounced \(t\) evaluates to \(t'\) in one step).

An inference rule is a set of premises and a conclusion

Definition An instance of an inference rule is obtained by consistently substituting each metavariable with the same term in the conclusion and all premises.

A rule is satisfied by a relation (i.e \(t \rightarrow t'\)) if, for each instance of the rule, either the conclusion is in the relation, or one of its premises is not.

Not sure what is meant here by “or one of its premises is not” since the evaluation relations are evaluated against inference rules using just their statements. A: OH, it means that in the case that a relation matches a rule, the premises of the rule are also derivable (using values for meta-variables based on the relation).

When the pair \((t, t')\) is in the evaluation relation that satisfies the inference rules, we say that the evaluation statement (or judgement) \(t \rightarrow t'\) is derivable. Another way of reading this is that an evaluation statement is derivable iff you could conjure a derivation tree with \(t \rightarrow t'\) as its root.

^ This property leads directly to a proof technique called induction on derivations.

Determinacy of one-step evaluations If \(t \rightarrow t'\) and \(t \rightarrow t''\) then \(t' = t''\)

3.5.5 Induction on derivations If, for each evaluation statement \(t \rightarrow t'\), for each inference rule deriving \(t \rightarrow t'\), given \(P(t^*)\) for all sub-derivations (premises) \(t^*\) we can show \(P(t \rightarrow t')\) then we can show \(P(J)\) holds for all evaluation statements \(J\) of the form \(t \rightarrow t'\)

We say that a term \(t\) is in normal form if there is no evaluation rule that applies to it. It follows that every value is in normal form. However, the converse is not always true – there are other normal forms other than values.

3.5.10 The multi-step evaluation relation \[ \frac{}{t \rightarrow^* t} \qquad \frac{t \rightarrow t'}{t \rightarrow^* t'} \qquad \frac{t \rightarrow t' \; t' \rightarrow t''}{t \rightarrow^* t''} \]

Uniqueness of normal forms If \(t \rightarrow^* u\) and \(t \rightarrow^* u'\), where \(u\) and \(u'\) are both normal forms, then \(u = u'\)

In our simple arithmetic language, it guaranteed that every term can be evaluated to a value. Of course, this need not be necessary in languages with a richer set of features.
Termination proofs in computer science follow a similar structure: - Establish some well-founded set \(\mathcal{S}\) - Establish a “termination measure”, i.e a function \(f\) that maps the terms / states of your abstract machine \(t \rightarrow t'\) such that \(f(t') < f(t)\). - It follows from the definition of \(\mathcal{S}\) that evaluation must terminate.

A set is well-founded (more precisely, a relation on a set is well-founded) when there are no infinite descending chains with respect to that relation.

Termination of evaluation (For our UAE language) For every term \(t\) there is some normal \(t'\) such that \(t \rightarrow^* t'\)

Proof: Observe that every evaluation step reduces the size of the term, and that size is a termination measure because the usual order is well-founded on the set of natural numbers.

A closed term (i.e no unbound variables) is stuck if it is in normal form but not a value.

3.5.16 TODO