nLab substitution

Contents

Context

Type theory

natural deduction metalanguage, practical foundations

judgement
hypothetical judgement, sequent
- antecedents $\vdash$ consequent, succedents

type theory (dependent, intensional, observational type theory, homotopy type theory)

calculus of constructions

syntax object language

theory, axiom
proposition/type (propositions as types)
definition/proof/program (proofs as programs)
theorem

computational trinitarianism =
propositions as types +programs as proofs +relation type theory/category theory

logic	set theory (internal logic of)	category theory	type theory
proposition	set	object	type
predicate	family of sets	display morphism	dependent type
proof	element	generalized element	term/program
cut rule		composition of classifying morphisms / pullback of display maps	substitution
introduction rule for implication		counit for hom-tensor adjunction	lambda
elimination rule for implication		unit for hom-tensor adjunction	application
cut elimination for implication		one of the zigzag identities for hom-tensor adjunction	beta reduction
identity elimination for implication		the other zigzag identity for hom-tensor adjunction	eta conversion
true	singleton	terminal object/(-2)-truncated object	h-level 0-type/unit type
false	empty set	initial object	empty type
proposition, truth value	subsingleton	subterminal object/(-1)-truncated object	h-proposition, mere proposition
logical conjunction	cartesian product	product	product type
disjunction	disjoint union (support of)	coproduct ((-1)-truncation of)	sum type (bracket type of)
implication	function set (into subsingleton)	internal hom (into subterminal object)	function type (into h-proposition)
negation	function set into empty set	internal hom into initial object	function type into empty type
universal quantification	indexed cartesian product (of family of subsingletons)	dependent product (of family of subterminal objects)	dependent product type (of family of h-propositions)
existential quantification	indexed disjoint union (support of)	dependent sum ((-1)-truncation of)	dependent sum type (bracket type of)
logical equivalence	bijection set	object of isomorphisms	equivalence type
	support set	support object/(-1)-truncation	propositional truncation/bracket type
		n-image of morphism into terminal object/n-truncation	n-truncation modality
equality	diagonal function/diagonal subset/diagonal relation	path space object	identity type/path type
completely presented set	set	discrete object/0-truncated object	h-level 2-type/set/h-set
set	set with equivalence relation	internal 0-groupoid	Bishop set/setoid with its pseudo-equivalence relation an actual equivalence relation
	equivalence class/quotient set	quotient	quotient type
induction		colimit	inductive type, W-type, M-type
higher induction		higher colimit	higher inductive type
-		0-truncated higher colimit	quotient inductive type
coinduction		limit	coinductive type
	preset		type without identity types
	set of truth values	subobject classifier	type of propositions
domain of discourse	universe	object classifier	type universe
modality		closure operator, (idempotent) monad	modal type theory, monad (in computer science)
linear logic		(symmetric, closed) monoidal category	linear type theory/quantum computation
proof net		string diagram	quantum circuit
(absence of) contraction rule		(absence of) diagonal	no-cloning theorem
		synthetic mathematics	domain specific embedded programming language

homotopy levels

semantics

Edit this sidebar

Idea
Syntactic substitution

Simultaneous substitution
Avoiding variable capture
As an admissible type inference rule

Explicit substitution
Categorical semantics

Definition
Examples

Related concepts
References

Idea

Syntactic substitution of/for variables is one of the basic operations in formal mathematics, such as in formal logic and type theories, where it is part of the structural rules.

Syntactic substitution

Substitution is usually defined as an operation on expressions (such strings of letters from an alphabet and representing terms, formulas, propositions, dependent types, etc.) containing variables.

Suppose that $P$ is an expression in the context of a variable $x \,\colon\, X$ of type, and that $t \colon X$ is an expression which has the same type as $x$ . Then one denotes by

P[t/x]

the result of substituting $t$ for all occurrences of $x$ in $P$ .

For example, if $P$ is $x^2 + 2x y + 3$ and $t$ is $(y+z)$ , then $P[t/x]$ is $(y+z)^2 + 2(y+z)y + 3$ .

Remark

(substitution is a meta-operation on syntax) In this approach, substitution is an operation on syntax, not an element of syntax itself. In particular, the bracket notation $[t/x]$ is part of “meta-syntax”, not the syntax in question.

That is, the literal string of symbols “ $P[t/x]$ ” is not itself an expression in the language under consideration, but denotes such an expression, in the same way that “ $2+2$ ” is not literally an integer but denotes the numeral “ $4$ ”.

Compare also Rem. below.

Simultaneous substitution

Substitution for multiple variables does not, in general, commute. That is, the expressions

P[t/x][s/y] \qquad\text{and}\qquad P[s/y][t/x]

are not in general the same: The former substitutes $s$ for occurrences of $y$ in $t$ , but not $t$ for occurrences of $x$ in $s$ , while the latter has the opposite behavior. We also write

P[t,s/x,y]

to denote the simultaneous substitution of $t$ for $x$ and $s$ for $y$ , in which neither occurrences of $x$ in $s$ nor occurrences of $y$ in $t$ are substituted for; this is generally not the same as either iterated substitution.

Avoiding variable capture

If the language in question contains variable binders, then there is a subtlety to substitution: if $t$ contains free variables that are bound in $P$ , then we cannot simply textually substitute $t$ for $x$ and obtain an expression with the desired meaning.

For instance, if $P$ is $\exists y (x + y = 1)$ , and $t$ is the free variable $y$ , then a literal interpretation of $P[t/x]$ would produce $\exists y (y + y = 1)$ . But $P$ is true (universally in its free variables) if the variables have type $\mathbb{Z}$ , while $\exists y (y + y = 1)$ is not. The free variable $y$ in $t$ has been “captured” by the binder $\exists y$ in $P$ .

We say that $t$ is substitutable for $x$ in $P$ if performing a literal textual substitution as above would not result in undesired variable capture. If $t$ is not substitutable for $x$ in $P$ , then we can always replace $P$ by an alpha-equivalent expression in which $t$ is substitutable for $x$ . Since we often consider formulas only up to $\alpha$ -equivalence anyway, one usually defines the notation “ $P[t/x]$ ” to include an $\alpha$ -conversion of $P$ , if necessary, to make $t$ substitutable for $x$ .

In computer implementations of type theories, however, the issue of variable binding and capture is one of the trickiest things to get right. Performing $\alpha$ -conversions is difficult and tedious, and other solutions exist, such as using de Bruijn indices to represent bound variables.

As an admissible type inference rule

A general property of type theories (and other formal mathematics) is that substitution is an admissible rule.

Roughly, this means that if $P$ is an expression of some type, then so is the result $P[t/x]$ of substitution (as long as $t$ and $x$ have the same type). This is generally not a rule “put into” the theory, but rather a property one proves about the theory; type theorists say that substitution is an admissible rule rather than a derivable rule?.

For instance, in the language of dependent type theory the following substitution rule (one of the structural rules) is an admissible rule:

(1)

Sub \frac{ \Gamma,\; (x:A) \vdash P \;type \qquad \Gamma \vdash (t:A) }{ \Gamma \vdash P[t/x] \;type } \,.

Here “admissibility” means that if there exist derivations of $\Gamma \vdash (t:A)$ and $(x:A) \vdash P \;type$ , then there also exists a derivation of $\Gamma \vdash P[t/x] \;type$ . By contrast, saying that this is a derivable rule would mean that it can occur itself as part of a derivation, rather than being a meta-statement about derivations.

The substition rule is closely related to the cut rule, and admissibility of such rules is generally proven by cut elimination.

Remark

(alternative typesetting of substitution rule)
In view of Remark we may re-express the substitution rule (1) by using actual syntactic substitutions instead of the meta-instruction “[t/x]” to perform these, if only we make explicit the variable dependency of all terms, say by a subscript:

Sub \frac{ \gamma \colon \Gamma ,\;\; a_\gamma \colon A_\gamma \;\;\; \vdash \;\;\; P_{a_\gamma} \;type \;\;\;\;\;\;\;\;\;\;\; \gamma \colon \Gamma \;\;\; \vdash \;\;\; t_\gamma \colon A }{ \gamma \colon \Gamma \;\;\; \vdash \;\;\; P_{t_\gamma} \;type }

Explicit substitution

An alternative approach to substitution is to make substitution part of the object language rather than the metalanguage. That is, the notation

P[t/x]

is now actually itself a string of the language under consideration. One then needs reduction or equality rules describing the relationship of this string $P[t/x]$ to the result of actually substituting $t$ for $x$ as in the usual approach. See explicit substitution for more details.

Categorical semantics

Definition

In the categorical semantics of type theory:

Recalling that terms are interpreted by morphisms, substitution of a term into another term is interpreted by composition of the relevant morphisms.
Recalling that propositions are interpreted by subobjects, substitution of a term $t$ into a proposition $P$ is interpreted by pullback or inverse image of the subobject interpreting $P$ along the morphism interpreting $t$ .
Recalling that dependent types are interpreted by display maps, substitution of a term $t$ into a dependent type $B$ is interpreted by pullback of the display map interpreting $B$ along the morphism interpreting $t$ .
Or else, since dependent types are also given by classifying morphisms into a type of types, substitution corresponds to composition of these classifying morphisms with the given morphism.

In the third case, there is a coherence issue: syntactic substitution in the usual approach is strictly associative, whereas pullback in a category is not. One way to deal with this is by using explicit substitution as described above. Another way is to strictify the category before modeling type theory; see categorical model of dependent types. For literature see (Curien-Garner-Hofmann, Lumsdaine-Warren 13).

Examples

Let $\mathcal{C}$ be a suitable ambient category in which we are interpreting logic/type theory.

Suppose $X$ and $Y$ are types, hence interpreted as objects of $\mathcal{C}$ . Then a term of function type $f : X \to Y$ is interpreted by a morphism, going by the same symbols.

Now a proposition about terms of type $Y$

y : Y \vdash P(y)

is interpreted as an object of the slice category $\mathcal{C}_{/Y}$ , specifically as a (-1)-truncated object if it is to be a proposition, hence by a monomorphism

\array{ P \\ \downarrow \\ Y } \,.

For instance if $\mathcal{C} =$ Set then this is the inclusion of the subset of elements of $Y$ on which $P$ is true. And generally we may write

P = \{y : Y | isInhab(P(y)) \} \,.

Now finally the substitution of $f(x)$ for $y$ in $P$ , hence the proposition

\array{ P(f(-)) \\ \downarrow \\ X }

is interpreted as the pullback

\array{ P(f(-)) \coloneqq & f^* P &\to& P \\ & \downarrow && \downarrow \\ & X &\stackrel{f}{\to}& Y } \,.

Notice that monomorphisms are preserved by pullback, so that this is indeed again the correct interpretation of a proposition.

Specifically, if $X$ is the unit type it is interpreted as a terminal object of $\mathcal{C}$ , and then the function $f$ is identified simply with a term $y_0 \coloneqq f(*)$ . In this case the substitution is evaluation of the proposition at $y_0$ , the resulting monomorphism

\array{ P(y_0) &\longrightarrow& P \\ \big\downarrow && \big\downarrow \\ \ast & \overset {y_0} {\longrightarrow} & Y }

over the terminal object is a truth value: the truth value of $P$ at $y_0$ .

Related concepts

	type theory	category theory
	syntax	semantics
	natural deduction	universal construction
	substitution…………………….	pullback of display maps
	$\frac{ x_2 \colon X_2\; \vdash\; A(x_2) \colon Type \;\;\;\; x_1 \colon X_1\; \vdash \; f(x_1)\colon X_2}{ x_1 \colon X_1 \;\vdash A(f(x_1)) \colon Type}$	$\,$ $\array{ f^* A &\to& A \\ \downarrow && \downarrow \\ X_1 &\stackrel{f}{\to}& X_2 }$

References

Discussion of the substitution rule in intuitionistic dependent type theory:

Bart Jacobs, p. 123 in: Categorical Logic and Type Theory, Studies in Logic and the Foundations of Mathematics 141, Elsevier (1998) [ISBN:978-0-444-50170-7, pdf]

(emphasis on the categorical model of dependent types)
Univalent Foundations Project, p. 423 of: Homotopy Type Theory – Univalent Foundations of Mathematics (2013) [web, pdf]

(in the context of homotopy type theory)
Egbert Rijke, p. 4 of: Dependent type theory [pdf], Lecture 1 in: Introduction to Homotopy Type Theory, lecture notes, CMU (2018) [pdf, pdf, webpage]

(in the context of homotopy type theory)

The observation that substitution forms an adjoint pair/adjoint triple with quantifiers is due to

Bill Lawvere, Adjointness in Foundations, (TAC), Dialectica 23 (1969), 281-296

and further developed in

Bill Lawvere, Equality in hyperdoctrines and
comprehension schema as an adjoint functor_, Proceedings of the AMS Symposium on Pure Mathematics XVII (1970), 1-14.

Exposition of the interpretation of substitution as pullback:

Andrej Bauer, Substitution is pullback, 2012

The coherence issue involved in making this precise is discussed in

Pierre-Louis Curien, Richard Garner, Martin Hofmann, Revisiting the categorical interpretation of dependent type theory (pdf)
Peter LeFanu Lumsdaine, Michael Warren, An overlooked coherence construction for dependent type theory, CT2013 (pdf)

Last revised on August 15, 2023 at 19:34:17. See the history of this page for a list of all contributions to it.