module 1Lab.intro where

Introduction🔗

This page aims to present a first introduction to cubical type theory, from the perspective of a mathematician who has heard about type theory but has no previous familiarity with it. Specifically, the kind of mathematician that we are appealing to is one who is familiar with some of the ideas in category theory and homotopy theory — however, the text also presents the concepts syntactically, in a way that can be read without any prior mathematical knowledge.

Note that whenever code appears, all of the identifiers are hyperlinked to their definitions, which are embedded in articles that describe the concepts involved. For instance, in the code block below, you can click the symbol to be taken to the page on path types.

_ : 2 + 24
_ = refl

For some operations, we use the built-in Agda definitions, which do not have explanations attached. This is the case for the + operation above. However, in any case, those definitions are either built-ins (in which case only the type matters), or defined as functions, in which case the implementation is visible. The most important built-ins for Cubical Agda, and those most likely to lead you to a built-in Agda page, have their own explanations, linked below:

  • The 1Lab.Path page explains the primitives I, i0, i1, _∧_, _∨_, ~, PathP, Partial, _[_↦_], and hcomp.

With that out of the way, the first thing to do is the aforementioned refresher on type theory. If you’re already familiar with type theory, feel free to skip to What’s next?. If you’re not familiar with some of the connections that HoTT has to other areas of mathematics, like higher category theory and topology, I recommend that you read the type theory section anyway!

Terminology On the term “homotopy type theory”.

The text below refers to the foundational system that it is presenting as “homotopy type theory”, while it would be technically more correct to refer to it as a homotopy type theory. HoTT is not a specific type theory, but a family of type theories that put emphasis on the homotopy-theoretic interpretation of type theory. In particular, a homotopy type theory is expected to validate univalence and have some class of higher inductive types.

Below, we describe cubical type theory, which is one particular homotopy type theory. In cubical type theory, we interpret types as the Kan complices in the category of De Morgan cubical sets, PSh()\mathrm{PSh}(\square). This interpretation gives rise to a model structure on the category of cubical sets, referred to as a “cubical-type model structure”.

Type theory🔗

\ Warning

The text below is subject to change! It’s still very much a document in flux. In particular, the type theory section doesn’t talk about induction, which is something that will be expanded on in the future.

Type theory is a foundational system for mathematics which, in contrast to a set-theoretic foundation like ZFC, formalises mathematical constructions rather than mathematical proofs. That is, instead of specifying which logical deductions are valid, and then giving a set of axioms which characterise the behaviour of mathematical objects, a type-theoretic foundation specifies directly which mathematical constructions are valid.

Formally speaking, it is impossible to construct objects in set theoretic foundations; Rather, one can, by applying the deduction rules of the logical system and appealing to the axioms of the set theory, prove that an object with the desired properties exists. As an example, if we have a way of referring to two objects xx and yy, we can prove that there exists a set representing the ordered pair (x,y)(x, y), using the axiom of pairing. Writing quasi-formally, we could even say that this set is constructed using the expression {x,{x,y}}\{x, \{x, y\} \}, but this is only a notational convenience.

In contrast, a type-theoretic foundational system, such as HoTT (what we use here), specifies directly how to construct mathematical objects, and proving a theorem becomes a special case of constructing an object. To specify what constructions are well-behaved, we sort the mathematical objects into boxes called types. Just like in set theory, the word “set” is a primitive notion, in type theory, the word “type” is a primitive notion. However, we can give an intuitive explanation: “A type is a thing that things can be”. For instance, things can be natural numbers, but things can not be “two” — hence, the natural numbers are a type, but two is not. Formally, a type is specified by giving:

  • A formation rule, which tells us which expressions denote valid types at all. For instance, the formation rule for product types tells us that N×N\mathbb{N} \times \mathbb{N} denotes a well-formed type, but “N×\mathbb{N} \times \to” does not. The formation rules also have the duty of preventing us from running into size issues like Russell’s paradox, by constraining the use of universes.

  • A handful of introduction rules, which tell us how to piece previously-constructed objects together to form an object of a different type. Sticking with products as an example, if we have previously constructed objects called aa and bb, we can construct an ordered pair (a,b)(a, b).

    Since the introduction rules are inseparable from their types, it is impossible to construct an object without knowing in which type it lives. If we used the rule which introduces ordered pairs, we know for sure that we built an object of the product type. To specify that an object lives in a certain type, we use the notation a:Aa : A — which can be read as “aa inhabits AA”, or “aa is a point of AA”, or “aa has type AA”.

    The notation for stating inhabitation agrees with mathematical parlance in when both make sense. For example, a function from the reals to the reals is commonly introduced with the notation f:RRf : \mathbb{R} \to \mathbb{R}. In type theory, this is made formal as a typing declaration: ff is a point of the type of functions RR\mathbb{R} \to \mathbb{R}, hence we know that the rules of function types can be used with ff. However, when introducing a more complex object, we’re forced to break into informality and conventional shorthand: “Let G be a group”. In HoTT, we can directly write G:GroupG : \mathrm{Group}1, which tells us that the rules for the type of groups apply to GG.

  • A handful of elimination rules, which say how inhabitants of that type can be manipulated. Keeping with the running example, the product type has elimination rules corresponding to projecting either the first or second component of a pair.

    Furthermore, each elimination rule comes with specified behaviour for when it “interacts” with the introduction rules, a process which we generally call reduction. Repeatedly applying these “interactions” tells us how a given construction can have computational behaviour — by performing a delicate dance of elimination and introduction, we can go from 2+22 + 2 to 44.

Now that we have an intuitive motivation for the notions of type theory, we can more precisely discuss the structure of formal type theory and how it differs from formal (material) set theory. The first (and perhaps most important) difference is that formal set theory is a “two layered” system, but type theory only has “one layer”. In (for example) ZFC, we have the deductive framework of first-order logic (the “first layer”), and, using the language of that logical system, we assert the axioms of set theory. This remains valid if we switch set theories: For instance, IZF would substitute first-order logic with first-order intuitionistic logic, and remove the assertion that the axiom of choice holds.

In contrast, type theory only has one layer: The layer of types. What in formal set theory would be a deduction in FOL, in type theory becomes the construction of an inhabitant of a certain type, where propositions are identified with certain types as in the table below. Note that up to a first approximation, we can read each type former as denoting a specific set, or as denoting a specific space.

Types Logic Sets Spaces
AA proposition set space
a:Aa : A proof element point
B(x)B(x) predicate family of sets fibration
b(x):B(x)b(x) : B(x) conditional proof family of elements section
,\bot, \top ,\bot, \top ,{}\emptyset, \{\emptyset\} ,{}\emptyset, \{*\}
A×BA \times B ABA \land B Cartesian product product space
ABA \uplus B ABA \lor B disjoint union coproduct space
ABA \to B ABA \to B function set function space
(x:A)B(x)\prod_{(x : A)}B(x) (x:A)B(x)\forall{(x : A)}B(x) n-ary product space of sections
(x:A)B(x)\sum_{(x : A)}B(x) (x:A)B(x)\exists{(x : A)}B(x) n-ary disjoint union total space
xAyx \equiv_A y equality x=yx = y diagonal {(x,x):xA}\{ (x,x) : x \in A\} path space AIA^\mathbb{I}

This segues nicely into another difference between type theory and set theory, which concerns the setup of their deductive systems. A deductive system is a specification for how to derive judgements using rules. We can think of the deductive system as a game of sorts, where the judgements are to be thought of as board positions, and the rules specify the valid moves.

In the deductive system of first-order logic, there is only one kind of judgement, stating that a proposition has a proof. If AA is a proposition, then the judgement “AA has a proof” is written “A\vdash A”. Note that the judgement A\vdash A is a part of the metatheory, and the proposition AA is a part of the theory itself. That is, A\vdash A is an external notion, and AA is an internal notion.

In type theory, the basic judgement is inhabitation, written a:A\vdash a : A. If AA is a type that denotes a proposition, then this is analogous to the judgement A\vdash A of first-order logic, and we refer to aa as a witness of AA, or simply as a proof of AA. Even better, if AA is a purely logical proposition such that A\vdash A holds without any appeal to the principle of excluded middle, then there is a term aa for which a:A\vdash a : A holds in type theory.

When AA is being thought of as a collection of things, we might imagine that a:Aa : A is analogous to aAa \in A. However, this is not the case: While aAa \in A is a proposition (an internal statement), a:Aa : A is a judgement, an external statement. That is, we can not use a:Aa : A with the propositional connectives mentioned in the table above — we can not make statements of the form “if it is not the case that a:Aa : A, then b:Bb : B”. This is the second difference we will mention between set theory and type theory: The proposition aAa \in A is an internal relation between previously-constructed terms aa and AA, but the external judgement a:A\vdash a : A is indivisible, and it does not make sense to talk about aa without also bringing up that it’s in AA.

The third and final difference is the treatment of equality, which is arguably also the most important part of homotopy type theory. In set theory, given sets aa and bb, we have a proposition a=ba = b, which is given meaning by the axiom of extensionality: a=ba = b whenever aa and bb behave the same with respect to \in. Correspondingly, in type theory, we have an equality type, written aAba \equiv_A b (where a,b:Aa, b : A), giving us the internal notion of equality. But we also have an external notion of equality, which encodes the aforementioned interactions between introduction and elimination rules. This is written a=b:A\vdash a = b : A, and it’s meant to indicate that aa and bb are the same by definition — hence we call it definitional equality.

Since definitional equality is a judgement, it’s also not subject to the internal propositional connectives — we can not prove, disprove, assume or negate it when working inside type theory, for it does not make sense to say “if aa and bb are equal by definition, then […]”. Correspondingly, in the Agda formalisation, definitional equality is invisible, since it’s represented by the computation rules of the type theory.

In the rest of this section, we will recall the presentation of the most ubiquitous types. The idea is that if you’ve encountered type theory before, then skimming the explanation here should help snap your mind back into focus, while if this is your first time interacting with it, then the content here should be enough for you to understand the rest of the 1lab development.

Functions🔗

If we have two previously-constructed types AA and BB, we can form the type of functions from AA to BB, written ABA \to B. Often, functions will also be referred to as maps. A function is, intuitively, a rule prescribing how to obtain an inhabitant of BB given an inhabitant of AA. In type theory, this is not only an intuition, but rather a definition. This is in contrast with set theory, where functions are defined to be relations satisfying a certain property.

Given a function f:ABf : A \to B, we can apply it to an inhabitant a:Aa : A, resulting in an inhabitant f af\ a of the codomain BB, called the value of ff at aa. For clarity, we sometimes write f(a)f(a) for the value of ff at aa, but in Agda, the parentheses are not necessary.

The most general form for constructing an inhabitant of ABA \to B is called lambda abstraction, and it looks like (λ(x:A).e)(\lambda (x : A). e), though shorthands exist. In this construction, the subexpression ee denotes the body of a function definition, and xx is the parameter. You can, and should, think of the expression λx.e\lambda x. e as being the same as the notation xex \mapsto e often used for specifying a map. Indeed, Agda supports definition of functions using a more traditional, “rule-like” notation, as shorthand for using λ\lambda abstraction. For instance, the following definitions of inhabitants f and g of NatNat\mathrm{Nat} \to \mathrm{Nat} are definitionally the same:

  f : Nat  Nat
  f x = x * 3
  -- Function definition
  -- with a "rule".
  g : Nat  Nat
  g = λ x  x * 3
  -- Function definition
  -- with λ abstraction

In general, we refer to a way of constructing an inhabitant of a type as an introduction rule, and to a way of consuming an inhabitant of a type as an elimination rule. Concisely, the introduction rule for ABA \to B is λ\lambda abstraction, and the elimination rule is function application. A general principle of type theory is that whenever an elimination rule “meets” an introduction rule, we must describe how they interact.

In the case of λ\lambda-abstraction meeting function application, this interaction is called β\beta-reduction2, and it tells us how to compute a function application. This is the usual rule for applying functions defined by rules: The value of (λx. e)e(\lambda x.\ e) e' is ee, but with ee' replacing xx whenever xx appears in ee. Using the notation that you’d find in a typical3 type theory paper, we can concisely state this as:

(λx. e)ee[e/x] (\lambda x.\ e) e' \longrightarrow e[e'/x]

In addition, function types enjoy a definitional uniqueness rule, which says “any function is a λ\lambda expression”. Symbolically, this means that if we have f:ABf : A \to B, the terms ff and (λx. f x)(\lambda x.\ f\ x) are equal, definitionally. This is called η\eta-equality, and when applied right-to-left as a reduction rule, η\eta-reduction.

Functions of multiple arguments are defined by iterating the unary function type. Since functions are types themselves, we can define functions whose codomain is another function — where we abbreviate A(BC)A \to (B \to C) by ABCA \to B \to C. By the uniqueness rule for function types, we have that any inhabitant of this type is going to be of the form λx. λy. f x y\lambda x.\ \lambda y.\ f\ x\ y, which we abbreviate by λx y. f x y\lambda x\ y.\ f\ x\ y. In Agda, we can write this as λ x y → ..., or by mentioning multiple variables in the left-hand side of a rule, as shown below.

  f : Nat  Nat  Nat
  f x y = x + 3 * y

Universes🔗

Instead of jumping right into the syntactic definition (and motivation) for universes, I’m going to take a longer route, through topos theory and eventually higher topos theory, which gives a meaning explanation for the idea of universes by generalising from a commonly-understood idea: the correspondence between predicates and subsets. Initially, we work entirely in the category of sets, assuming excluded middle for simplicity. Then we pass to an arbitrary elementary topos, and finally to an arbitrary higher topos. If you don’t want to read about categories, click here

First, let’s talk about subsets. I swear I’m going somewhere with this, so bear with me. We know that subsets correspond to predicates, but what even is a subset?

  • If we regard sets as material collections of objects, equipped with a membership relation \in, then SS is a subset of TT if every element of SS is also an element of TT. However, this notion does not carry over directly to structural set theory (where sets are essentially opaque objects of a category satisfying some properties), or indeed to type theory (where there is no membership relation) — but we can salvage it.

    Instead of thinking of a subset directly as a subcollection, we define a subset STS \subseteq T to be an injective function f:STf : S \hookrightarrow T. Note that the emphasis is on the function and not on the index set SS, since we can have many distinct injective functions with the same domains and codomains (exercise: give two distinct subsets of N\mathbb{N} with the same domain4).

    To see that this definition is a generalisation of the material definition in terms of \in, we note that any injective function f:STf : S \hookrightarrow T can be restricted to a bijective function f:Sim(f)f : S \xrightarrow{\sim} \mathrm{im}(f), by swapping the original codomain for the image of the function. Since the image is a subset of the codomain, we can interpret any injective f:STf : S \hookrightarrow T as expressing a particular way that SS can be regarded as a subset of TT.

  • Alternatively, we can think of a subset as being a predicate on the larger set TT, which holds of a given element xx when xx is in the subset SS. Rather than directly thinking of a predicate as a formula with a free variable, we think of it as a characteristic function χ:T{0,1}\chi : T \to \{0,1\}, from which we can recover the subset SS as {xT:χ(x)=1}\{ x \in T : \chi(x) = 1 \}.

    This definition works as-is in structural set theory (where the existence of solution sets for equations — called “equalisers” — is assumed), but in a more general setting (one without excluded middle) like an arbitrary elementary topos, or indeed type theory, we would replace the 2-point set {0,1}\{0,1\} by an object of propositions, normally denoted with Ω\Omega.

These two definitions are both appealing in their own right — the “subsets are injective functions” can be generalised to arbitrary categories, by replacing the notion of “injective function” with “monomorphism”, while “subsets are predicates” is a very clean notion conceptually. Fortunately, it is well-known that these notions coincide; For the purposes of eventually generalising to universes, I’d like to re-explain this correspondence, in a very categorical way.

Consider some sets A,BA, B and an arbitrary function f:ABf : A \to B between them. We define the fibre of ff over yy to be the set f(y)={xA:f(x)=y}f^*(y) = \{ x \in A : f(x) = y \}, that is, the set of points of the domain that ff maps to yy — in the context of set theory, this would be referred to as the preimage of yy, but we’re about to generalise beyond sets, so the more “topological” terminology will be useful. We can often characterise properties of ff by looking at all of its fibres. What will be useful here is the observation that the fibres of an injective function have at most one element.

More precisely, let f:ABf : A \hookrightarrow B be injective, and fix a point yBy \in B so we can consider the fibre f(y)f^*(y) over y. If we have two inhabitants a,bf(y)a, b \in f^*(y), then we know (by the definition of fibre) that f(a)=y=f(b)f(a) = y = f(b), hence by injectivity of ff, we have a=ba = b — if f(y)f^*(y) has any points at all, then it has exactly one. It turns out that having at most one point in every fibre precisely characterises the injective functions: Suppose f(a)=f(b)f(a) = f(b); Then we have two inhabitants of f(f(a))f^*(f(a)), aa and bb, but since we assumed that f(y)f^*(y) has at most one inhabitant for any yy, we have a=ba = b, so ff is injective.

We call a set (type) XX with at most one element (at most one point) a subsingleton, since it is a subset of the singleton set (singleton type), or a proposition, since a choice of point xXx \in X doesn’t give you any more information than merely telling you “XX is inhabited” would. Since there is a 1-to-1 correspondence between subsingletons XX and propositions “XX is inhabited”, the assignment of fibres yf(y)y \mapsto f^*(y) defines a predicate χf:BΩ\chi_f : B \to \Omega, where we have implicitly translated between a subsingleton and its corresponding proposition.5

Conversely, if we have such an assignment χ:BΩ\chi : B \to \Omega, we can make a map into BB with subsingleton fibres by carefully choosing the domain. We take the domain to be the disjoint union of the family χ\chi, that is, the set (x:B)χ(x)\sum_{(x : B)} \chi(x). The elements of this set can be described, moving closer to type-theoretic language, as pairs (i,p)(i, p) where iBi \in B and pχ(i)p \in \chi(i) is a witness that the subset predicate holds of ii.

I claim: the map π1:((x:B)χ(x))B\pi_1 : (\sum_{(x : B)} \chi(x)) \to B which takes (i,p)i(i, p) \mapsto i has subsingleton fibres. This is because an element of π1(y)\pi^*_1(y) is described as a pair (i,q)(i , q), and since it’s in the fibre over yy, we have i=yi = y; Since χ\chi (by assumption) takes values in subsingletons, concluding that i=yi = y is enough to establish that any two (i,q)(i, q) and (i,q)π1(y)(i',q') \in \pi_1^*(y) must be equal.

The point of the entire preceding discussion is that the notions of “injections into BB” and “maps BΩB \to \Omega” are equivalent, and, more importantly, this is enough to characterise the object Ω\Omega6.

Let me break up this paragraph and re-emphasise, since it’ll be the key idea later: A type of propositions is precisely a classifier for maps with propositional fibres.

Since the subobjects of BB are defined to be injections XBX \to B, and maps BΩB \to \Omega represent these subobjects (in a coherent way), we call Ω\Omega a subobject classifier, following in the tradition of “classifying objects”. A subobject classifier is great to have, because it means you can talk about logical propositions — and, in the presence of functions, predicates — as if they were points of a type.

More on general classifying objects.

A classifying object for some data is an object — normally written B(data)B(\text{data}) — such that maps XB(data)X \to B(\text{data}) correspond to data “over” XX, in the sense that the data admits some map into XX. Above, we were discussing the subobject classifier that one can find in an elementary topos; Maps XΩX \to \Omega (the object of propositions) correspond to maps into XX with “propositional data”.

To find another example in category theory, we pass to higher category theory, i.e. nn-category theory for high values of nn, such as n=2n = 2. As a 1-dimensional structure, we can consider the data of a category EE lying over another category BB to be a functor EBE \to B which has certain lifting properties - some sort of categorical fibration.

One would like to find a correspondence between these “categories over categories” and some data that we already know how to conceptualise; It turns out that Grothendieck fibrations over BB — i.e., the functors F:EBF : E \to B satisfying said “certain lifting properties” — correspond to weak 2-functors into Cat\mathrm{Cat}.

Conversely, this equivalence lets us think of a family of categories indexed by BB — a 2-dimensional concept — corresponds precisely to 1-dimensional data (a special class of functors between ordinary categories). Furthermore, the equivalence is induced by a construction on (weak 2-)functors — the Grothendieck construction, written F\int F — that exactly mirrors the disjoint union F\sum F that we used above!

To summarise:

  • A classifying object B(data)B(\text{data}) is an object where maps XB(data)X \to B(\text{data}) corresponds to data over (“pointing towards”) XX.

  • The kind of classifying objects that we care about here are ubiquitous in all levels of nn-category theory, but especially in (n,1)(n,1)-category theory, where some behave like universes of a given truncation level.

  • Since the objects in a nn-category act like (n1)(n-1)-categories, the “most complex” classifier that a nn-topos can have is for internal the (n1)(n-1)-category of internal (n2)(n-2)-categories, and this all works better in the case where we’re actually talking about (n,1)(n,1)-topoi.

However, we often want to talk about sets and families of sets as if they were points of a type, instead of just subsets. We could pass to a higher category that has a discrete object classifier — this would be a “type of sets” —, but then the same infelicity would be repeated one dimension higher: we’d have a classifier for “sets”, but our other objects would look like “categories”, and we wouldn’t be able to classify those!

Take a moment to convince yourself that the interpretation of a discrete object classifier as a type of sets makes sense.

Note that we defined “proposition” to mean “at most one element”. We can generalise this to a notion which applies to categories by defining a discrete category to be one that has propositional hom-sets, i.e. one in which the only morphisms are identities. Note that a discrete category is automatically a discrete groupoid, since the identity map is an isomorphism.

A discrete object classifier (say Set\mathrm{Set}) would then be a classifier for functors which have discrete groupoids as fibres — the fibres have no categorical information, being nothing more than a set. Given any set XX, we’d have a name for it as the interpretation of the unique map Disc(X)\mathrm{Disc}(X) \to * as a map Set* \to \mathrm{Set}. The name Set\mathrm{Set} was chosen because in the (2,1)(2,1)-category Grpd\mathfrak{Grpd}, the discrete object classifier is exactly the category of sets.

Thus we might hope to find, in addition to a subobject classifier, we could have a general object classifier, an object Type\mathrm{Type} such that maps BTypeB \to \mathrm{Type} correspond to arbitrary maps ABA \to B. Let’s specify this notion better using the language of higher category theory:

A subobject classifier is a representing object for the functor Sub:EopSets\mathrm{Sub} : \mathcal{E}^{\mathrm{op}} \to \mathbf{Sets} which takes an object to its poset of subobjects. Since Sub(x)\operatorname*{Sub}(x) is a subcategory of E/x\mathcal{E}/x, restricted to the objects which are (-1)-truncated, an object classifier would be a representing object for E/\mathcal{E}/-, the functor which sends an object XX to the category of arbitraryXX-indexed families”, the slice category over XX. If capturing the directed structure isn’t possible, then, at the very least, it should represent Core(E/)\mathrm{Core}(\mathcal{E}/-) — the core of E\mathcal{E}, so that the isomorphisms between families are accurately represented.

However, there is an issue here: there is, a priori, an entire category E/x\mathcal{E}/x, but only a set HomE(x,U)\mathbf{Hom}_{\mathcal{E}}(x, U). Objects in a category can have non-trivial isomorphisms, but there’s no such notion between elements in a set! Let’s suppose that we had such an equivalence, in the context E=Sets\mathcal{E} = \mathbf{Sets}, setting aside the size issues for now: we have an object UU equipped with an isomorphism r:Hom(,U)Sets/Setsr : \mathbf{Hom}(*, U) \cong \mathbf{Sets}/* \cong \mathbf{Sets}. There’s an element b=r1(2):Ub = r^{-1}(2) : U classifying the type of booleans, which should have a non-trivial identification b=bb = b corresponding to the involution on 22. However, this is not possible, since a set (by definition) only has trivial identifications.

One might attempt to remedy the issue by looking for an object classifier in a bicategory, where Hom(,U)\mathbf{Hom}(-, U) would be a genuine category. However, this didn’t actually fix anything, since we only shifted the issue one level up: we now have an entire bicategory E/x\mathcal{E}/x, which can’t be classified by the mere category of maps HomE(,U)\mathbf{Hom}_{\mathcal{E}}(-, U)!

The issue does eventually resolve itself, if we’re very persistent: if we have an \infty-category E\mathcal{E}, where Hom(,)\mathbf{Hom}(-,-) can be a genuine \infty-groupoid, then it’s actually perfectly possible for there to be an object Type\mathrm{Type} with an isomorphism E/Hom(,Type)\mathcal{E}/- \cong \mathbf{Hom}(-, \mathrm{Type}) — well, at least as long as we restrict ourselves to Core(E/)\mathrm{Core}(\mathcal{E}/-). We’ve finally found a setting where object classifiers can exist: (,1)(\infty,1)-categories!

Again, the importance of object classifiers is that they let us talk about arbitrary objects as if they were points of an object Type\mathrm{Type}. Specifically, any object BB has a name, a map B:Type\ulcorner B \urcorner : * \to \mathrm{Type}, which represents the unique map BB \to *. To make this connection clearer, let’s pass back to a syntactic presentation of type theory, to see what universes look like “from the inside”.

Universes, internally🔗

Inside type theory, object classifiers present themselves as types which contain types, which we call universes. Every type is contained in some universe, but it is not the case that there is a universe containing all types; In fact, if we did have some magical universe Type:Type\mathrm{Type}_\infty : \mathrm{Type}_\infty, we could reproduce Russell’s paradox, as is done here.

The solution to this is the usual stratification of objects into “sizes”, with the collection of all “small” objects being “large”, and the collection of all “large” objects being “huge”, etc. On the semantic side, a (,1)(\infty,1)-topos has a sequence of object classifiers for maps with κ\kappa-compact fibres, where κ\kappa is a regular cardinal; Syntactically, this is reflected much more simply as having a tower of universes with Typei:Type1+i\mathrm{Type}_i : \mathrm{Type}_{1+i}. So, in Agda, we have:

  _ : Type
  _ = Nat

  _ : Type₁
  _ = Type

In Agda, we do not have automatically that a “small” object can be considered a “large” object. When this is the case, we say that universes are cumulative. Instead, we have an explicit Lifting construction that lets us treat types as being in higher universes:

  _ : Type₂
  _ = Lift (lsuc (lsuc lzero)) Type

To avoid having Lift everywhere, universes are indexed by a (small) type of countable ordinals, called Level, and constructions can be parametrised by the level(s) of types they deal with. The collection of all \ell-level types is Typelsuc \mathrm{Type}_{\mathrm{lsuc}\ \ell}, which is itself a type in the next universe up, etc. Since levels are themselves inhabitants of a type, we do in fact have a definable function λ. Type\lambda \ell.\ \mathrm{Type}_\ell. To avoid further Russellian paradoxes, functions out of Level must be very big, so they live in the “first infinite universe”, Typeω\mathrm{Type}_\omega. There is another hierarchy of infinite universes, with Typeω+i\mathrm{Type}_{\omega+i} inhabiting Typeω+(1+i)\mathrm{Type}_{\omega+(1+i)}. To avoid having even more towers of universes, you can’t quantify over the indices ii in Typeω+i\mathrm{Type}_{\omega+i}. To summarize, we have:

Type:Type1:Type2:Typeω:Typeω+1:Typeω+2: \mathrm{Type} : \mathrm{Type}_1 : \mathrm{Type}_2 : \cdots \mathrm{Type}_{\omega} : \mathrm{Type}_{\omega+1} : \mathrm{Type}_{\omega+2} : \cdots

To represent a collection of types varying over an value of AA, we use a function type into a universe, called a type family for convenience: A type family over AA is a function ATypeiA \to \mathrm{Type}_i, for some choice of level ii. An example of a type family are the finite types, regarded as a family Fin:NType\mathrm{Fin} : \mathbb{N} \to \mathrm{Type} — where Fin(n)\mathrm{Fin}(n) is a type with nn elements. Type families are also used to model type constructors, which are familiar to programmers as being the generic types. And finally, if we have a type B:TypeiB : \mathrm{Type}_i, we can consider the constant family at BB, defined to be the function λ(x:A). B\lambda (x : A).\ B.

To cap off the correspondence between universes and object classifiers, thus of type families and maps, we have a theorem stating that the space of maps into BB is equivalent to the space of type families indexed by BB. As a reminder, you can click on the name Fibration-equiv to be taken to the page where it is defined and explained.

  _ :  {B : Type}  (Σ[ E ∈ Type ] (E  B))(B  Type)
  _ = Fibration-equiv

Universes and size issues🔗

Perhaps one of the most famous paradoxes in the history of formal logic, Russell’s paradox reminds us that, for most reasonable kinds of “collection” — be they sets, types, or categories — considering the collection of all collections that do not contain themselves is a road that leads to madness. The standard way of getting around these issues, at least in set-theoretic foundations as applied to category theory, is to refer to such problematic collections as “classes”, and to only axiomatise the sets which are not too big.

In the branches of mathematics that Alexander Grothendieck influenced, a more common approach is to work instead with Grothendieck universes: A Grothendieck universe is a set U\mathcal{U} that forms a transitive model of ZF, closed under U\mathcal{U}-small indexed products. The classification of models of ZF implies that any Grothendieck universe U\mathcal{U} is equivalent to a segment of the von Neumann universe Vκ\mathbf{V}_\kappa, where κ\kappa is a strongly inaccessible cardinal — essentially, an upper bound on the size of the sets in U\mathcal{U}.

In traditional mathematical writing, it is common to entirely ignore size issues, save perhaps for an offhand mention of Grothendieck universes (or strongly inaccessible cardinals) within the first few sections. Working in a proof assistant forces us to be honest about the size of our constructions: Correspondingly, we try to be precise in our prose as well. As mentioned above, Agda universes are stratified into a pair of hierarchies Typeκ\mathrm{Type}_\kappa and Typeω+n\mathrm{Type}_{\omega+n}, where we’re using κ\kappa to stand for a variable of Level type. The Level is a built-in type which contains a constant representing 00, is closed under taking successors, and is closed under taking binary maxima.

We refer to structured type (e.g. a category, or a group) as κ\kappa-small when its underlying type inhabits the κ\kappath universe; For the specific case of categories, we also use the terminology locally κ\kappa-small to refer to the universe in which the family Hom(,)\mathbf{Hom}(-,-) lands. We use subscript Greek letters in prose to index our structures by their universe level. For example, Setsκ\mathbf{Sets}_\kappa is the ((1+κ)(1+\kappa)-small, locally κ\kappa-small) category of κ\kappa-small sets.

Small (hah!) note on small categories.

For those familiar with the notions of internal and enriched categories, we might rephrase the classical definitions of κ\kappa-small and locally κ\kappa-small categories as follows:

  • A category C\mathcal{C} is κ\kappa-small if it is an internal category in Setsκ\mathbf{Sets}_\kappa;

  • A category C\mathcal{C} is locally κ\kappa-small if it is enriched over the Cartesian monoidal category Setsκ\mathbf{Sets}_\kappa.

Because every type is contained in some universe, we note that every category that appears in our development is locally κ\kappa-small for some universe level κ\kappa. In particular, we have no “type of categories”, but instead a type of “κ\kappa-small, locally λ\lambda-small categories”.

\ Warning

Note that our use of the term “κ\kappa-small” is nonstandard. In set-theoretic foundations, where the only objects are sets anyway, this means “category internal to Setsκ\mathbf{Sets}_\kappa”, as mentioned in the infobox above. In the 1Lab, the objects we manipulate are higher groupoids rather than sets, and so very few categories will be internal to a category of sets. Instead, when we describe a category C\mathcal{C} is κ\kappa-small, we mean that the type of objects of C\mathcal{C} is an inhabitant of the universe Typeκ\mathrm{Type}_\kappa, and that the HomC(,)\mathbf{Hom}_\mathcal{C}(-,-) family is valued in Setsκ\mathbf{Sets}_\kappa. Our shorthand for the traditional notion is a “strict κ\kappa-small” category.

Our policy is to keep the use of universes “as precise as possible”: Definitions are polymorphic over as many levels as they can possibly be, and they are always placed in the smallest universe in which they fit. This is, however, an iterative process: For example, our definition of “sheaf topos” was originally parametrised over 5 levels, but after formalising more desirable properties of topoi, we have found that the definition only really mentions two levels.

Interlude: Basics of paths🔗

Since the treatment of paths is the most important aspect of homotopy type theory, rest assured that we’ll talk about it in more detail later. However, before discussing the dependent sum of a type family, we must discuss the fundamentals of paths, so that the categorical/homotopical connections can be made clear. Before we even get started, though, there is something that needs to be made very clear:

Paths are not equality!

The only conceptualisation of paths that is accurate in all cases is that types are \infty-groupoids, and hence path spaces are the Hom-\infty-groupoids given by the structure of each type. However, this understanding is very bulky, and doesn’t actually help understanding paths all that much — so we present a bunch of other analogies, with the caveat that they only hold for some types, or are only one of the many possible interpretations of paths in those types.

The ideas presented below are grouped in blue <details> elements, but not because they aren’t important; You should read all of them.

Idea #1: Some types behave like sets, providing an embedding of “classical” equality behaviour into homotopy type theory. While these types may not be as “exciting” as others, they aren’t any less important!

Contradicting the big bold warning up there, in a very restricted class of types, paths do behave like the equality relation! We call these types sets, and their defining property is that they “only have identity paths” — hence they could also be called “discrete types”, by analogy with “discrete category”, but this terminology is not employed because “discrete” already has a distinct type-theoretic meaning.7

Types in this class include the natural numbers, the integers, the rationals and the reals; Also included is every finite type, and the power set of any type at all. The type of functions ABA \to B, where BB is a set, is again a set. We can generalise this and say that “most” type formers preserve set-ness.

Note that the meaning of “is a set”/“is not a set” in HoTT is distinct from how that sentence is used in classical mathematics, where a collection is a set if it’s definable in the language of whatever foundational system you’re using. For example, in ZFC, the collection of all sets is not a set — because if it were, we could reproduce Russell’s paradox.

In HoTT, being a set has to do with spatial structure: a set has only trivial paths. Examples of non-sets would then be given by types that have interesting paths, like the circle, which has a non-trivial loop. Coincidentally, the type of all sets is not a set, since its spatial structure is given by the isomorphisms of sets — and sets can have non-trivial automorphism groups.

Idea #2: Some types are best understood as representing spaces, with intrinsic homotopic information, which we can investigate using type-theoretic methods. This interpretation is the core idea of “synthetic homotopy theory”, a whole field of maths!

We conceptualise some types (like the circle mentioned above) as being representatives for spaces, though this does not mean that they have any intrinsically-detectable topology in the sense of “open sets”-topological spaces. Instead, we think of these types as being an incarnation of spaces à la homotopy theory, with structure similar to what you’d find there: Homology groups, cohomology groups, but most easily seen are the homotopy groups. In these types, type-theoretic paths quack like topological paths.

Indeed, any type in homotopy type theory has an associated sequence of homotopy groups, and these behave precisely like the homotopy groups of classical algebraic topology — with the difference that the way we compute them in HoTT has a distinctly more “typey” flavour.

This interpretation of types as spaces leads us to look at the statement “there is a type XX with π1(X)Z\pi_1(X) \cong \mathbb{Z}” and rightly call XX “the circle”, even though a topologist might object that our circle is not any subset of R2\mathbb{R}^2. Similarly, we might take the integer 22 and apply the isomorphism above, to get a path in XX which “walks clockwise around the circle twice”.

Idea #3: HoTT provides a setting for doing category theory where all constructions are intrinsically isomorphism-invariant. Under this light, types provide the underlying groupoids of categories.

When doing category theory in HoTT, we have some types that naturally arise as the type of objects in a category, such as the type of groups relative to a universe. In category theory done set-theoretically, the collection of objects in CC has two different notions of sameness: The set-theoretic equality a=ba = b, with behaviour given by the axiom of extensionality and divorced from the categorical structure, and the class of isomorphisms of CC, aba \cong b, which naturally has to do with the categorical structure.

A big part of doing category theory in set-theoretic foundations is learning in which situations we should talk about the set-theoretic equality of objects, and when we should talk about isomorphism — what you end up discovering is that constructions that talk about equality of objects should be done very sparingly, so much so that they’re colloquially called “evil”.

Conversely, the possibility of talking about equality of objects at all means that we have to spend effort verifying that constructions appropriately respect isomorphisms, though of course this verification can be (and often ends up being) handwaved.

In HoTT, we can do better: We add a condition to the definition of category that says the paths of the type of objects must correspond to the categorical isomorphisms — which we call “univalence”. While this condition might initially seem hard to fulfill, it’s surprisingly common to come by univalent categories, since most constructions on categories preserve univalence! For instance:

  • If AA and BB are univalent categories, then so is the product A×BA \times B and the disjoint union ABA \uplus B; The initial and terminal categories are trivially univalent.

  • If CC is a univalent category, then so is the slice category C/cC/c, and the coslice c/Cc/C. More general comma categories are also univalent.

  • If CC is a univalent category, then for any DD, so is the functor category [D,C][D,C]; This implies that a very large class of categories is univalent, e.g. the models of any Lawvere theory in a univalent category.

    Further, since Sets\mathbf{Sets} is a univalent category, the Yoneda lemma implies that every CC can be embedded into a univalent category - the functor category [Cop,Sets][C^{\mathrm{op}},\mathbf{Sets}]. If we restrict to the subcategory of representable presheaves, we have that every CC admits a full, faithful and essentially surjective functor into a univalent category, called the Rezk completion of CC; If CC was already univalent, this functor is an equivalence.

In addition to the benefit of univalent categories that will be presented outside the fold, they’re also interesting in their own right because they are better constructively behaved than categories with a set of objects: For instance, in the last bullet point above we appealed to the statement “every fully faithful, essentially surjective functor is an equivalence”. For univalent categories, this is a theorem without any form of choice axiom!

One benefit that HoTT brings to the table is that it very naturally leads to mixing and matching the interpretations above. For example, the underlying groupoid of a univalent category can just as well be interpreted as a space, mixing interpretations #2 and #3. The spatial interpretation of categories really shines when we pass to higher categories, where the notion of “unique” behaves very badly.

When thinking nn-categorically, even for small values of nn, one must first weaken “unique” to “unique up to unique isomorphism”, and then to something like “unique up to isomorphism, which is unique up to unique invertible 2-cell”. Reasoning spatially, we see that “unique up to isomorphism” really means that there is a contractible space of choices for that object — a notion which is well-behaved at any level of higher categorical complexity.

Every type AA has an associated family of path types — to form a path type, we must have two inhabitants of AA handy, called the endpoints. The type of paths between xx and yy in AA is written xAyx \equiv_A y, where the subscript AA is omitted when it can be inferred from context. The name path is meant to evoke the topological idea of path, after which the paths in cubical type theory were modelled. In topology, a path is defined to be a continuous function from the real unit interval [0,1]X[0,1] \to X, so that we can interpret a path as being a point “continuously varying over time”.

It would be incredibly difficult (and oversized!) to formalise the entire real line and its topology to talk about paths, which are meant to be a fundamental concept in type theory, so we don’t bother! Instead, we look at which properties of the unit interval [0,1][0,1] give paths their nice behaviour, and only add those properties to our type theory. The topological interval is contractible, and it has two endpoint inclusions, with domain the one-point space. Those are the properties that characterise it (for defining paths), so those are the properties we keep!

The interval type, I\mathbb{I}, has two introduction rules — but they are so simple that we refer to them as “closed inhabitants” instead. These are written i0\mathrm{i0} and i1\mathrm{i1}, and they denote the left- and right- hand side of the unit interval. A classic result in the interpretation of constructive mathematics says that any function defined inside type theory is automatically continuous, and so, we define the type xAyx \equiv_A y to be the type of functions f:IAf : \mathbb{I} \to A, restricted to those where f(i0)=xf(\mathrm{i0}) = x and f(i1)=yf(\mathrm{i1}) = y definitionally.

The introduction rule for paths says that if e:IAe : \mathbb{I} \to A, with e(i0)=xe(\mathrm{i0}) = x and e(i1)=ye(\mathrm{i1}) = y (those are definitional equalities), then we can treat it as a path xAyx \equiv_A y. Conversely, if we have a term p:xAyp : x \equiv_A y, then we can apply it to an argument i:Ii : \mathbb{I} to get a term p(i)p(i) of type AA. The type of paths satisfies the same reduction rule (β\beta-reduction) as function types, but with an additional rule that says p(i0)p(\mathrm{i0}) is definitionally equal to xx (resp. i1\mathrm{i1} and yy), even if pp has not reduced to a λ\lambda-abstraction.

By considering the constant function at a point x:Ax : A, we obtain a term (λi. x)(\lambda i.\ x) of type xAxx \equiv_A x, called the trivial path, and referred to (for historical reasons) as refl, since for the types that are sets, it expresses reflexivity of equality. In the interpretation of types as \infty-groupoids, refl is the identity morphism. The rest of the groupoid structure is here too! Any path p:xyp : x \equiv y has an inverse p1:yxp^{-1} : y \equiv x, and for any p:xAyp : x \equiv_A y and q:yAzq : y \equiv_A z, there is a composite path pq:xAzp \bullet q : x \equiv_A z. There are paths between paths which express the groupoid identities (e.g. pp1reflp \bullet p^{-1} \equiv \mathrm{refl}), and those paths satisfy their own identities (up to a path between paths between paths), and so on.

This little introduction to paths was written so that we can make some of the ideas in the following section, about dependent sums, precise. It’s not meant to be a comprehensive treatment of the path type; For that, see the section Paths, in detail later on.

Sums🔗

Recall that in the construction of a map into BB from a predicate χ:BΩ\chi : B \to \Omega, we interpreted χ\chi as a family of sets with at most one element, and then took the disjoint union of that family, written (x:B)χ(x)\sum_{(x : B)} \chi(x), which admits a projection onto BB. That was actually a sneaky introduction of the dependent sum type former, \sum! Indeed, the short discussion there also made some mention of the introduction rule, but let’s reiterate with more clarity here.

If we have some index type A:TypeA : \mathrm{Type}_\ell, and a family of types BTypeB \to \mathrm{Type}_{\ell'}, then we can form the dependent sum type (x:A)B\sum_{(x : A)} B. When the index is clear from context, we let ourselves omit the variable, and write B\sum B. Since B\sum B is a type, it lives in a universe; Since it admits a projection onto AA and a projection onto a fibre of BB, it must be a universe bigger than both \ell and \ell', but it need not be any bigger: The dependent sum of BB lives in the universe indexed by {\ell \sqcup \ell'}.

The introduction rule for B\sum B says that if you have an a:Aa : A, and a b:B(a)b : B(a), then the pair (a,b)(a,b) inhabits the type B\sum B. Thus the inhabitants of the dependent sum are ordered pairs, but where the second coordinate has a type which depends on the value of the first coordinate. If we take BB to be a constant type family, we recover a type of ordered pairs, which we call the product of AA and BB, which is notated by A×BA \times B.

For the elimination rules, well, pairs would be rather useless if we couldn’t somehow get out what we put in, right? The first elimination rule says that if we have an inhabitant e:(x:A)B(x)e : \sum_{(x : A)} B(x), then we can project out the first component, which is written e.fste \mathrm{.fst}, to get an inhabitant of AA. The computation rule here says that if you project from a pair, you get back what you put in: (a,b).fst=a(a,b)\mathrm{.fst} = a.

Projecting out the second component is trickier, so we start by considering the special case where we’re projecting from a literal pair. We know that in (a,b)(a,b), the terms have types a:Aa : A and b:B(a)b : B(a), so in this case, the second projection is (a,b).snd:B(a)(a,b)\mathrm{.snd} : B(a). When we’re not projecting from a literal pair, though, we’re in a bit of a pickle. How do we state the type of the second projection, when we don’t know what the first component is? Well - we generalise. We know that for a pair, a=(a,b).fsta = (a,b)\mathrm{.fst}, so that we may replace the aforementioned typing for .snd\mathrm{.snd} with the wordy (a,b).snd:B((a,b).fst)(a,b)\mathrm{.snd} : B((a,b)\mathrm{.fst}). Now we don’t use any subterms of the pair in the typing judgement, so we can generalise:

e.snd:B(e.fst) e\mathrm{.snd} : B (e\mathrm{.fst})

A very common application of the dependent sum type is describing types equipped with some structure. Let’s see an example, now turning to Agda so that we may see how the dependent sum is notated in the formalisation. We define a pointed magma structure on a type to be a point of that type, and a binary function on that type. If we let the type vary, then we get a type family of pointed magma structures:

  Pointed-magma-on : Type  Type
  Pointed-magma-on A = A × (A  A  A)

Taking the dependent sum of this family gives us the type of pointed magmas, where inhabitants can be described as triples (T,i,m)(T,i,m) where T:TypeT : \mathrm{Type} is the underlying type, i:Ti : T is the distinguished point, and m:TTTm : T \to T \to T is the binary operation. Note that, to be entirely precise, the pair would have to be written (T,(i,m))(T,(i,m)), but since by far the most common of nested pairs is with the nesting on the right, we allow ourselves to omit the inner parentheses in this case.

  Pointed-magma : Type₁
  Pointed-magma = Σ Type Pointed-magma-on

Note that, since we fixed the family to TypeType\mathrm{Type} \to \mathrm{Type}, and recalling that Type:Type1\mathrm{Type} : \mathrm{Type}_1 are the first two universes, the type of all pointed magmas in Type\mathrm{Type} lives in the next universe, Type1\mathrm{Type}_1. If the type of all pointed magmas were in Type\mathrm{Type}, then we could again reproduce Russell’s paradox.

Just like functions, pairs enjoy a uniqueness principle, which (since coming up with names is very hard), also goes by η\eta-expansion. The rule says that every inhabitant of a Σ\Sigma-type is definitionally equal to the pairing of its projections: p=(p.fst,p.snd)p = (p\mathrm{.fst}, p\mathrm{.snd}). This essentially guarantees us that the inhabitants of a dependent sum type can’t have any “extra stuff” hiding from our projection maps; They are exactly the pairs, no more, and no less.

The dependent sum is a construction that goes by many names. In addition to the ones mentioned below (which are the names people actually use), you could call it the Grothendieck construction, denote it by B\int B, and have a hearty chuckle — if it weren’t for the fact that the Grothendieck construction is generally taken for functors between categories rather than just groupoids.

  • Σ\mathbf{\Sigma}-type, named after its shape;

  • Disjoint union, since if we take AA to be the 2-point type, then the dependent sum B\sum B of B:2TypeB : 2 → \mathrm{Type} is exactly the disjoint union B(0)B(1)B(0) \uplus B(1);

  • Thinking homotopically, you can consider a type family B:ATypeB : A \to \mathrm{Type} to be a fibration, since as we shall see later it satisfies the path lifting property; In that sense, B\sum B gives the total space, and the actual fibration is the first projection map xx.fstx \mapsto x\mathrm{.fst}.

This last correspondence will be particularly important later, and it inspires much of the terminology we use in HoTT. Since universes are object classifiers, we know that there is an equivalence between type families F:BTypeF : B \to \mathrm{Type} and maps into BB. Dependent sums and paths let us be more precise about this equivalence: We can turn any type family F:BTypeF : B \to \mathrm{Type} into a map fst:FB\mathrm{fst} : \sum F \to B, and looking at the fibre of first over a point yy recovers F(y)F(y). We thus blur this distinction a bit and refer to F(x)F(x) for some x:Bx : B as a fibre of FF, and we say that something happens fibrewise if it happens for each F(x)F(x).

One last thing to mention about Σ\Sigma-types is that they are the realization of general subsets. This is a special case of the idea of stuff-with-structure, where the structure is merely a witness for the truth of some predicate of the first value. For example, recall that we defined the fibre of ff over yy as {xA:f(x)=y}\{ x \in A : f(x) = y \}. In type theory, this is rendered with a Σ\Sigma-type, as below:

  fibre : (A  B)  B  Type
  fibre f y = Σ[ x ∈ A ] (f x ≡ y)

Dependent products🔗

Imagine that we have two types, AA and BB. For simplicity, they live in the same universe, which can be any universe of our choice. We know that we can consider BB to be a constant type family taking indices in AA: λ(x:A). B\lambda (x : A).\ B. Furthermore, we know that the total space of this family is the product type A×BA \times B, and that it comes equipped with a projection map fst:BA\mathrm{fst} : \sum B \to A.

Given any function f:ABf : A \to B, we define its graph graph(f):AA×B\mathrm{graph}(f) : A \to A \times B to be the function λx. (x,f(x))\lambda x.\ (x, f(x)), which takes each point of the input space to an ordered pair where the first coordinate is the input, and the second coordinate is the output of ff at that input. The graph of a function is special because of its interaction with the projection map fst:A×BA\mathrm{fst} : A \times B \to A. In particular, we can see that λx. graph(f).fst\lambda x.\ \mathrm{graph}(f)\mathrm{.fst} is the identity function! This property turns out to precisely characterise the functions AA×BA \to A \times B which are the graphs of functions ABA \to B.

By rewriting the equality we noticed above in terms of function composition, we obtain .fstgraph(f)=id\mathrm{.fst} \circ \mathrm{graph}(f) = \operatorname{id}_{}. A function, like graph(f)\mathrm{graph}(f), which composes on the right with some other function fst\mathrm{fst} to give the identity function, is called a section of fst\mathrm{fst}. This lets us characterise the functions ABA \to B as the sections of fst:A×BA\mathrm{fst} : A \times B \to A. In this sense, we see that a function ABA \to B is precisely a choice of values f(x)f(x), where f(x)f(x) is in the fibre over xx — in this case, the fibres over all points are BB!

Given a map g:AA×Bg : A \to A \times B that we know is a section of fst\mathrm{fst}, we can recover a function f:ABf : A \to B. Specifically, given a point x:Ax : A, we know that the second projection g(x).sndg(x)\mathrm{.snd} has type BB — so we can simply define ff to be λx. g(x).snd\lambda x.\ g(x)\mathrm{.snd}. Since we know that gg is a section of fst\mathrm{fst}, we conclude that graph(f)=g\mathrm{graph}(f) = g.

This idea generalises cleanly from the case where BB is just some type, to the case where BB is a type family indexed by AA. We define a section of BB to be a function g:ABg : A \to \sum B, where fstg=id\mathrm{fst} \circ g = \operatorname{id}_{}. Note that this terminology conflates the type family BB with the projection map BA\sum B \to A, keeping with the trend of identifying type families with fibrations. Generalising the argument from the previous paragraph, we show that a section gg gives a rule for assigning a point in B(x)B(x) to each point in xx. We again take the second projection of g(x).sndg(x)\mathrm{.snd}, but since now BB is a type family, this lives in the type B(g(x).fst)B(g(x)\mathrm{.fst}) — but since we know that gg is a section of fst\mathrm{fst}, we can correct this to be B(x)B(x).

We would like to package up such a rule into a function ABA \to B, but this is not a well-formed type, since BB is a type family. Moreover, this would “forget” that xx is sent to the fibre B(x)B(x). Thus, we introduce a new type former, the dependent product type, written (x:A)B(x)\prod_{(x : A)} B(x) or more directly (x:A)B(x)(x : A) \to B(x). The inhabitants of this type are precisely the sections of BB! More importantly, the introduction, elimination, computation and uniqueness rules of this type are exactly the same as for the function type.

The dependent product type takes on another very important role when universes are involved: Supporting polymorphism. In programming, we say that a function is polymorphic when it works uniformly over many types; Dependent products where the index type is a universe reflect this exactly. For example, using polymorphism, we can write out the types of the first and second projection maps:

  project-first : (A : Type) (B : A  Type)  Σ A B  A
  project-first A B x = x .fst

  project-second : (A : Type) (B : A  Type)
                  (p : Σ A B)  B (project-first A B p)
  project-second A B x = x .snd

In the logical interpretation, the dependent product type implements the universal quantifier \forall. For example (keep in mind that in general, we should not read path types as equality), we could read the type (x y:A)xy(x\ y : A) \to x \equiv y as the proposition “for all pairs of points xx and yy of AA, xx is equal to yy”. However, an inhabitant of this type has a much more interesting topological interpretation!

Worked example: Interpreting dependent sums as fibrations and dependent products as sections lets us derive topological interpretations for types. We show how to do this for the type (x y:A)xy(x\ y : A) \to x \equiv y, and show that a space AA admitting an inhabitant of this type has the property of “being contractible if it is inhabited”.

First, we’ll shuffle the type so that we can phrase it in terms of a single dependent product, and a single type family. We define a type AA to be a subsingleton if there is a term

f:(xy:A×A)(xy.fstxy.snd)f : (xy : A \times A) \to (xy\mathrm{.fst} \equiv xy\mathrm{.snd})

An ff of this type can be seen as a section of the type family F:A×ATypeF : A \times A \to \mathrm{Type} defined by the rule

F(x,y)(xy) F(x,y) \mapsto (x \equiv y)

The total space F\sum F of this family is the space of all paths in AA, which will be written as AIA^\mathbb{I} to emphasise its nature as a path space. Since it is the total space of a type family, it comes equipped with a fibration fst:AIA×A\mathrm{fst} : A^\mathbb{I} \to A \times A. Given that a term in AIA^\mathbb{I} can be seen to be a pair ((x,y),p)((x, y), p) where p:xyp : x \equiv y, we see that this fibration projects both endpoints of a path — so we will write it (d0,d1):AIA×A(d_0,d_1) : A^\mathbb{I} \to A \times A, since it is the pairing of the maps which evaluate a path at the left and right endpoints of the interval.

As a very quick aside, there is a map r:λx. (x,x,reflr : \lambda x.\ (x, x, \mathrm{refl} making the diagram below commute. This diagram expresses that the diagonal AA×AA \to A \times A can be factored as a weak equivalence followed by a fibration through AIA^\mathbb{I}, which is the defining property of a path space object. AA.

Ar˜AI(d0,d1)A×A A \xrightarrow{\~r} A^\mathbb{I} \xrightarrow{(d0,d1)} \to A \times A

A section of this fibration — that is, a dependent function like ff — is then a continuous function A×AAIA \times A \to A^\mathbb{I}, with the property that (d0,d1)f=id(d_0,d_1) \circ f = \operatorname{id}_{}. Now assume that our space AA is pointed, say with a point y0:Ay_0 : A, and that we have a section ff. We can then define the homotopy (x,t)f(x,y0,t)(x,t) \mapsto f(x,y_0,t), mapping between the identity function on AA and the constant function at y0y_0, exhbiting the space AA as contractible.

If you assume the law of excluded middle, every space is either pointed or empty. In this case, we can describe a space AA equipped with a map like ff as being “either empty or contractible”, but we prefer the terminology “contractible if inhabited” or subsingleton instead.

What next?🔗

While the text above was meant to serve as a linearly-structured introduction to the field of homotopy type theory, the rest of the 1Lab is not organised like this. It’s meant to be an explorable presentation of HoTT, where concepts can be accessed in any order, and everything is hyperlinked together. However, we can highlight the following modules as being the “spine” of the development, since everything depends on them, and they’re roughly linearly ordered.

Paths, in detail🔗

open import 1Lab.Path

The module 1Lab.Path develops the theory of path types, filling in the details that were missing in Interlude - Basics of paths above. In particular, the Path module talks about the structure of the interval type I\mathbb{I}, the definition of dependent paths, squares, and the symmetry involution on paths. It then introduce the transport operation, turns a path between types into an equivalence of types.

It also exposits about the structure of types as Kan complices; In particular, the Path module defines and talks about partial elements, which generalise the notion of “horn of a simplex” to any shape of partial cube; Then, it defines extensibility, which exhibits a partial element as being a subshape of some specific cube, and the composition and filling operations, which express that extensibility is preserved along paths. Finally, it introduces the Cartesian coercion operation, which generalises transport.

To reiterate, paths are the most important idea in cubical type theory, so you should definitely read through this module!

Equivalences🔗

The idea of subsingleton type, mentioned in passing in the discussion about universes and expanded upon in the section on dependent products, generalises to give the notion of h-level, which measures how much interesting homotopical information a type has. A h-level that comes up very often is the level of sets, since these are the types where equality behaves like a logical proposition. Due to their importance, we provide a module defining equivalent characterisations of sets.

open import 1Lab.HLevel

Contractible types can be used to give a definition of equivalence that is well-behaved in the context of higher category theory, but it is not the only coherent notion. We also have half-adjoint equivalences and bi-invertible maps. Finally, we have a module that explains why these definitions are needed in place of the simpler notion of “isomorphism”.

open import 1Lab.Equiv
open import 1Lab.Equiv.Biinv
open import 1Lab.Equiv.HalfAdjoint
open import 1Lab.Counterexamples.IsIso

Univalence🔗

open import 1Lab.Univalence

The module 1Lab.Univalence defines and explains the Glue type former, which serves as the Kan structure of the universe. It then concretises this idea by describing how Glue can be used to turn any equivalence into a path between types, which leads to a proof of the univalence principle. After, we prove that the type of equivalences satisfies the same induction principle as the type of paths, and that universes are object classifiers.

open import 1Lab.Univalence.SIP

While univalence guarantees that equivalence of types is equivalent to paths in types, it does not say anything about types-with-structure, like groups or posets. Fortunately, univalence implies the structure identity principle, which characterises the paths in spaces of “structured types” as being homomorphic equivalence.


  1. As usual, we do not have a type of all groups. Instead, we have a family of types of groups that depends on how “big” their underlying type is allowed to be: what universe it lives in.↩︎

  2. Consequently, a term of the form (λx. e)e(\lambda x.\ e) e' is referred to as a β\beta redex (plural β\beta redices), short for β\beta-reducible expression, but since we’re not really interested in the theory of programming languages here, this terminology will not come up.↩︎

  3. A survey paper of POPL proceedings by Guy L. Steele (Steele 2017) identified twenty-eight different notations for substitution, so the word “typical” is.. questionable, at best.↩︎

  4. Solution: Recall what it means for a map to be injective - that f(a)=f(b)a=bf(a) = f(b) \to a = b. If the domain of a function is a set with a single point, then the consequent is automatically satisfied! Since the maps N* \to \mathbb{N} are in 1-to-1 correspondence with the elements of N\mathbb{N}, we can consider any two numbers to be distinct subsets with the same domain.↩︎

  5. In some foundational systems — including HoTT, the one we are discussing — “subsingleton” is literally taken as the definition of “proposition”, so this translation isn’t merely omitted, it’s not necessary at all.↩︎

  6. Let me make this more precise. In any (univalent) category with finite limits E\mathscr{E}, we have a functor Sub:EopSet\mathrm{Sub} : \mathscr{E}^{\mathrm{op}} \to \mathrm{Set}, which takes cc to the full subcategory of E/c\mathscr{E}/c on the injections. We say that ΩE\Omega \in \mathscr{E} is a subobject classifier if there is a natural isomorphism SubHom(,Ω)\mathrm{Sub} \cong \mathbf{Hom}(-,\Omega), i.e., if Ω\Omega is a representing object for the subobject functor. Since the Yoneda lemma implies that representing objects are unique, this characterises Ω\Omega.↩︎

  7. Discreteness, type-theoretically, has to do with whether we can decide equality in that type by an algorithm. For example, the natural numbers are discrete, because we can compare the numerals they represent in time bounded by the size of the number. On the other hand, sequences of natural numbers are not discrete, because an algorithm can not run forever. Any discrete type is a set, and assuming excluded middle, any set is discrete.↩︎


References