module Data.Int.Base where
The integers🔗
The familiar set of integers, in addition to its intuitive characterisation in terms of positive and negative numbers, can be specified by a wide variety of universal properties:
The set of integers, with the operation of addition, is the free group on one generator.
The set of integers, with the operations of addition and multiplication, is the initial ring.
The set of integers under addition, considered together with the embedding of the positive numbers, is the free group generated by the monoid of natural numbers.
The set of integers is the loop space of the circle. More generally, it appears (as a group under addition) as the first non-trivial higher homotopy group of all spheres:
The set of integers, with the number zero and the successor equivalence, is the initial pointed type with an equivalence.
All of these specifications can be turned into code, and regardless of your choice, it would be provably equivalent to any of the others. Therefore, there is no mathematical reason to pick one over the others. However, the real world imposes two constraints on us: convenience and efficiency.
On the convenience front, it’s simply more convenient to formalise further results if there is a definition of the integers which we take as the definition. And, since this definition will be ubiquitous, it’s best if it has compact normal forms for numbers, and supports definition of the relevant structure in a computationally nice way.
For definiteness, we go with the most elementary, inductive representation:
data Int : Type where
: Nat → Int
pos : Nat → Int negsuc
The definition above is isomorphic to a sum type, where both summands are the natural numbers. However, if we interpret this naïvely, then we would have a problem: there are now two copies of the number zero! This is, essentially, a problem of intent. We have to choose one of the two summands to contain the number zero, and the names we choose for the constructors must reflect this.
The constructor pos
embeds the
positive numbers — incl. zero! — while the constructor negsuc
constructs the
negation of a successor. This means
that negsuc 0
is the
representation of the number
Other than these constructors, we can define a difference operation, between natural numbers, valued in the integers. This difference correctly reflects, in its sign, whether we tried to subtract a large quantity from a smaller quantity.
_ℕ-_ : Nat → Nat → Int
= pos x
x ℕ- zero = negsuc y
zero ℕ- suc y = x ℕ- y suc x ℕ- suc y
We can also use this to demonstrate the offsetting built into negsuc
:
_ : 0 ℕ- 20 ≡ negsuc 19
_ = refl
Equality🔗
We mentioned in the introductory paragraph that the type of integers is a set. We will show something stronger: it’s actually discrete. This means that we have a procedure that can tell whether two integers are equal, and produce a refutation when they are not equal. Intuitively, this is because the natural numbers are discrete, and it’s embedded in the integers.
The first thing to do is discriminate between the two constructors. If they match, we can compare the underlying natural numbers:
instance
: Discrete Int
Discrete-Int {pos x} {pos y} with x ≡? y
Discrete-Int ... | yes p = yes (ap pos p)
... | no ¬p = no λ path → ¬p (pos-injective path)
{negsuc x} {negsuc y} with x ≡? y
Discrete-Int ... | yes p = yes (ap negsuc p)
... | no ¬p = no λ path → ¬p (negsuc-injective path)
If they’re mismatched, we have pre-existing refutations.
{pos x} {negsuc y} = no pos≠negsuc
Discrete-Int {negsuc x} {pos y} = no negsuc≠pos Discrete-Int
As the universal symmetry🔗
One of the mentioned characterisations of the integers was as the initial type equipped with a point and an auto-equivalence. This equivalence is the successor function: if we picture the integers as a number line, the effect of this equivalence is to “rotate” the line to the right.
: Int → Int
sucℤ (pos n) = pos (suc n)
sucℤ (negsuc zero) = 0
sucℤ (negsuc (suc n)) = negsuc n
sucℤ
: Int → Int
predℤ (pos zero) = -1
predℤ (pos (suc n)) = pos n
predℤ (negsuc n) = negsuc (suc n) predℤ
The definition of the successor and predecessor functions is slightly complicated by the need to adjust by one when passing between the summands. The proof that these are inverses is a case bash precisely mirroring the structure of the functions.
: (x : Int) → sucℤ (predℤ x) ≡ x
suc-predℤ (negsuc x) = refl
suc-predℤ (pos zero) = refl
suc-predℤ (pos (suc x)) = refl
suc-predℤ
: (x : Int) → predℤ (sucℤ x) ≡ x
pred-sucℤ (pos x) = refl
pred-sucℤ (negsuc zero) = refl
pred-sucℤ (negsuc (suc x)) = refl
pred-sucℤ
: Int ≃ Int
suc-equiv = Iso→Equiv (sucℤ , iso predℤ suc-predℤ pred-sucℤ) suc-equiv
As signed numbers🔗
Considering the isomorphism we arrive at an equivalent representation of the integers: as a pair consisting of a natural number and its sign. We have projections and which correspond to this view: given an integer, we can determine its sign and its absolute value.
data Sign : Type where
: Sign
pos neg
: Int → Nat
abs (pos x) = x
abs (negsuc x) = suc x
abs
: Int → Sign
sign (pos x) = pos
sign (negsuc x) = neg sign
Conversely, if we have a signed number, we can build an integer. Note
that the assign
function sends the
natural number zero to the integer zero regardless of what sign is
specified.
: Sign → Nat → Int
assign = 0
assign s zero (suc n) = pos (suc n)
assign pos (suc n) = negsuc n assign neg
Algebra🔗
We also mentioned two more characterisations of the integers: as the free group on one generator, and as the initial ring. Therefore, we expect to find operations of addition, multiplication, and negation (additive inverse) on the set of integers. They are not that hard to define.
Addition on integers is defined by cases on the sign. If both numbers are positive (resp. negative), then we can compute their sum in the natural numbers. If the numbers have mismatched sign, then the addition function is actually computing a difference, and we already know how to compute differences.
_+ℤ_ : Int → Int → Int
= pos (x + y)
pos x +ℤ pos y = x ℕ- suc y
pos x +ℤ negsuc y = y ℕ- suc x
negsuc x +ℤ pos y = negsuc (suc (x + y)) negsuc x +ℤ negsuc y
The negation function is defined by a short case bash. Subtraction is defined as addition against the inverse, rather than as an operation in its own right.
: Int → Int
negℤ (pos zero) = pos zero
negℤ (pos (suc x)) = negsuc x
negℤ (negsuc x) = pos (suc x)
negℤ
_-ℤ_ : Int → Int → Int
= x +ℤ (negℤ y) x -ℤ y
The implementation of multiplication uses the decomposition of numbers into their signs and absolute values: The product is defined to be
There are actually three different “multiplication” signs in the
formula above. The first is sign multiplication, the second is
assign
, and the last is natural
number multiplication.
: Sign → Sign → Sign
sign× = pos
sign× pos pos = pos
sign× neg neg = neg
sign× pos neg = neg
sign× neg pos
_*ℤ_ : Int → Int → Int
= assign (sign× (sign i) (sign j)) (abs i * abs j) i *ℤ j
There are actually alternative definitions of addition and multiplication: as iterated successor/predecessor, and as iterated addition, respectively. These alternative representations have worse performance, but they behave in a way that is slightly easier to reason about. When establishing the algebraic properties of the integers, we’ll prove that these functions are equivalent to the definitions above, and change between them as appropriate.
: Int → Int → Int
rotℤ (pos zero) y = y
rotℤ (pos (suc x)) y = sucℤ (rotℤ (pos x) y)
rotℤ (negsuc zero) y = predℤ y
rotℤ (negsuc (suc x)) y = predℤ (rotℤ (negsuc x) y)
rotℤ
: Int → Int → Int
dotℤ = posz
dotℤ posz y (possuc x) y = y +ℤ (dotℤ (pos x) y)
dotℤ (negsuc zero) y = negℤ y
dotℤ (negsuc (suc x)) y = negℤ y +ℤ (dotℤ (negsuc x) y) dotℤ
Additional operations🔗
It is also straightforward to define maximum and minimum operations for integers:
: Int → Int → Int
maxℤ (pos x) (pos y) = pos (max x y)
maxℤ (pos x) (negsuc _) = pos x
maxℤ (negsuc _) (pos y) = pos y
maxℤ (negsuc x) (negsuc y) = negsuc (min x y)
maxℤ
: Int → Int → Int
minℤ (pos x) (pos y) = pos (min x y)
minℤ (pos _) (negsuc y) = negsuc y
minℤ (negsuc x) (pos _) = negsuc x
minℤ (negsuc x) (negsuc y) = negsuc (max x y) minℤ