numidr/docs/DataTypes.md

157 lines
5.9 KiB
Markdown
Raw Normal View History

2024-04-24 18:59:47 -04:00
# Fundamental Data Types
NumIdr exports a number of different datatypes. The most important type and the cornerstone of the library is the _array_, but there are other useful types as well.
2024-04-24 18:59:47 -04:00
## Arrays
### What is an Array?
2024-04-25 13:37:34 -04:00
In most programming languages, the word "array" is used to mean a one-dimensional list of values that is contiguous in memory. A typical array of integers may be written in list form like this:
2024-04-24 18:59:47 -04:00
```
[1, 4, 10, 2, -5, 18]
```
2024-04-25 13:37:34 -04:00
In this kind of array, elements are indexed by a single integer, starting at zero and increasing from left to right.
2024-04-24 18:59:47 -04:00
NumIdr, however, uses the word a bit more generally: a NumIdr array is a multi-dimensional structure that can be indexed by any number of integers. NumIdr arrays are written as nested lists:
```
[[4, -9, -2],
[5, -6, 1]]
```
Unlike in other languages, however, this is not a nested structure. The above is a single array, and it is always manipulated as one object.
### Properties of Arrays
The `Array` datatype has the following parameters:
```idris
Array : (s : Vect rk Nat) -> (a : Type) -> Type
```
The first parameter is the _shape_, a list of numbers (the _dimensions_) where each dimension is the length of a particular _axis_ of the array. The second parameter is the _element type_, the type of the values inside the array.
Let's return to the array example from earlier:
```
[[4, -9, -2],
[5, -6, 1]]
```
This is a rank-2 array, meaning that it has two axes. Rank-2 arrays are typically called matrices. To determine the dimensions of the array, we count the size of each nested list from the outside in, which in the case of matrices means the row axis comes before the column axis. This matrix has 2 rows and 3 columns, making its shape `[2, 3]`. Thus, a possible type for this array could be `Array [2, 3] Int`.
When determining the index of a value inside the array, the order of the indices is the same as the order of the dimensions, and each index number counts from zero. For example, the index `[1, 0]` indicates the second row and first column, which contains `5`.
> [!NOTE]
> The word "dimensions" is often ambiguously used to either refer to the rank of an array
> (as in "multi-dimensional array" in the previous section), or to the lengths of its
> axes. Conventionally, NumIdr reserves "dimension" for the second meaning, and uses
> "rank" for the first meaning.
>
> This guide has ignored this convention until now to be more understandable to newcomers,
> but will follow it from this point onward.
## Types of Arrays
Arrays are loosely divided into multiple subtypes mostly based on their rank. Each array subtype has an alias for convenience.
### Scalars
A scalar is a rank-0 array, meaning that it is indexed by 0 integers. Its alias is `Scalar`:
```idris
Scalar : (a : Type) -> Type
2024-04-25 13:37:34 -04:00
Scalar a = Array [] a
2024-04-24 18:59:47 -04:00
```
A scalar has exactly one index, the empty list `[]`. This means that it is exactly the same as a single value and as such is largely pointless, but NumIdr still provides an alias for it just in case you need it.
### Vectors
A vector is a rank-1 array:
```idris
Vector : (n : Nat) -> (a : Type) -> Type
2024-04-25 13:37:34 -04:00
Vector n a = Array [n] a
2024-04-24 18:59:47 -04:00
```
2024-04-25 13:37:34 -04:00
A vector's type signature and stored data is effectively identical to that of the standard library type `Vect`, whose elements are confusingly also called "vectors"; we often refer to those as "vects" to differentiate.
2024-04-24 18:59:47 -04:00
### Matrices
As mentioned before, a matrix is a rank-2 array:
```idris
Matrix : (m, n : Nat) -> (a : Type) -> Type
2024-04-25 13:37:34 -04:00
Matrix m n a = Array [m, n] a
2024-04-24 18:59:47 -04:00
```
There is also an alias `Matrix'` for square matrices.
```idris
Matrix' : (n : Nat) -> (a : Type) -> Type
2024-04-25 13:37:34 -04:00
Matrix' n a = Array [n, n] a
2024-04-24 18:59:47 -04:00
```
As a linear algebra library, the majority of the operations in NumIdr revolve around matrices.
#### Homogeneous Matrices
NumIdr also provides aliases for homogeneous matrices:
```idris
HMatrix : (m, n : Nat) -> (a : Type) -> Type
2024-04-25 13:37:34 -04:00
HMatrix m n a = Array [S m, S n] a
2024-04-24 18:59:47 -04:00
HMatrix' : (n : Nat) -> (a : Type) -> Type
2024-04-25 13:37:34 -04:00
HMatrix' n a = Array [S n, S n] a
2024-04-24 18:59:47 -04:00
-- To use with homogeneous matrices
HVector : (n : Nat) -> (a : Type) -> Type
2024-04-25 13:37:34 -04:00
HVector n a = Array [S n] a
2024-04-24 18:59:47 -04:00
```
These are useful for clarity when working with both homogeneous and non-homogeneous matrices.
## Other Datatypes
### Transforms
A transform is a wrapper type for a matrix with certain properties that can be used to transform points in space.
```idris
Transform : (ty : TransType) -> (n : Nat) -> (a : Type) -> Type
```
The `TransType` parameter dictates what kind of transform it is. These eight options are currently available:
**Linear Types:**
2024-04-25 13:37:34 -04:00
- `Trivial` (always the identity transformation)
- `Rotation`
- `Orthonormal` (rotation + reflection)
- `Linear`
2024-04-24 18:59:47 -04:00
**Affine Types:**
2024-04-25 13:37:34 -04:00
- `Translation`
- `Rigid` (rotation + translation)
- `Isometry` (rotation + reflection + translation)
- `Affine`
2024-04-24 18:59:47 -04:00
The `TransType` value is obtained by prepending a capital T to these names. For example, an isometry may have the type `Isometry 3 Double`, which is an alias for `Transform TIsometry 3 Double`.
#### The Point Type
2024-05-06 02:35:07 -04:00
Transforms behave differently from regular matrices when applied to a vector. When a non-linear transform is used, the transform is first linearized, so that vectors only have linear transformations applied to them. **This is not a bug!**
2024-04-24 18:59:47 -04:00
In order to properly apply these transforms, the `Point` type must be used, which is a wrapper around the `Vector` type that supports these transforms. This separation between points and vectors is intended to make working with affine transformations more convenient, as it mirrors the separation between points and vectors in affine algebra.
### Permutations
The type `Permutation n` represents a permutation of `n` elements. Permutations are mostly used internally for various algorithms, but they are also an input in various operations, such as those that permute the axes of an array.
2024-04-25 13:37:34 -04:00
2024-05-06 02:35:07 -04:00
Permutations can be composed using `(*.)`, and a permutation can be converted into a matrix using `permuteM`.
2024-04-25 13:37:34 -04:00
[Contents](Intro.md) | [Next](Operations.md)