---
title: "SQUIDs"
output: rmarkdown::html_vignette
vignette: >
%\VignetteIndexEntry{SQUIDs}
%\VignetteEngine{knitr::rmarkdown}
%\VignetteEncoding{UTF-8}
---
It is often useful to produce short, quasi-unique
identifiers (SQUIDs) without the benefit of a central authority to
prevent duplication. Although Universally Unique Identifiers (UUIDs)
provide for this, these are also unwieldy; for example, the most used UUID,
version 4, is 36 characters long. SQUIDs are short (8 characters) at the
expense of having more collisions, which can be mitigated by combining them
with human-produced suffixes, yielding relatively brief, half
human-readable, almost-unique identifiers (see for example the identifiers
used for Decentralized Construct Taxonomies; Peters & Crutzen,
2024, [doi.org/mr4n](https://doi.org/10.15626/MP.2022.3638)).
SQUIDs are the number of centiseconds elapsed since the
beginning of 1970 converted to a base 30 system. This package contains
functions to produce SQUIDs as well as convert them back into dates and times.
## Details
SQUIDs are defined as 8-character strings that express a timestamp (the
number of centiseconds that passed since the UNIX Epoch) in a base 30
decimal system. The lowest possible SQUID, therefore, is `00000001` (which
corresponds to 1970-01-01 00:00:00 UTC), and the highest possible SQUID is
`zzzzzzzz`, which corresponds to 2177-11-28 11:59:59 UTC.
### The base 30 system
The characters used in SQUIDs are Arabic digits (0-9) and (lowercase)
Latin letters, omitting vowels. This yields the sequence listed at the
bottom of this page. This means that in the base 30 system used by SQUIDs:
- "10" represents "30" in the decimal system;
- "3b" represents "100" in the decimal system (which, for SQUIDs,
corresponds to 100 centiseconds, so one second);
- "6n0" represents "6000" in the decimal system (corresponding to
one minute);
- "fb00" represents "360000" in the decimal system (corresponding
to one hour);
- "bn00" represents "8640000" in the decimal system (corresponding
to one day);
### Avoiding collisions
Because SQUIDs represent centiseconds, if you generate two or more sequences
of SQUIDs in quick succession, these will likely overlap (i.e. contain the
same SQUIDs, called "collisions" in "identifier speak").
For example, if you produce a sequence of 1000 SQUIDs, this covers an
interval of 10 seconds, and if you produce a sequence of 6000 SQUIDs, this
covers an interval of one minute. This means that if you request 6000 SQUIDs
and after 30 seconds request another 6000 SQUIDs, and assuming you use the
default `origin` of the current time, the last 3000 SQUIDs of the first
sequence and the first 3000 SQUIDs of the second sequence will be the same.
To avoid this, `{squids}` allows you to specify a sequence of SQUIDs that
you want your new SQUIDs to follow using the `follow` argument. You can
also follow the first sequence of SQUIDs at a distance, using the `followBy`
argument; if you specify one or more SQUIDs in the `follow` argument, and
if you specify `followBy = 1000`, the new sequence of SQUIDs will have an
origin 1001 centiseconds after the last SQUID in the sequence you passed in
`follow`.
For example, let's create five SQUIDs and store them:
```{r}
exampleSQUIDs <-
squids::squids(5);
```
Let's look at what we got:
```{r}
#| comment: ""
exampleSQUIDs;
```
To then follow this sequence, we can specify them when creating new SQUIDs:
```{r}
#| comment: ""
squids::squids(
5,
follow = exampleSQUIDs
);
```
And we can use `followBy` to specify we want a gap:
```{r}
#| comment: ""
squids::squids(
5,
follow = exampleSQUIDs,
followBy = 30
);
```
## The SQUID "digits"
These are the 30 SQUID "digits" (on the right) and their corresponding
value in the base 10 decimal system (on the left).
```
0 = 0
1 = 1
2 = 2
3 = 3
4 = 4
5 = 5
6 = 6
7 = 7
8 = 8
9 = 9
10 = b
11 = c
12 = d
13 = f
14 = g
15 = h
16 = j
17 = k
18 = l
19 = m
20 = n
21 = p
22 = q
23 = r
24 = s
25 = t
26 = w
27 = x
28 = y
29 = z
```
```{r setup}
```