This file is used to specify the character sorting order and which characters are the 'same' for the purposes of searching. All accented versions of a character will have the same code.
The first section has three byte records that have the following meaning
Byte number | Description |
---|---|
1 | The low nibble is 1 for letters, 2 for digits and zero for everything else. |
2 | This is the sort order for the unaccented form of the character. So a, à, á, ä and å would all have the same code here. The next byte allows you to put them in order. |
3 | Upper nibble is 2 for uppercase letters, 1 for lowercase letters.
It can also have the values 3 or 4 for unknown reasons.
The lower nibble is a number that can be added to byte 2 to get the full sorting order. This sorts the accented versions of the characters |
The next section has two byte records with an unknown function.