tldm-universe/Ardent/UV/nls/locales/WT.TABLES/LATIN-ESTONIAN

66 lines
2.3 KiB
Plaintext
Raw Normal View History

2024-09-09 21:51:08 +00:00
* Sorting weight table for ESTONIAN order of (Latin) characters.
* This should be loaded after the LATIN1-DEFAULT table, (optionally
* after LATINX-DEFAULT, LATINX2-DEFAULT and LATINX3-default), and
* sets the weights of the Estonian alphabetic characters that
* differ from A-Z as follows:
*
* Between S and T: S WITH CARON, Z, Z WITH ACUTE, Z WITH DOT ABOVE,
* Z WITH CARON
* W's all sort as V's: W, W WITH ACUTE, W WITH GRAVE, W WITH CIRCUMFLEX,
* W WITH DOT BELOW, W WITH DOT ABOVE, W WITH DIAERESIS
* Between W and X: O WITH TILDE, A WITH DIAERESIS, O WITH DIAERESIS,
* U WITH DIAERESIS
*
* SYNTAX:
* Each non-comment line gives one or more weights for a character,
* as follows (character value in hex, weights in decimal):
* Field 1 = Unicode character value
* Field 2 = Shared weight (characters that sort together if accents
* and case were to be disregarded should have the same SW)
* Or, Block Weight/Shared Weight. This form allows characters
* in different Unicode blocks to have equal SWs. If BW is
* omitted, only SWs for characters in the same block are equal.
* Field 3 = Accent weight, or '-' to omit or copy from previous.
* Please use values as defined in the file NLS.WT.LOOKUP.
* Field 4 = Case weight, or 'U' for upper and 'L' for lower case chars.
*
**********************************************************************
* HEX (BW/)SW AW CW
* After S and before T:
0160 4/1222 19 U * S WITH CARON
0161 - 19 L
005A 4/1224 - U * Z
007A - - L
0179 4/1225 1 U * Z WITH ACUTE
017A - 1 L
017B 4/1226 35 U * Z WITH DOT ABOVE
017C - 35 L
017D 4/1227 19 U * Z WITH CARON
017E - 19 L
* W and diacritic versions same as V:
0057 4/1260 - U * W
0077 - - L
1E82 - 1 U * W WITH ACUTE
1E83 - 1 L
1E80 - 3 U * W WITH GRAVE
1E81 - 3 L
0174 - 13 U * W WITH CIRCUMFLEX
0175 - 13 L
1E84 - 24 U * W WITH DIAERESIS
1E85 - 24 L
1E86 - 35 U * W WITH DOT ABOVE
1E87 - 35 L
1E88 - 36 U * W WITH DOT BELOW
1E89 - 36 L
* Followed by accented vowels after W and before X:
00D5 4/1272 31 U * O WITH TILDE
00F5 - 31 L
00C4 4/1273 24 U * A WITH DIAERESIS
00E4 - 24 L
00D6 4/1274 24 U * O WITH DIAERESIS
00F6 - 24 L
00DC 4/1275 24 U * U WITH DIAERESIS
00FC - 24 L
*
* END