66 lines
2.3 KiB
Plaintext
66 lines
2.3 KiB
Plaintext
|
* Sorting weight table for ESTONIAN order of (Latin) characters.
|
||
|
* This should be loaded after the LATIN1-DEFAULT table, (optionally
|
||
|
* after LATINX-DEFAULT, LATINX2-DEFAULT and LATINX3-default), and
|
||
|
* sets the weights of the Estonian alphabetic characters that
|
||
|
* differ from A-Z as follows:
|
||
|
*
|
||
|
* Between S and T: S WITH CARON, Z, Z WITH ACUTE, Z WITH DOT ABOVE,
|
||
|
* Z WITH CARON
|
||
|
* W's all sort as V's: W, W WITH ACUTE, W WITH GRAVE, W WITH CIRCUMFLEX,
|
||
|
* W WITH DOT BELOW, W WITH DOT ABOVE, W WITH DIAERESIS
|
||
|
* Between W and X: O WITH TILDE, A WITH DIAERESIS, O WITH DIAERESIS,
|
||
|
* U WITH DIAERESIS
|
||
|
*
|
||
|
* SYNTAX:
|
||
|
* Each non-comment line gives one or more weights for a character,
|
||
|
* as follows (character value in hex, weights in decimal):
|
||
|
* Field 1 = Unicode character value
|
||
|
* Field 2 = Shared weight (characters that sort together if accents
|
||
|
* and case were to be disregarded should have the same SW)
|
||
|
* Or, Block Weight/Shared Weight. This form allows characters
|
||
|
* in different Unicode blocks to have equal SWs. If BW is
|
||
|
* omitted, only SWs for characters in the same block are equal.
|
||
|
* Field 3 = Accent weight, or '-' to omit or copy from previous.
|
||
|
* Please use values as defined in the file NLS.WT.LOOKUP.
|
||
|
* Field 4 = Case weight, or 'U' for upper and 'L' for lower case chars.
|
||
|
*
|
||
|
**********************************************************************
|
||
|
* HEX (BW/)SW AW CW
|
||
|
* After S and before T:
|
||
|
0160 4/1222 19 U * S WITH CARON
|
||
|
0161 - 19 L
|
||
|
005A 4/1224 - U * Z
|
||
|
007A - - L
|
||
|
0179 4/1225 1 U * Z WITH ACUTE
|
||
|
017A - 1 L
|
||
|
017B 4/1226 35 U * Z WITH DOT ABOVE
|
||
|
017C - 35 L
|
||
|
017D 4/1227 19 U * Z WITH CARON
|
||
|
017E - 19 L
|
||
|
* W and diacritic versions same as V:
|
||
|
0057 4/1260 - U * W
|
||
|
0077 - - L
|
||
|
1E82 - 1 U * W WITH ACUTE
|
||
|
1E83 - 1 L
|
||
|
1E80 - 3 U * W WITH GRAVE
|
||
|
1E81 - 3 L
|
||
|
0174 - 13 U * W WITH CIRCUMFLEX
|
||
|
0175 - 13 L
|
||
|
1E84 - 24 U * W WITH DIAERESIS
|
||
|
1E85 - 24 L
|
||
|
1E86 - 35 U * W WITH DOT ABOVE
|
||
|
1E87 - 35 L
|
||
|
1E88 - 36 U * W WITH DOT BELOW
|
||
|
1E89 - 36 L
|
||
|
* Followed by accented vowels after W and before X:
|
||
|
00D5 4/1272 31 U * O WITH TILDE
|
||
|
00F5 - 31 L
|
||
|
00C4 4/1273 24 U * A WITH DIAERESIS
|
||
|
00E4 - 24 L
|
||
|
00D6 4/1274 24 U * O WITH DIAERESIS
|
||
|
00F6 - 24 L
|
||
|
00DC 4/1275 24 U * U WITH DIAERESIS
|
||
|
00FC - 24 L
|
||
|
*
|
||
|
* END
|