* Sorting weight table for the Cyrillic alphabet, default order.
* This covers Unicode block 12, CYRILLIC (0400 - 04FF),
* but not every character in the block has a specified order.
*
* The basic Russian alphabet, and the other alphabetic characters that
* appear in ISO 8859-5, are all assigned weights. Any other characters in
* this block, essentially some 'historic' letters and 'extended Cyrillic',
* sort before all of the weighted ones, in their Unicode code point
* order. (That is the reason the weights here start at 2000.)
*
* Note that accented characters are treated as characters in their
* own right; therefore accent weights are not assigned.
*
* SYNTAX:
* Each non-comment line gives one or more weights for a character,
* as follows (character value in hex, weights in decimal):
*   Field 1 = Unicode character value
*   Field 2 = Shared weight (characters that would sort together if
*             accents and case were disregarded should have the same SW),
*             or Block Weight/Shared Weight. This form allows characters
*             in different Unicode blocks to have equal SWs. If BW is
*             omitted, only SWs for characters in the same block are equal.
*   Field 3 = Accent weight, or '-' to omit or copy from previous.
*             Please use values as defined in the file NLS.WT.LOOKUP.
*   Field 4 = Case weight, or 'U' for upper and 'L' for lower case chars.
*
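* The syntax described above can be illustrated with a small reader,
* sketched here in Python. It is not part of this table and not the
* loader that actually consumes it. The names Weight and
* parse_weight_table are hypothetical. Two readings are assumptions:
* a '-' in Field 2 is taken to repeat the BW/SW of the previous line
* (as the lower case entries in this table suggest), and a '-' in
* Field 3 is taken to mean "no accent weight".

```python
from typing import NamedTuple, Optional


class Weight(NamedTuple):
    codepoint: int          # Field 1, parsed from hex
    block: Optional[int]    # BW part of Field 2, None if omitted
    shared: int             # SW part of Field 2
    accent: Optional[int]   # Field 3, None when written as '-'
    case: str               # Field 4 as written ('U', 'L' or a number)


def parse_weight_table(text: str) -> list[Weight]:
    """Parse weight lines; '*' starts a comment (whole-line or trailing)."""
    rows: list[Weight] = []
    for raw in text.splitlines():
        line = raw.split("*", 1)[0].strip()
        if not line:
            continue  # blank or comment-only line
        hex_value, sw_field, aw_field, cw_field = line.split()
        if sw_field == "-":
            # Assumption: '-' repeats the BW/SW of the previous entry.
            if not rows:
                raise ValueError("'-' shared weight on the first entry")
            block, shared = rows[-1].block, rows[-1].shared
        elif "/" in sw_field:
            bw, sw = sw_field.split("/")
            block, shared = int(bw), int(sw)
        else:
            block, shared = None, int(sw_field)
        # Assumption: '-' in Field 3 means no accent weight is assigned.
        accent = None if aw_field == "-" else int(aw_field)
        rows.append(Weight(int(hex_value, 16), block, shared, accent, cw_field))
    return rows


SAMPLE = """\
* HEX (BW/)SW AW CW
0410 12/2000 - U * A
0430 - - L
0411 12/2010 - U * BE
0431 - - L
"""

for row in parse_weight_table(SAMPLE):
    print(f"{row.codepoint:04X}", row.block, row.shared, row.accent, row.case)
```

* With the four sample entries, 0430 ends up with the same block and
* shared weight as 0410 (12 and 2000) and differs only in its case field.
*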
* HEX (BW/)SW AW CW
0410 12/2000 - U * A
0430 - - L
0411 12/2010 - U * BE
0431 - - L
0412 12/2020 - U * VE
0432 - - L
0413 12/2030 - U * GHE
0433 - - L
0403 12/2040 - U * GJE
0453 - - L
0490 12/2050 - U * GHE WITH UPTURN
0491 - - L
0414 12/2060 - U * DE
0434 - - L
0402 12/2070 - U * DJE
0452 - - L
0415 12/2080 - U * IE
0435 - - L
0401 12/2090 - U * IO
0451 - - L
0404 12/2100 - U * UKRAINIAN IE
0454 - - L
0416 12/2110 - U * ZHE
0436 - - L
0417 12/2120 - U * ZE
0437 - - L
0418 12/2130 - U * I
0438 - - L
0406 12/2140 - U * BYELORUSSIAN-UKRAINIAN I
0456 - - L
0407 12/2150 - U * YI
0457 - - L
0408 12/2160 - U * JE
0458 - - L
0419 12/2170 - U * SHORT I
0439 - - L
041A 12/2180 - U * KA
043A - - L
040C 12/2190 - U * KJE
045C - - L
041B 12/2200 - U * EL
043B - - L
0409 12/2210 - U * LJE
0459 - - L
041C 12/2220 - U * EM
043C - - L
041D 12/2230 - U * EN
043D - - L
040A 12/2240 - U * NJE
045A - - L
041E 12/2250 - U * O
043E - - L
041F 12/2260 - U * PE
043F - - L
0420 12/2270 - U * ER
0440 - - L
0421 12/2280 - U * ES
0441 - - L
0422 12/2290 - U * TE
0442 - - L
040B 12/2300 - U * TSHE
045B - - L
0423 12/2310 - U * U
0443 - - L
040E 12/2320 - U * SHORT U
045E - - L
0424 12/2330 - U * EF
0444 - - L
0425 12/2340 - U * HA
0445 - - L
0426 12/2350 - U * TSE
0446 - - L
0405 12/2360 - U * DZE
0455 - - L
0427 12/2370 - U * CHE
0447 - - L
040F 12/2380 - U * DZHE
045F - - L
0428 12/2390 - U * SHA
0448 - - L
0429 12/2400 - U * SHCHA
0449 - - L
044A 12/2410 - L * HARD SIGN
042A - - U
042B 12/2420 - U * YERU
044B - - L
044C 12/2430 - L * SOFT SIGN
042C - - U
042D 12/2440 - U * E
044D - - L
042E 12/2450 - U * YU
044E - - L
042F 12/2460 - U * YA
044F - - L
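*
* The ordering this table describes can be illustrated with a short
* Python sketch. It is not the collation routine that reads this file.
* WEIGHTS is a hand-copied excerpt of a few entries above; sort_key and
* CASE_ORDER are hypothetical names. Assumptions: shared weights are
* compared first over the whole string and case only breaks ties, upper
* case is placed before lower case (the table itself does not say which
* comes first), and any character without an entry is ranked before all
* weighted ones by code point.

```python
# Excerpt of the table: code point -> (shared weight, case flag).
WEIGHTS = {
    0x0410: (2000, "U"), 0x0430: (2000, "L"),  # A
    0x0411: (2010, "U"), 0x0431: (2010, "L"),  # BE
    0x0401: (2090, "U"), 0x0451: (2090, "L"),  # IO
    0x0416: (2110, "U"), 0x0436: (2110, "L"),  # ZHE
}

# Assumption for illustration only: upper case sorts before lower case.
CASE_ORDER = {"U": 0, "L": 1}


def sort_key(word: str):
    """Build a two-level key: shared weights first, case as tie-break."""
    primary = []
    case_level = []
    for ch in word:
        cp = ord(ch)
        if cp in WEIGHTS:
            shared, case = WEIGHTS[cp]
            primary.append((1, shared))   # weighted: after unweighted
            case_level.append(CASE_ORDER[case])
        else:
            primary.append((0, cp))       # unweighted: code point order
            case_level.append(0)
    return (primary, case_level)


words = [
    "\u0436",  # zhe, small
    "\u0451",  # io, small
    "\u0431",  # be, small
    "\u0411",  # BE, capital
    "\u0473",  # fita, small: a 'historic' letter with no entry here
    "\u0430",  # a, small
]

ordered = sorted(words, key=sort_key)
print([f"{ord(w):04X}" for w in ordered])
```

* Under these assumptions the result is 0473, 0430, 0411, 0431, 0451,
* 0436: the unweighted fita comes first, and IO (2090) sorts before
* ZHE (2110) even though its code point is higher.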