tldm-universe/Ardent/UV/nls/locales/WT.TABLES/CYRILLIC-DEFAULT

2024-09-09 21:51:08 +00:00
* Sorting weight table for Cyrillic alphabet, default order.
* This covers Unicode block 12, CYRILLIC (0400 - 04FF),
* but not every character in the block has a specified order.
*
* The Basic Russian alphabet, and other alphabetic characters that
* appear in ISO 8859-5, are all assigned weights. Any other characters in
* this block, essentially some 'historic' letters and 'extended Cyrillic',
* will sort before any of the ones mentioned, in their Unicode codepoint
* order. (That is the reason weights here start at 2000.)
*
* Note that accented characters are treated as characters in their
* own right, therefore accent weights are not assigned.
*
* SYNTAX:
*   Each non-comment line gives one or more weights for a character,
*   as follows (character value in hex, weights in decimal):
*     Field 1 = Unicode character value
*     Field 2 = Shared weight (characters that sort together if accents
*               and case were to be disregarded should have the same SW)
*               Or, Block Weight/Shared Weight. This form allows characters
*               in different Unicode blocks to have equal SWs. If BW is
*               omitted, only SWs for characters in the same block are equal.
*     Field 3 = Accent weight, or '-' to omit or copy from previous.
*               Please use values as defined in the file NLS.WT.LOOKUP.
*     Field 4 = Case weight, or 'U' for upper and 'L' for lower case chars.
*
* HEX (BW/)SW AW CW
0410 12/2000 - U * A
0430 - - L
0411 12/2010 - U * BE
0431 - - L
0412 12/2020 - U * VE
0432 - - L
0413 12/2030 - U * GHE
0433 - - L
0403 12/2040 - U * GJE
0453 - - L
0490 12/2050 - U * GHE WITH UPTURN
0491 - - L
0414 12/2060 - U * DE
0434 - - L
0402 12/2070 - U * DJE
0452 - - L
0415 12/2080 - U * IE
0435 - - L
0401 12/2090 - U * IO
0451 - - L
0404 12/2100 - U * UKRAINIAN IE
0454 - - L
0416 12/2110 - U * ZHE
0436 - - L
0417 12/2120 - U * ZE
0437 - - L
0418 12/2130 - U * I
0438 - - L
0406 12/2140 - U * BYELORUSSIAN-UKRAINIAN I
0456 - - L
0407 12/2150 - U * YI
0457 - - L
0408 12/2160 - U * JE
0458 - - L
0419 12/2170 - U * SHORT I
0439 - - L
041A 12/2180 - U * KA
043A - - L
040C 12/2190 - U * KJE
045C - - L
041B 12/2200 - U * EL
043B - - L
0409 12/2210 - U * LJE
0459 - - L
041C 12/2220 - U * EM
043C - - L
041D 12/2230 - U * EN
043D - - L
040A 12/2240 - U * NJE
045A - - L
041E 12/2250 - U * O
043E - - L
041F 12/2260 - U * PE
043F - - L
0420 12/2270 - U * ER
0440 - - L
0421 12/2280 - U * ES
0441 - - L
0422 12/2290 - U * TE
0442 - - L
040B 12/2300 - U * TSHE
045B - - L
0423 12/2310 - U * U
0443 - - L
040E 12/2320 - U * SHORT U
045E - - L
0424 12/2330 - U * EF
0444 - - L
0425 12/2340 - U * HA
0445 - - L
0426 12/2350 - U * TSE
0446 - - L
0405 12/2360 - U * DZE
0455 - - L
0427 12/2370 - U * CHE
0447 - - L
040F 12/2380 - U * DZHE
045F - - L
0428 12/2390 - U * SHA
0448 - - L
0429 12/2400 - U * SHCHA
0449 - - L
044A 12/2410 - L * HARD SIGN
042A - - U
042B 12/2420 - U * YERU
044B - - L
044C 12/2430 - L * SOFT SIGN
042C - - U
042D 12/2440 - U * E
044D - - L
042E 12/2450 - U * YU
044E - - L
042F 12/2460 - U * YA
044F - - L
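
The syntax described in the header comments can be illustrated with a small reader. The following Python sketch is not part of the NLS tooling; `parse_weight_table`, `sort_key`, and `Weights` are hypothetical names introduced here. It rests on three assumptions that the file does not state outright: a '-' in Field 2 copies the shared weight from the previous line (inferred from the lowercase rows in the table), upper case sorts before lower case when shared weights tie, and case is compared only after all shared weights in a string have been compared.

```python
from typing import Dict, List, NamedTuple, Optional, Tuple


class Weights(NamedTuple):
    block: Optional[int]   # BW, or None when the BW/ prefix is omitted
    shared: int            # SW
    accent: Optional[str]  # AW kept as text; '-' is stored as None
    case: str              # 'U', 'L', or an explicit value kept as text


def parse_weight_table(text: str) -> Dict[int, Weights]:
    """Map each code point to its weights, resolving '-' in Field 2."""
    table: Dict[int, Weights] = {}
    prev: Optional[Weights] = None
    for raw in text.splitlines():
        # Everything from the first '*' onward is a comment.
        line = raw.split("*", 1)[0].strip()
        if not line:
            continue
        fields = line.split()
        if len(fields) != 4:
            raise ValueError(f"expected 4 fields, got: {raw!r}")
        code, sw, aw, cw = fields
        if sw == "-":
            # ASSUMPTION: '-' in Field 2 reuses the previous line's BW/SW.
            if prev is None:
                raise ValueError(f"no previous weight to copy: {raw!r}")
            block, shared = prev.block, prev.shared
        elif "/" in sw:
            bw_text, sw_text = sw.split("/", 1)
            block, shared = int(bw_text), int(sw_text)
        else:
            block, shared = None, int(sw)
        entry = Weights(block, shared, None if aw == "-" else aw, cw)
        table[int(code, 16)] = entry  # Field 1 is hexadecimal
        prev = entry
    return table


# ASSUMPTION: upper case before lower case; numeric case weights are
# not interpreted by this sketch and fall back to rank 0.
_CASE_RANK = {"U": 0, "L": 1}


def sort_key(s: str, table: Dict[int, Weights]) -> Tuple[List[Tuple[int, int]], List[int]]:
    """Two-level key: shared weights first, case as the tie-breaker."""
    primary: List[Tuple[int, int]] = []
    case: List[int] = []
    for ch in s:
        entry = table.get(ord(ch))
        if entry is None:
            # Unweighted characters sort first, in code point order.
            primary.append((0, ord(ch)))
            case.append(0)
        else:
            primary.append((1, entry.shared))
            case.append(_CASE_RANK.get(entry.case, 0))
    return primary, case


if __name__ == "__main__":
    sample = (
        "0410 12/2000 - U * A\n"
        "0430 - - L\n"
        "0411 12/2010 - U * BE\n"
        "0431 - - L\n"
    )
    table = parse_weight_table(sample)
    words = ["\u0431", "\u0410", "\u0430", "\u0462"]
    print(sorted(words, key=lambda w: sort_key(w, table)))
```

In the usage at the bottom, U+0462 has no entry in the sample, so it sorts ahead of the weighted letters, matching the header note that unweighted characters in the block sort first in code point order. The sketch ignores the block weight when comparing, which is adequate only while every entry comes from the same block.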