tldm-universe/Ardent/UV/nls/locales/WT.TABLES/LATIN-TURKISH

2024-09-09 21:51:08 +00:00
* Sorting weight table that places the TURKISH characters (from ISO 8859-9)
* in order, layered on top of the LATIN1/LATINX tables. These characters are:
*
* Between G and H: G BREVE
* Between H and J: I WITH DOT ABOVE (uppercase version of SMALL I 0069)
*                  DOTLESS I (lowercase version of CAPITAL I 0049)
* (Note: sequence is H, dotless I, I dot + accented versions, J, ...)
* Between S and T: S CEDILLA
*
* SYNTAX:
* Each non-comment line gives one or more weights for a character,
* as follows (character value in hex, weights in decimal):
* Field 1 = Unicode character value
* Field 2 = Shared weight (characters that sort together if accents
*           and case were to be disregarded should have the same SW)
*           Or, Block Weight/Shared Weight. This form allows characters
*           in different Unicode blocks to have equal SWs. If BW is
*           omitted, only SWs for characters in the same block are equal.
* Field 3 = Accent weight, or '-' to omit or copy from previous.
*           Please use values as defined in the file NLS.WT.LOOKUP.
* Field 4 = Case weight, or 'U' for upper and 'L' for lower case chars.
*
**********************************************************************
* HEX   (BW/)SW  AW  CW
* After G:
011E    4/1092   5   U    * G WITH BREVE
011F    -        5   L
* I, dotted and undotted:
* (Note: we do not use AWs here, but use SWs to differentiate
* these characters from the unaccented versions.)
0049    4/1109   -   U    * I
0131    -        -   L    * DOTLESS I
0130    4/1110   -   U    * I WITH DOT ABOVE
0069    -        -   L    * I
* S cedilla:
015E    4/1232   40  U    * S WITH CEDILLA
015F    -        40  L
*
* END
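
The field layout described in the SYNTAX comment can be read mechanically. The following is a minimal Python sketch, not the UniVerse NLS loader itself: the names `Weight`, `parse_table` and `TABLE` are hypothetical, and two points are assumptions inferred from the data lines rather than stated rules. First, a '-' in field 2 is taken to repeat the block/shared weight of the previous data line (this is what lets 0131 share an SW with 0049). Second, a '-' in field 3 is recorded as "no accent weight given here" rather than copied.

```python
from typing import Dict, NamedTuple, Optional, Tuple

# The data lines of this table, reproduced verbatim.
TABLE = """\
011E 4/1092 5 U * G WITH BREVE
011F - 5 L
0049 4/1109 - U * I
0131 - - L * DOTLESS I
0130 4/1110 - U * I WITH DOT ABOVE
0069 - - L * I
015E 4/1232 40 U * S WITH CEDILLA
015F - 40 L
"""


class Weight(NamedTuple):
    block: Optional[int]   # BW, or None if field 2 had no "BW/" prefix
    shared: int            # SW
    accent: Optional[int]  # AW, or None when the field is '-'
    case: str              # CW as written ('U', 'L', or a number)


def parse_table(text: str) -> Dict[str, Weight]:
    """Parse weight-table lines into {character: Weight}."""
    weights: Dict[str, Weight] = {}
    prev: Optional[Tuple[Optional[int], int]] = None  # last (BW, SW) seen
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("*"):
            continue  # blank line or full-line comment
        # Drop the trailing "* comment", then split the four fields.
        fields = line.split("*", 1)[0].split()
        if len(fields) != 4:
            raise ValueError(f"expected 4 fields: {raw!r}")
        code, sw, aw, cw = fields
        if sw == "-":
            # ASSUMPTION: '-' repeats the previous line's BW/SW.
            if prev is None:
                raise ValueError(f"'-' with no previous weight: {raw!r}")
            block, shared = prev
        elif "/" in sw:
            bw_text, sw_text = sw.split("/", 1)
            block, shared = int(bw_text), int(sw_text)
        else:
            block, shared = None, int(sw)
        prev = (block, shared)
        # ASSUMPTION: '-' in the AW field means "not given here".
        accent = None if aw == "-" else int(aw)
        weights[chr(int(code, 16))] = Weight(block, shared, accent, cw)
    return weights


if __name__ == "__main__":
    for char, weight in parse_table(TABLE).items():
        print(f"U+{ord(char):04X}", weight)
```

Under these assumptions DOTLESS I (0131) ends up with the same shared weight as CAPITAL I (0049), and SMALL I (0069) with the same shared weight as I WITH DOT ABOVE (0130), so comparing by shared weight places the dotless pair ahead of the dotted pair, as the header note describes.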