tldm-universe/Ardent/UV/nls/locales/WT.TABLES/LATINX-DEFAULT
2024-09-09 17:51:08 -04:00

181 lines
4.7 KiB
Plaintext
Executable File

* Sorting weight table for European Extended Latin (A), default order.
* This covers Unicode block 5 LATIN EXTENDED-A (0100 - 017F)
*
* The table is intended to be loaded after 'LATIN1-DEFAULT', which
* covers blocks 2 & 4. Note that all these block 5 characters are
* forced into block 4, so that all alphabetic characters in blocks 2,
* 4 & 5 sort together. This means that the control characters (blocks 1 and 3)
* sort before all of these, in Unicode value order.
*
* SYNTAX:
* Each non-comment line gives one or more weights for a character,
* as follows (character value in hex, weights in decimal):
* Field 1 = Unicode character value
* Field 2 = Shared weight (characters that sort together if accents
* and case were to be disregarded should have the same SW)
* Or, Block Weight/Shared Weight. This form allows characters
* in different Unicode blocks to have equal SWs. If BW is
* omitted, only SWs for characters in the same block are equal.
* Field 3 = Accent weight, or '-' to omit or copy from previous.
* Please use values as defined in the file NLS.WT.LOOKUP.
* Field 4 = Case weight, or 'U' for upper and 'L' for lower case chars.
*
* HEX (BW/)SW AW CW
* A-related
0102 4/1000 5 U * A WITH BREVE
0103 - 5 L
0104 - 44 U * A WITH OGONEK
0105 - 44 L
0100 - 46 U * A WITH MACRON
0101 - 46 L
* C-related
0106 4/1030 1 U * C WITH ACUTE
0107 - 1 L
0108 - 13 U * C WITH CIRCUMFLEX
0109 - 13 L
010C - 19 U * C WITH CARON
010D - 19 L
010A - 35 U * C WITH DOT ABOVE
010B - 35 L
* D-related
010E 4/1040 19 U * D WITH CARON
010F - 19 L
0110 - 38 U * D WITH STROKE
0111 - 38 L
* E-related
0114 4/1060 5 U * E WITH BREVE
0115 - 5 L
011A - 19 U * E WITH CARON
011B - 19 L
0116 - 35 U * E WITH DOT ABOVE
0117 - 35 L
0118 - 44 U * E WITH OGONEK
0119 - 44 L
0112 - 46 U * E WITH MACRON
0113 - 46 L
* G-related
011E 4/1090 5 U * G WITH BREVE
011F - 5 L
011C - 13 U * G WITH CIRCUMFLEX
011D - 13 L
0120 - 35 U * G WITH DOT ABOVE
0121 - 35 L
0122 - 40 U * G WITH CEDILLA
0123 - 40 L
* H-related
0124 4/1100 13 U * H WITH CIRCUMFLEX
0125 - 13 L
0126 - 38 U * H WITH STROKE
0127 - 38 L
* I-related
012C 4/1110 5 U * I WITH BREVE
012D - 5 L
0128 - 31 U * I WITH TILDE
0129 - 31 L
0130 - 35 U * I WITH DOT ABOVE
012E - 44 U * I WITH OGONEK
012F - 44 L
012A - 46 U * I WITH MACRON
012B - 46 L
* Dotless lowercase I, comes after i
0131 4/1117 - L * DOTLESS I
* IJ ligature, comes after I
0132 4/1119 - U * LIGATURE IJ
0133 - - L
*J-related
0134 4/1120 13 U * J WITH CIRCUMFLEX
0135 - 13 L
* K-related
0136 4/1130 40 U * K WITH CEDILLA
0137 - 40 L
* Letter KRA, comes after K
0138 4/1137 - L * KRA
* L-related
0139 4/1140 1 U * L WITH ACUTE
013A - 1 L
013D - 19 U * L WITH CARON
013E - 19 L
013B - 40 U * L WITH CEDILLA
013C - 40 L
* L with middle dot, comes after L
013F 4/1145 - U * L WITH MIDDLE DOT
0140 - - L
* L with slash, comes after L
0141 4/1147 38 U * L WITH STROKE
0142 - 38 L
* N-related
0143 4/1160 1 U * N WITH ACUTE
0144 - 1 L
0147 - 19 U * N WITH CARON
0148 - 19 L
0145 - 40 U * N WITH CEDILLA
0146 - 40 L
* n preceded by apostrophe, comes after n
0149 4/1165 - L * N PRECEDED BY APOSTROPHE
* ENG, comes after N
014A 4/1167 - U * ENG
014B - - L
* O-related
014E 4/1170 5 U * O WITH BREVE
014F - 5 L
0150 - 29 U * O WITH DOUBLE ACUTE
0151 - 29 L
014C - 46 U * O WITH MACRON
014D - 46 L
* OE ligature, comes after O
0152 4/1175 - U * LIGATURE OE
0153 - - L
* R-related
0154 4/1210 1 U * R WITH ACUTE
0155 - 1 L
0158 - 19 U * R WITH CARON
0159 - 19 L
0156 - 40 U * R WITH CEDILLA
0157 - 40 L
* S-related
015A 4/1220 1 U * S WITH ACUTE
015B - 1 L
015C - 13 U * S WITH CIRCUMFLEX
015D - 13 L
0160 - 19 U * S WITH CARON
0161 - 19 L
015E - 40 U * S WITH CEDILLA
015F - 40 L
* Letter long s, comes after s
017F 4/1224 - L * LONG S
* T-related
0164 4/1240 19 U * T WITH CARON
0165 - 19 L
0166 - 38 U * T WITH STROKE
0167 - 38 L
0162 - 40 U * T WITH CEDILLA
0163 - 40 L
* U-related
016C 4/1250 5 U * U WITH BREVE
016D - 5 L
016E - 21 U * U WITH RING ABOVE
016F - 21 L
0170 - 29 U * U WITH DOUBLE ACUTE
0171 - 29 L
0168 - 31 U * U WITH TILDE
0169 - 31 L
0172 - 44 U * U WITH OGONEK
0173 - 44 L
016A - 46 U * U WITH MACRON
016B - 46 L
* W-related
0174 4/1270 13 U * W WITH CIRCUMFLEX
0175 - 13 L
* Y-related
0176 4/1290 13 U * Y WITH CIRCUMFLEX
0177 - 13 L
0178 - 24 U * Y WITH DIAERESIS
* Z-related
0179 4/1300 1 U * Z WITH ACUTE
017A - 1 L
017D - 19 U * Z WITH CARON
017E - 19 L
017B - 35 U * Z WITH DOT ABOVE
017C - 35 L
* END