author | ache <ache@FreeBSD.org> | 1996-08-11 14:55:14 +0000 |
---|---|---|
committer | ache <ache@FreeBSD.org> | 1996-08-11 14:55:14 +0000 |
commit | a254ea6e33e6f4ca427e699c84c01263e4d8707d (patch) | |
tree | 68325199307152509168aeae16d65bf882637fee /usr.bin/colldef | |
parent | b6b7ab5f10e6646df43349effd5daa3d124c1636 (diff) | |
Describe collation order rules (from POSIX WG)
Diffstat (limited to 'usr.bin/colldef')
-rw-r--r-- | usr.bin/colldef/NOTE | 35 |
1 files changed, 35 insertions, 0 deletions
diff --git a/usr.bin/colldef/NOTE b/usr.bin/colldef/NOTE
new file mode 100644
index 0000000..327adf5
--- /dev/null
+++ b/usr.bin/colldef/NOTE
@@ -0,0 +1,35 @@
+% Base collation scheme: 1994-03-22
+
+% Ordering algorithm:
+% 1. Spaces and hyphen (but not soft
+% hyphen) before punctuation
+% characters, punctuation characters
+% before numbers,
+% numbers before letters.
+% 2. Letters with diacritical marks are
+% members of equivalence classes
+% 3. A distinction is made with regards
+% to case as noted below.
+% 4. Special characters are ignored
+% when comparing letters, but then
+% they are considered
+% 5. The alphabets are sorted in order
+% of first appearance in ISO 10646:
+% Latin, Greek, Cyrillic, etc.
+%
+% According to Greek specifications,
+% the steps 2. and 3. above are reversed
+% for the Greek script
+
+% collating symbols
+
+collating-symbol <CAPITAL>
+collating-symbol <CAPITAL-SMALL>
+collating-symbol <SMALL-CAPITAL>
+collating-symbol <SMALL>
+
+% <CAPITAL-SMALL> and <SMALL-CAPITAL>
+% are for cases like Danish <A><a>
+% and Spanish <C><h> being treated
+% as one letter.
+
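The ordering rules in the NOTE file above can be read as a multi-level sort key. The Python sketch below is one possible interpretation, not the colldef implementation: the function name `sort_key`, the use of Unicode canonical decomposition to find a letter's base form, and the choice of capital before small (borrowed from the declaration order of the collating symbols) are all assumptions made for illustration. It does not implement the Greek exception (steps 2 and 3 reversed) or the digraph symbols `<CAPITAL-SMALL>` and `<SMALL-CAPITAL>`.

```python
import unicodedata


def sort_key(text):
    """Build a four-level key following one reading of the NOTE rules.

    Hypothetical helper for illustration only; colldef itself works
    from explicit per-character tables, not from Unicode properties.
    """
    primary = []    # rules 1 and 5: digits before letters, scripts by code point
    secondary = []  # rule 2: diacritics only matter after base letters tie
    tertiary = []   # rule 3: case (assumed: capital before small)
    special = []    # rule 4: spaces and punctuation, considered last

    # Compose first so a base letter and its mark arrive as one character.
    for pos, ch in enumerate(unicodedata.normalize("NFC", text)):
        decomposed = unicodedata.normalize("NFD", ch)
        base, marks = decomposed[0], decomposed[1:]
        if base.isalpha():
            # Latin, Greek, Cyrillic fall out of code point order (rule 5).
            primary.append((3, base.lower()))
            secondary.append(marks)
            tertiary.append(0 if base.isupper() else 1)
        elif base.isdigit():
            primary.append((2, base))
            secondary.append("")
            tertiary.append(0)
        elif ch.isspace() or ch == "-":
            # Spaces and hyphen rank ahead of other punctuation (rule 1).
            # The soft hyphen (U+00AD) is not matched here and so is
            # grouped with punctuation.
            special.append((pos, 0, ord(ch)))
        else:
            special.append((pos, 1, ord(ch)))
    return (primary, secondary, tertiary, special)


words = ["résumé", "resume", "Resume", "re-sume", "42", "zebra", "αβγ", "яблоко"]
print(sorted(words, key=sort_key))
# ['42', 'Resume', 'resume', 're-sume', 'résumé', 'zebra', 'αβγ', 'яблоко']
```

Comparing the levels in this order means that a difference in base letters always outweighs a difference in diacritics, which outweighs case, and special characters only break ties that remain after all three.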