summaryrefslogtreecommitdiffstats
path: root/lib/libc/locale/utf8.c
Commit message (Collapse)AuthorAgeFilesLines
* Use conversion state objects to store the accumulated wide character,tjr2004-05-171-63/+67
| | | | | | low bound, and the number of bytes remaining instead of storing the raw byte sequence and deriving them every time mbrtowc() is called. This is much faster -- about twice as fast in some crude benchmarks.
* Move prototypes of various encoding-related functions into a new headertjr2004-05-121-5/+1
| | | | file to avoid extern'ing them all over the place.
* Perform some basic validation of multibyte conversion state objects.tjr2004-04-121-2/+14
|
* Don't cast away const qualifiers.tjr2004-04-101-1/+1
| | | | Spotted by: bde
* Allow partial multibyte characters to accumulate in conversion statetjr2004-04-071-12/+41
| | | | | objects passed to mbrtowc(), mbsrtowcs(), and mbrlen(), as required by C99.
* Fix a typo that caused mbrtowc() to always return 0.tjr2003-11-111-1/+1
|
* Convert the Big5, EUC, MSKanji and UTF-8 encoding methods to implementtjr2003-11-021-71/+69
| | | | | | mbrtowc() and wcrtomb() directly. GB18030, GBK and UTF2 are left unconverted; GB18030 will be done eventually, but GBK and UTF2 may just be removed, as they are subsets of GB18030 and UTF-8 respectively.
* Whack 28 unused variables.nectar2003-02-181-1/+1
|
* Add a UTF-8 encoding method, which will eventually replace the antiquetjr2002-10-101-0/+204
"UTF2" method. Although UTF-8 and the old UTF2 encoding are compatible for 16-bit characters, the new UTF-8 implementation is much more strict about rejecting malformed input and also handles the full 31 bit range of characters.
OpenPOWER on IntegriCloud