mirror of
https://github.com/torvalds/linux.git
synced 2024-11-16 17:12:06 +00:00
74675a5850
This patch (as1239) updates the kernel's treatment of Unicode. The character-set conversion routines are well behind the current state of the Unicode specification: They don't recognize the existence of code points beyond plane 0 or of surrogate pairs in the UTF-16 encoding. The old wchar_t 16-bit type is retained because it's still used in lots of places. This shouldn't cause any new problems; if a conversion now results in an invalid 16-bit code then before it must have yielded an undefined code. Difficult-to-read names like "utf_mbstowcs" are replaced with more transparent names like "utf8s_to_utf16s" and the ordering of the parameters is rationalized (buffer lengths come immediate after the pointers they refer to, and the inputs precede the outputs). Fortunately the low-level conversion routines are used in only a few places; the interfaces to the higher-level uni2char and char2uni methods have been left unchanged. Signed-off-by: Alan Stern <stern@rowland.harvard.edu> Acked-by: Clemens Ladisch <clemens@ladisch.de> Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de> |
||
---|---|---|
.. | ||
Kconfig | ||
Makefile | ||
nls_ascii.c | ||
nls_base.c | ||
nls_cp437.c | ||
nls_cp737.c | ||
nls_cp775.c | ||
nls_cp850.c | ||
nls_cp852.c | ||
nls_cp855.c | ||
nls_cp857.c | ||
nls_cp860.c | ||
nls_cp861.c | ||
nls_cp862.c | ||
nls_cp863.c | ||
nls_cp864.c | ||
nls_cp865.c | ||
nls_cp866.c | ||
nls_cp869.c | ||
nls_cp874.c | ||
nls_cp932.c | ||
nls_cp936.c | ||
nls_cp949.c | ||
nls_cp950.c | ||
nls_cp1250.c | ||
nls_cp1251.c | ||
nls_cp1255.c | ||
nls_euc-jp.c | ||
nls_iso8859-1.c | ||
nls_iso8859-2.c | ||
nls_iso8859-3.c | ||
nls_iso8859-4.c | ||
nls_iso8859-5.c | ||
nls_iso8859-6.c | ||
nls_iso8859-7.c | ||
nls_iso8859-9.c | ||
nls_iso8859-13.c | ||
nls_iso8859-14.c | ||
nls_iso8859-15.c | ||
nls_koi8-r.c | ||
nls_koi8-ru.c | ||
nls_koi8-u.c | ||
nls_utf8.c |