Silvan Jegen dixit:
>That sounds reasonable but requires that we convert UTF-8 to UTF-32
>which should not be strictly necessary when we only map one UTF-8 value
>to another.
Arrgh, no. UTF-8 and UTF-32/UCS-4 are encodings of numerical Unicode
codepoints. When working with text documents, you always operate on
those codepoints. This was true for single-byte encodings as well,
except that there, every codepoint fit into a single byte.
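
To illustrate, here is a rough sketch in Python (not the code under
discussion; the name tr_codepoints and the sample mapping are made up
for this example). It decodes UTF-8 into codepoints, maps each one,
and encodes the result again. The mapped codepoint here needs three
bytes where the original needed two, which is why mapping at the byte
level does not work in general:

```python
def tr_codepoints(data: bytes, mapping: dict) -> bytes:
    """Decode UTF-8 to codepoints, map each one, re-encode as UTF-8.

    `mapping` is a hypothetical dict from source codepoint (int) to
    target codepoint (int); unmapped codepoints pass through unchanged.
    """
    codepoints = [ord(ch) for ch in data.decode("utf-8")]
    mapped = [mapping.get(cp, cp) for cp in codepoints]
    return "".join(chr(cp) for cp in mapped).encode("utf-8")


# Hypothetical mapping: U+00E4 (a with diaeresis) -> U+20AC (euro sign)
mapping = {0x00E4: 0x20AC}

src = "k\u00e4se".encode("utf-8")   # U+00E4 takes 2 bytes in UTF-8
out = tr_codepoints(src, mapping)   # U+20AC takes 3 bytes in UTF-8

print(len(src), len(out))
```

The mapping table is keyed on codepoints, so it does not care how
many bytes either side occupies in any particular encoding.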
bye,
//mirabilos
--
08:05⎜<XTaran:#grml> mika: Does grml have an tool to read Apple
⎜ System Log (asl) files? :)
08:08⎜<ft:#grml> yeah. /bin/rm. ;) 08:09⎜<mrud:#grml> hexdump -C
08:31⎜<XTaran:#grml> ft, mrud: *g*
Received on Sat Nov 30 2013 - 13:33:25 CET