Re: [dev] [sbase][RFC] Add a simplistic version of tr

Contemporary messages sorted: [ by date ] [ by thread ] [ by subject ] [ by author ] [ by messages with attachments ]

From: Thorsten Glaser <tg_AT_mirbsd.de>
Date: Sat, 30 Nov 2013 12:33:25 +0000 (UTC)

Silvan Jegen dixit:

>That sounds reasonable but requires that we convert UTF-8 to UTF-32
>which should not be strictly necessary when we only map one UTF-8 value
>to another.

Arrgh, no. UTF-8 and UTF-32/UCS-4 are encodings of numerical Unicode
codepoints. When working with text documents, you always operate on
those codepoints. This was true for single-byte encodings as well,
except there, the codepoints always fit into bytes.

bye,
//mirabilos

-- 
08:05⎜<XTaran:#grml> mika: Does grml have an tool to read Apple
     ⎜    System Log (asl) files? :)
08:08⎜<ft:#grml> yeah. /bin/rm. ;)       08:09⎜<mrud:#grml> hexdump -C
08:31⎜<XTaran:#grml> ft, mrud: *g*

Received on Sat Nov 30 2013 - 13:33:25 CET

This archive was generated by hypermail 2.3.0 : Sat Nov 30 2013 - 13:48:11 CET