Re: [dev] [st] wide characters

From: <random832_AT_fastmail.us>
Date: Mon, 15 Apr 2013 15:25:22 -0400

On Mon, Apr 15, 2013, at 15:16, Strake wrote:
> On 15/04/2013, random832_AT_fastmail.us <random832_AT_fastmail.us> wrote:
> > On Mon, Apr 15, 2013, at 10:58, Martti K├╝hne wrote:
> >> According to a quick google those chars can become as wide as 6
> >> bytes,
> >
> > No, they can't. I have no idea what your source on this is.
>
> In UTF-8 the maximum encoded character length is 6 bytes [1]

What on earth does that have to do with using an int to store the code
point *instead of* the raw UTF-8 bytes (which are used _now_)?

Also, this is out of date; the latest version of unicode (since 2003 at
the latest) limits code points to 0x10FFFF and therefore UTF-8 sequences
to four bytes. Unless your manpage is much older than mine, it states
this clearly and you misread it.
Received on Mon Apr 15 2013 - 21:25:22 CEST

This archive was generated by hypermail 2.3.0 : Mon Apr 15 2013 - 21:36:05 CEST