Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

From: Ian D. Scott <ian_AT_perebruin.com>
Date: Sat, 10 Jan 2015 16:11:09 -0800

On Sat, Jan 10, 2015 at 06:56:45PM -0500, random832_AT_fastmail.us wrote:
> On Sat, Jan 10, 2015, at 16:47, Markus Wichmann wrote:
> > You wanted to be Unicode compatible, right? Because in that case I
> > expect [:alpha:] to be the class of all characters in General Category L
> > (that is, Lu, Ll, Lt, Lm, or Lo). That includes a few more characters
> > than just A-Z and a-z. And I don't see you add any other character to
> > that class later.
>
> Note that translating between [:upper:] and [:lower:] requires using the
> toupper and tolower mapping, rather than just dumping the character
> classes (since otherwise you'll run into there being something like ß
> that is in [:lower:] and has no counterpart in [:upper:], or they're in
> a different order)
>

Actually, ẞ, capital of ß, was added in Unicode 5.1. There are probably
others letters with this issue, however.
Received on Sun Jan 11 2015 - 01:11:09 CET

This archive was generated by hypermail 2.3.0 : Sun Jan 11 2015 - 01:12:07 CET