Re: UTF-8 on stdout?



On 2002-07-06 at 16:15 -0500, Lars Clausen wrote:

OK, now I have to admit to being a bit confused <grin>.
Surely what we want is for the accents etc. to be output
correctly both on the screen when using --credits and in
the AUTHORS file? Which will be the same thing in most
cases, right? My question is how to get the UTF-8 encoding
to print out the correct characters from the ASCII
character set (see the current output of --credits to see
what I mean). Or am I missing the point of all of this
UTF-8 stuff totally?

I don't see any problems with the ASCII characters in the
output.  Only the non-ASCII chars in Cyrille's and Hubert's
names look funny.  There may not be a single locale that
would be able to display them all (except if you consider
UTF-8/Unicode a locale).  When I run the output through PAPS,
I do get the right chars.  Note that ASCII doesn't contain é
or è, you're probably thinking of Latin-1.  That output is
just the way it should be, blame the xterm writers for not
udnerstanding UTF-8.

Those were the problems I was referring to :). I think I was
getting mixed up with my terminology: I see your point and I
think I understand. I won't worry about it too much then. I was
thinking of old-fashoned ANSI (255 chars) --- I have to admit I
have no idea how this maps onto character sets these days,
though I think what I'm thinking of basically is Latin-1.
That's the same as ISO 8859-1, right?

In fact, the more I think about this, the more I get confused.
I can't seem to find any good introduction/references to all
this stuff on the web. Does anyone know where I can go to learn
more about character sets etc. these days?

I will leave the output as it is, write the AUTHORS dependency
etc., and then the bug can be closed.

Ta,
Andrew.

-- 
Andrew Ferrier

email: andrew junk new-destiny co uk
web:   http://www.new-destiny.co.uk/andrew/





[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]