RE: [xml] XML and Unicode Determination



-----Original Message-----
From: xml-bounces gnome org [mailto:xml-bounces gnome org] On Behalf Of
Daniel Veillard
Sent: Friday, March 03, 2006 3:47 AM
To: Browder, Tom
Cc: xml gnome org
Subject: Re: [xml] XML and Unicode Determination

> check the first byte of content you received with the high order bit set,
> since all strings are UTF-8 internally.
> There is no way the parser can tell you when this happens because encoding
> conversions to UTF-8 are done block by block as a prior step to parsing.
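
For reference, a minimal sketch of the check described above, assuming the
content is available as a NUL-terminated UTF-8 byte string (libxml2's xmlChar
is an unsigned char). The helper name has_non_ascii is hypothetical and is
not part of libxml2:

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not a libxml2 function): returns 1 if the
 * NUL-terminated UTF-8 string contains any byte with the high-order
 * bit set, i.e. any character outside 7-bit ASCII; returns 0 otherwise. */
static int has_non_ascii(const unsigned char *s)
{
    assert(s != NULL);
    for (; *s != '\0'; s++) {
        if (*s & 0x80)   /* high-order bit set: part of a multi-byte sequence */
            return 1;
    }
    return 0;
}
```

In UTF-8 every byte of a multi-byte sequence has the high bit set, so a single
pass over the bytes is enough; the caller would have to run it on each string
received from the parser.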

It would be sufficient if I could ask the parser whether any non-ASCII
Unicode character was found.  The parser could set a boolean when the first
non-ASCII character is encountered.  Wouldn't that be easy for you to do
during parsing?

-Tom



