Re: UCS-2 in gunicode.h
- From: Robert Brady <robert susu org uk>
- Cc: gtk-devel-list gnome org
- Subject: Re: UCS-2 in gunicode.h
- Date: Sat, 8 Jul 2000 15:58:15 +0100 (BST)
On Fri, 7 Jul 2000, Derek Simkowiak wrote:
> ran acrossed a character which required 32 bits? Would you just
> drop that character alltogether? Treat the 32-bit character as two 16-bit
> characters (which would break your character-count code and possibly not
> be renderable)? What is the standard behaviour for apps that take UTF-8,
> but only support 16-bit Unicode (i.e., what does Java do?)
The thing to do here is to use UTF-16, which is like UCS-2, but can use
two "Surrogates" to represent characters outside the BMP, thus combining
the worst characteristics of UTF-8 and UCS-4. There are enough bits to
represent characters up to plane 16 - and the committee responsible for
such has promiseed that no characters above plane 16 will ever be
allocated.
--
Robert
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]