Re: [gtk-i18n-list] Unicode PUA supporting issue in gtk+/pango



On Fri, 30 Dec 2005 10:01:15 -0500
Matthias Clasen <matthias clasen gmail com> wrote:

>On 12/30/05, Christopher Fynn <cfynn gmx net> wrote:
>>
>> PRC are also now using PUA as a "standard" encoding (in GB1300 (? &
>> GB18030 ?))  for pre-composed Tibetan "characters".
>
>standardizing the private use area ?
>sounds a bit oxymoromic to me ....

Aha, "standardize" of precomposed Tibetan characters
is prepared (or finished?) in PRC's national standard.
I never heard PRC is pushing them into International
standards.

PRC government know it's difficult to force computer
vendors to use GB18030 as "internal" character encoding,
so, to use GB18030 (as external character encoding)
in the system whose internal character encoding is
UTF-16, PRC government maps their natinally-standardized
characters to PUA codepoints, if the characters are 
not available in Unicode.

There are 2 different situation in "unavailability".

A. The character is already proposed to Unicode,
   but new version of Unicode is not published yet.

B. The character won't be included in Unicode,
   because it violates Unicode character inclusion
   policy.

The extra Hanzi charset for Hong Kong is A.
The precomposed Tibetan charset is B.

Arne G and Chia-I Wu told that there are similar
issues (A & B) in Taiwanese standard CNS11643.

Regards,
mpsuzuki



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]