Re: GB18030 multi-byte to wchar



dleach qnx com wrote:
> Hello Everyone,
> 
> I need some insight on converting a GB18030 encoded multi-byte character
> to a wchar_t.  The OS I use does not have fully implemented locale support
> in the mbtowc routines :(.  Anyhow, this is what I do have:
> 
> The GB18030 multi-byte character.
> The bytes broken into separate entities.
> The number of bytes that comprise the GB18030 character (1, 2, or 4).
> 
> What do I do now with the separated bytes to make the
> wchar value?  I know how to make a conversion to
> a Unicode wchar, but not a "calculation" to a GB18030 wchar.
> Is it a simple formula, or are there lookup tables involved?
> 
> Is there an all-encompassing website/document (in English) that describes
> all possible desired manipulations for GB18030?
> 
> Any insight appreciated.
> 
> Regards.
> _______________________________________________
> gtk-i18n-list mailing list
> gtk-i18n-list gnome org
> http://mail.gnome.org/mailman/listinfo/gtk-i18n-list
> 
> 
Take a look at ICU at http://oss.software.ibm.com/icu/. Perhaps you can
use that? I used it briefly last year and, as far as I remember, it has
the converters you need.

ICU is free (as far as I know) and you get the source code.
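
On the formula-versus-table question, here is a rough sketch in C of the
parts of GB18030 that are pure arithmetic: one-byte values below 0x80 are
ASCII, and four-byte sequences from 90 30 81 30 upward map linearly onto
U+10000..U+10FFFF. Two-byte sequences and the four-byte sequences that land
in the BMP need lookup tables, which this sketch does not include (it
returns -1 for them). The function names are made up for illustration, and
the result is a Unicode code point, which is only the wchar_t value on
systems where wchar_t holds UCS-4.

```c
#include <assert.h>
#include <stddef.h>

/* Linear index of a four-byte GB18030 sequence (0 for 81 30 81 30),
 * or -1 if any byte is outside its legal range.
 * Byte ranges: b0 and b2 in 0x81..0xFE (126 values),
 *              b1 and b3 in 0x30..0x39 (10 values). */
static long gb18030_four_byte_index(const unsigned char *b)
{
    if (b[0] < 0x81 || b[0] > 0xFE || b[1] < 0x30 || b[1] > 0x39 ||
        b[2] < 0x81 || b[2] > 0xFE || b[3] < 0x30 || b[3] > 0x39)
        return -1;
    return (((long)(b[0] - 0x81) * 10 + (b[1] - 0x30)) * 126
            + (b[2] - 0x81)) * 10 + (b[3] - 0x30);
}

/* Hypothetical helper: returns the Unicode code point for the cases that
 * need no table, or -1 when a lookup table would be required (or the
 * input is invalid). */
static long gb18030_to_code_point(const unsigned char *b, size_t len)
{
    /* Linear index of 90 30 81 30, the sequence for U+10000. */
    const long supplementary_base = 189000L;

    assert(b != NULL);

    if (len == 1 && b[0] < 0x80)
        return b[0];                      /* ASCII maps to itself */

    if (len == 4) {
        long idx = gb18030_four_byte_index(b);
        if (idx >= supplementary_base &&
            idx <= supplementary_base + 0xFFFFFL)
            return 0x10000L + (idx - supplementary_base);
    }

    return -1;                            /* table-driven case or invalid */
}
```

For example, the bytes 90 30 81 30 give 0x10000, while a two-byte
sequence such as B0 A1 gives -1 because it has to go through a table.
A converter library like ICU carries those tables for you.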

/jan




