RE: How to determine Script from given Unicode text?



On Fri, 2005-10-21 at 10:50 -0700, Gaurav Jain wrote:
[...]
> I have a unicode character, and I need to find out what script
> it belongs to.  Is it possible to determine this using
> a single Unicode character?

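In general no, if by "script" you mean the writing system
of one particular language: a single Unicode character
can be used by more than one (e.g. the same ideograph
serves as Chinese Hanzi, Japanese Kanji, and Korean Hanja;
this is the "Han unification" that was a controversial
part of Unicode). The Unicode Character Database does
give each code point a Script property, but unified
ideographs are all simply "Han", and shared characters
such as digits and punctuation are "Common".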

You may find the gucharmap code useful, though, or the
W3C Internationalisation pages (www.w3.org/International/)
or www.Unicode.org.
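
As a rough illustration of a per-character lookup, here is a
minimal Python sketch. It assumes range data in the format of
the Unicode Character Database file Scripts.txt; the SAMPLE
table below is a small hand-written excerpt, not the real file,
and parse_scripts and script_of are hypothetical helper names
made up for this example.

```python
import bisect

# Hand-written sample in the format of the Unicode Character Database
# file Scripts.txt; a real program would load the full file instead.
SAMPLE = """\
0030..0039    ; Common   # digits
0041..005A    ; Latin    # A-Z
0061..007A    ; Latin    # a-z
3041..3096    ; Hiragana
30A1..30FA    ; Katakana
4E00..9FA5    ; Han      # unified ideographs (partial range)
AC00..D7A3    ; Hangul
"""


def parse_scripts(text):
    """Return a sorted list of (first, last, script) tuples."""
    ranges = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        codes, script = (part.strip() for part in line.split(";"))
        first, _, last = codes.partition("..")
        # A line may name a single code point instead of a range.
        ranges.append((int(first, 16), int(last or first, 16), script))
    ranges.sort()
    return ranges


def script_of(ch, ranges):
    """Look up the script of one character; 'Unknown' if not listed."""
    cp = ord(ch)
    starts = [r[0] for r in ranges]
    # Find the last range that starts at or before this code point.
    i = bisect.bisect_right(starts, cp) - 1
    if i >= 0 and ranges[i][0] <= cp <= ranges[i][1]:
        return ranges[i][2]
    return "Unknown"


if __name__ == "__main__":
    table = parse_scripts(SAMPLE)
    for ch in "a7\u4e2d\u3042\ud55c":
        print("U+%04X" % ord(ch), script_of(ch, table))
```

Note that U+4E2D comes back as "Han" whether the surrounding
text is Chinese or Japanese, so a lookup like this cannot tell
the two apart from one character alone.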

Liam

-- 
Liam Quin - XML Activity Lead, W3C, http://www.w3.org/People/Quin/
Pictures from old books: http://fromoldbooks.org/
Ankh: irc.sorcery.net irc.gnome.org www.advogato.org




