Re: UCS-2 in gunicode.h



-> The thing to do here is to use UTF-16, which is like UCS-2, but can use
-> two "Surrogates" to represent characters outside the BMP

	No, it's not.  If I wanted a multi-byte encoding I'd just stick to
UTF-8.

	The whole point of a fixed-width encoding scheme is to have fast
searches and fast pointer arithmetic, and to always know that 'n' bytes is
exactly one character.  A gapped text buffer can't work with a multi-byte
encoding (for example).
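
	To make the indexing argument concrete, here is a minimal sketch in C.
The names ucs2_char_at and utf8_char_ptr are hypothetical and are not part of
gunicode.h, and the UTF-8 walker assumes valid, NUL-terminated input.  With
fixed-width units, character i is just array slot i; with UTF-8 you have to
scan from the start of the buffer to find it.

```c
#include <stddef.h>
#include <stdint.h>

/* Fixed-width (UCS-2 style) buffer: character i lives in slot i,
 * so lookup is a single array access. */
static uint16_t ucs2_char_at(const uint16_t *buf, size_t i)
{
    return buf[i];
}

/* UTF-8 buffer: return a pointer to the first byte of character i
 * (0-based), or to the terminating NUL if the string is shorter.
 * Assumes valid, NUL-terminated UTF-8 (an assumption of this sketch). */
static const unsigned char *utf8_char_ptr(const unsigned char *s, size_t i)
{
    for (; *s != '\0'; s++) {
        if ((*s & 0xC0) == 0x80)
            continue;   /* continuation byte (10xxxxxx), not a character start */
        if (i == 0)
            return s;   /* reached the start of the requested character */
        i--;
    }
    return s;           /* ran off the end: points at the NUL */
}
```

	The first function costs the same no matter how large i is; the second
one costs time proportional to the number of bytes before character i.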

--Derek




