Re: Proper handling of unicode strings

From: Chris Vine <chris cvine freeserve co uk>
To: "Milosz Derezynski" <internalerror gmail com>
Cc: gtk-list gnome org
Subject: Re: Proper handling of unicode strings
Date: Tue, 8 Jul 2008 21:18:34 +0100

On Mon, 7 Jul 2008 12:01:36 +0200
"Milosz Derezynski" <internalerror gmail com> wrote:

> It's "safe" in the aforementioned sense, but if you want to properly
> count characters in the UTF-8 string, you should use g_utf8_strlen()
> instead.
> 
> 2008/7/7 LCID Fire <lcid-fire gmx net>:
> 
> > That's great - simplifies a lot of things. But since one character
> > might need more space than a gchar, is it safe to call strlen on
> > that string?

It is not just "safe" in the sense described above, but required if you
need to know the byte length (say to allocate storage on the heap).

If you need to know the byte length, use strlen().  If you need to know
the number of characters (which will be rare, unless you are thinking of
converting, say, to UCS-4), then use g_utf8_strlen().  If you want to
iterate over the string, then g_utf8_next_char() is handy.
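
To make the byte/character distinction concrete, here is a rough sketch
in plain C.  It deliberately avoids GLib so that it compiles on its own:
utf8_char_count() and utf8_copy() are hypothetical helper names invented
for this illustration, not GLib functions.  The counting loop assumes
valid, NUL-terminated UTF-8 and should give the same number that
g_utf8_strlen(s, -1) reports for such input; in real GTK+ code you would
call g_utf8_strlen() and step through the string with g_utf8_next_char()
instead.

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper (not part of GLib): count the characters (code
 * points) in a valid, NUL-terminated UTF-8 string.  Every byte that is
 * not a continuation byte (bit pattern 10xxxxxx) starts a new character. */
size_t utf8_char_count(const char *s)
{
    size_t count = 0;

    for (; *s != '\0'; s++) {
        if (((unsigned char)*s & 0xC0) != 0x80)
            count++;
    }
    return count;
}

/* Hypothetical helper: copy a UTF-8 string to the heap.  The buffer is
 * sized from strlen(), i.e. the byte length plus the terminating NUL,
 * never from the character count.  The caller frees the result. */
char *utf8_copy(const char *s)
{
    size_t bytes = strlen(s) + 1;
    char *copy = malloc(bytes);

    if (copy != NULL)
        memcpy(copy, s, bytes);
    return copy;
}
```

For the string "na\xc3\xafve" (the word "naive" spelled with an
i-diaeresis, which takes two bytes in UTF-8), strlen() gives 6 while
utf8_char_count() gives 5.  Sizing the allocation from the character
count would leave the buffer one byte short.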

Chris

References:
- Proper handling of unicode strings
  - From: LCID Fire
- Re: Proper handling of unicode strings
  - From: Milosz Derezynski
- Re: Proper handling of unicode strings
  - From: LCID Fire
- Re: Proper handling of unicode strings
  - From: Milosz Derezynski
