Re: utf8 bug ?

From: Jan Hudec <bulb ucw cz>
To: Christian Borup <borup borup com>
Cc: gtk-perl-list gnome org
Subject: Re: utf8 bug ?
Date: Sun, 5 Sep 2004 09:10:05 +0200

On Sun, Sep 05, 2004 at 00:39:25 +0200, Christian Borup wrote:

On Sat, 2004-09-04 at 23:48 +0200, Jan Hudec wrote:

This is really strange:

The code:

use utf8;
$s1 = 'string';
$s2 = 'Åtring';
print "\$s1 ($s1) is " . (utf8::is_utf8($s1) ? '' : 'NOT ') . "unicode\n";
print "\$s2 ($s2) is " . (utf8::is_utf8($s2) ? '' : 'NOT ') . "unicode\n";

Produces:

$s1 (string) is NOT unicode
$s2 (Åtring) is unicode


This isn't strange - that is exactly what should happen...


It depends on what the semantics of the utf8-mark is supposed to be. If
it's off, I'd expect the string to be in locale encoding. This one is in
both locale encoding and utf8, but it's origin suggests utf8 and not
locale should be prefered.

-------------------------------------------------------------------------------
                                                 Jan 'Bulb' Hudec <bulb ucw cz>

Attachment: signature.asc
Description: Digital signature

References:
- utf8 bug ?
  - From: Quentin
- Re: utf8 bug ?
  - From: muppet
- Re: utf8 bug ?
  - From: Quentin
- Re: utf8 bug ?
  - From: Christian Borup
- Re: utf8 bug ?
  - From: Quentin
- Re: utf8 bug ?
  - From: muppet
- Re: utf8 bug ?
  - From: Christian Borup
- Re: utf8 bug ?
  - From: Jan Hudec
- Re: utf8 bug ?
  - From: Christian Borup

[Date Prev][Date Next] [Thread Prev][Thread Next] [Thread Index] [Date Index] [Author Index]