Re: g_filename_to_uri() issue in glib-win32



On Wed, 2012-05-23 at 06:48 +0100, John Emmas wrote:
> But whatever that (second) character looked like, it's decimal value
> would always be 246 (because the UTF-8 sequence C3 B6 translates to
> decimal 246).
> 
> The URI translation of decimal 246 is %F6.

U+00F6 is the Unicode codepoint but URI percent encoding never directly
uses codepoints as you can encode only a single byte at a time and the
range of Unicode codepoints is much larger than that (up to U+10FFFF).
As Krzysztof already wrote, byte-wise encoding of UTF-8 strings is the
generally recommended way to encode URIs. See also the following links:

http://tools.ietf.org/html/rfc3987#section-6.4
http://www.w3.org/International/O-URL-code.html

Regards,
Jürg



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]