Re: [xml] encoding fails



On 5/25/06, Bjoern Hoehrmann <derhoermi gmx net> wrote:
* Aron Stansvik wrote:
>On 5/25/06, boss gregerhaga net <boss gregerhaga net> wrote:
>> <?xml version="1.0" encoding="ISO-8859-1"?>
>> <rss version="2.0">
>>    <channel>
>>       <title>Aftonbladet &#246;jesliv</title>
>>    </channel>
>> </rss>
>>
>> I try to extract the title element from the above. But the encoding is not
>> recognised. What i get is this:
>> Aftonbladet öjesliv
>
>What do you mean the encoding is not recognized? That looks like a
>perfectly valid result. &#246; is U+00F6 LATIN SMALL LETTER O WITH
>DIAERESIS.

This appears to be a defect in your mail user agent, the message you
reponded to was ISO-8859-1 encoded and had the o-umlaut encoded as two
octets (C3 B6, which is the proper UTF-8 sequence). The original problem
appears to the the usual "API gives UTF-8 but I expect something else".

Ah. Right. Using Gmail so it showed it just fine.

Aron



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]