Re: [xml] encoding fails



* Aron Stansvik wrote:
On 5/25/06, boss gregerhaga net <boss gregerhaga net> wrote:
<?xml version="1.0" encoding="ISO-8859-1"?>
<rss version="2.0">
   <channel>
      <title>Aftonbladet &#246;jesliv</title>
   </channel>
</rss>

I am trying to extract the title element from the above, but the encoding is
not recognised. What I get is this:
Aftonbladet öjesliv

What do you mean the encoding is not recognized? That looks like a
perfectly valid result. &#246; is U+00F6 LATIN SMALL LETTER O WITH
DIAERESIS.

This appears to be a defect in your mail user agent: the message you
responded to was ISO-8859-1 encoded and had the o-umlaut encoded as two
octets (C3 B6, which is the proper UTF-8 sequence). The original problem
appears to be the usual "API gives UTF-8 but I expect something else".
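
To illustrate the mismatch described above, here is a minimal sketch in Python using only the standard library. It is an analogy and not the libxml2 API: the variable names are made up for illustration, and it assumes the application takes the parsed title, serializes it as UTF-8, and then displays those bytes as if they were ISO-8859-1.

```python
import xml.etree.ElementTree as ET

# The feed from the original question. The character reference &#246;
# always means U+00F6, regardless of the declared document encoding.
doc = (
    b'<?xml version="1.0" encoding="ISO-8859-1"?>\n'
    b'<rss version="2.0"><channel>'
    b'<title>Aftonbladet &#246;jesliv</title>'
    b'</channel></rss>'
)

# After parsing, the title is a Unicode string containing U+00F6.
title = ET.fromstring(doc).findtext("channel/title")

# Serialized as UTF-8, the o-umlaut becomes the two octets C3 B6.
utf8_bytes = title.encode("utf-8")

# Assumed failure mode: the UTF-8 octets are interpreted as ISO-8859-1,
# so each octet is shown as a separate character.
misread = utf8_bytes.decode("iso-8859-1")

# If ISO-8859-1 output is what the application needs, it has to
# convert explicitly; the o-umlaut is then the single octet F6.
latin1_bytes = title.encode("iso-8859-1")

print(utf8_bytes.hex(" "))
print(latin1_bytes.hex(" "))
```

The point of the sketch is that the parsed value is correct in both cases; only the encoding assumed at output time differs, so the fix is an explicit conversion to whatever encoding the consumer expects.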
-- 
Björn Höhrmann · mailto:bjoern hoehrmann de · http://bjoern.hoehrmann.de
Weinh. Str. 22 · Telefon: +49(0)621/4309674 · http://www.bjoernsworld.de
68309 Mannheim · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/ 


