Re: [xml] XML Entities encoding question



* Fred wrote:
I apparently fail to understand something about how char encodings work at
the intersection of XML and Oracle.

If I send:

<?xml version="1.0" encoding="WINDOWS-1252"?>
<MSG>
...
<LAST_NAME>BOLA<C3><C9>OS</LAST_NAME>
...
</MSG>

the two accented characters are each transformed into 0xBF. (with exactly
the same result if it's 8859-1 instead of WINDOWS-1252.)

however, if I send:

<LAST_NAME>BOLA&#x00c3; &#x00c9;OS</LAST_NAME>

I get the desired result.

The interpretation of numeric character references, the latter case, is
independent of the character encoding scheme. That would explain why you
see a difference between the two cases, except that C3 in Windows-1252
is U+00C3, same as &#x00C3;, so I would suspect that you have analysed
your problem incorrectly.
-- 
Björn Höhrmann · mailto:bjoern hoehrmann de · http://bjoern.hoehrmann.de
Am Badedeich 7 · Telefon: +49(0)160/4415681 · http://www.bjoernsworld.de
25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/ 


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]