Re: [xml] XML Entities encoding question
- From: Bjoern Hoehrmann <derhoermi gmx net>
- To: Fred <fred fredex gmail com>
- Cc: xml gnome org
- Subject: Re: [xml] XML Entities encoding question
- Date: Wed, 07 Aug 2013 11:29:42 +0200
* Fred wrote:
I apparently fail to understand something about how char encodings work at
the intersection of XML and Oracle.
If I send:
<?xml version="1.0" encoding="WINDOWS-1252"?>
<MSG>
...
<LAST_NAME>BOLA<C3><C9>OS</LAST_NAME>
...
</MSG>
the two accented characters are each transformed into 0xBF (with exactly
the same result if it's ISO-8859-1 instead of WINDOWS-1252).
However, if I send:
<LAST_NAME>BOLA&#xC3;&#xC9;OS</LAST_NAME>
I get the desired result.
The interpretation of numeric character references, as in the latter case,
is independent of the character encoding scheme. That would explain why you
see a difference between the two cases, except that the byte C3 in
Windows-1252 is U+00C3, the same character as &#xC3;, so I suspect that you
have analysed your problem incorrectly.
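
To check this outside of Oracle, here is a minimal sketch using Python's standard xml.etree.ElementTree parser. It assumes the second message used hexadecimal numeric character references (&#xC3; and &#xC9;), and it trims the message down to the MSG and LAST_NAME elements; the helper name last_name is made up for illustration. If both documents parse to the same characters, the XML layer is probably not where the 0xBF comes from.

```python
import xml.etree.ElementTree as ET


def last_name(document: bytes) -> str:
    """Parse an XML document given as bytes and return the LAST_NAME text."""
    root = ET.fromstring(document)
    return root.findtext("LAST_NAME")


PROLOG = b'<?xml version="1.0" encoding="WINDOWS-1252"?>'

# Case 1: the two characters travel as the raw bytes 0xC3 and 0xC9,
# which the parser decodes using the declared encoding.
raw_bytes_doc = PROLOG + b"<MSG><LAST_NAME>BOLA\xc3\xc9OS</LAST_NAME></MSG>"

# Case 2: the same characters written as numeric character references
# (assumed hexadecimal form). These always name Unicode code points,
# whatever encoding the prolog declares.
char_ref_doc = PROLOG + b"<MSG><LAST_NAME>BOLA&#xC3;&#xC9;OS</LAST_NAME></MSG>"

raw_result = last_name(raw_bytes_doc)
ref_result = last_name(char_ref_doc)

# Code points of the two characters after "BOLA" in each case.
print([hex(ord(c)) for c in raw_result[4:6]])
print([hex(ord(c)) for c in ref_result[4:6]])
print(raw_result == ref_result)
```

If a conforming parser gives identical results for both inputs, the bytes are more likely being altered before the document reaches the parser or after the parsed value leaves it, for instance in a client or database character set conversion.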
--
Björn Höhrmann · mailto:bjoern hoehrmann de · http://bjoern.hoehrmann.de
Am Badedeich 7 · Telefon: +49(0)160/4415681 · http://www.bjoernsworld.de
25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/