[xml] Re: [xml] Re: [xml] ä,ö,ü in xml file



On Fri, Jan 17, 2003 at 03:31:18PM +0100, Holger Rauch wrote:
Hi!

On Fri, 17 Jan 2003, spinmar interfree it wrote:

So you have to convert your information from ISO-8859-1 to UTF-8 because the internal representation of 
libxml2 is UTF-8.

  This is a confusing statement since it does not specify the context.
The original mail was clearly about parsing, not tree operations. The
conversion is not required for parsing; it is required for tree operations.

As far as I understand, libxml2 takes care of the conversion automatically
in case the encoding attribute of the XML declaration contains one of the
encodings that are directly supported by libxml2.

  Right ... at parse time. 
  It's not an "attribute"; it is the encoding declaration inside the XML
declaration.
  If the encoding is not supported, parsing will simply fail, as required by
the XML spec. "directly" is wrong, since libxml2 may use the iconv library
to do the conversion for encodings which are not supported directly by
libxml2 code.

If I understand the docs
right, this should be the case for ISO-8859-1.

  Right at parse time. Wrong when editing the tree.
Whatever the encoding of the serialization, any modification or query of the
parsed tree must use UTF-8.
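
  To make the tree-editing case concrete, here is a minimal sketch of the
explicit conversion an application would need before handing an ISO-8859-1
string to a tree function. The helper name latin1_to_utf8 is hypothetical and
is not part of libxml2; it is written in plain C so it stands alone. libxml2's
encoding module has a helper of the same kind (isolat1ToUTF8), which would
normally be the better choice in real code.

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper (not a libxml2 function): convert a NUL-terminated
 * ISO-8859-1 string into a newly allocated, NUL-terminated UTF-8 string.
 * The caller releases the result with free(). Returns NULL if the
 * allocation fails.
 */
char *latin1_to_utf8(const char *latin1)
{
    size_t len = strlen(latin1);
    size_t i, j = 0;

    /* Worst case: every input byte expands to two output bytes. */
    unsigned char *out = malloc(2 * len + 1);
    if (out == NULL)
        return NULL;

    for (i = 0; i < len; i++) {
        unsigned char c = (unsigned char)latin1[i];
        if (c < 0x80) {
            /* ASCII range: identical in both encodings. */
            out[j++] = c;
        } else {
            /* 0x80..0xFF maps to U+0080..U+00FF, a two-byte UTF-8 sequence. */
            out[j++] = (unsigned char)(0xC0 | (c >> 6));
            out[j++] = (unsigned char)(0x80 | (c & 0x3F));
        }
    }
    out[j] = '\0';
    return (char *)out;
}
```

  With this, the ISO-8859-1 byte 0xE4 (ä) becomes the two UTF-8 bytes
0xC3 0xA4, and that converted string is what should be passed to the tree
API. The parser does the equivalent work internally at parse time, which is
why no such step is needed for parsing.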

Conclusion: There should be no need for explicit conversion in case
ISO-8859-1 is used.

  True for parsing only.

Daniel

-- 
Daniel Veillard      | Red Hat Network https://rhn.redhat.com/
veillard redhat com  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/


