Re: [xml] Usage of entities when writing documents without encoding



On Mon, 2005-04-25 at 12:59 +0200, Joachim Bauch wrote:
[...]
If I run this through xmllint (on Windows, binary version 2.6.19 from xmlsoft.org)
the result looks like this:

<?xml version="1.0"?>
<Root>
     <Node>Content &#xE4;&#xF6;&#xFC;</Node>
</Root>

As you can see, the utf-8 characters have been replaced by entities that specify
the corresponding latin-1 characters.

Actually they are numeric character references, not entities... I am not
trying to be needlessly pedantic, but rather trying to give you the
right term to help you search for information...

Numeric character references are always to Unicode codepoints,
regardless of the document's encoding.

Liam

-- 
Liam Quin, W3C XML Activity Lead, http://www.w3.org/People/Quin
Pictures from old books: http://www.holoweb.net/~liam/pictures/oldbooks/
IRC (chat) programs: www.ircreviews.org/clients/




[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]