[xml] xmllint --html problem?



Does the following sequence of commands indicate a problem in the HTML parsing of libxml or not?

# xmllint --version
xmllint: using libxml version 20409
# xmllint --html --encode UTF8 71.html >71.xml 2>/dev/null
# xmllint --noout  71.xml
71.xml:53: error: Input is not proper UTF-8, indicate encoding !
ophy of Education, The</a><br/>Edited by Michael A. Peters (New Zealand)Ã? &amp
^
1.xml:53: error: Bytes: 0xC3 0x20 0x50 0x61
ophy of Education, The</a><br/>Edited by Michael A. Peters (New Zealand)Ã? &amp


File "71.html" available on request: it's about 53K which I thought would be too large to send to the list right away...



Elizabeth Mattijsen




[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]