[xml] HTMLparser: UTF-8 byte order mark
- From: Michael Day <mikeday yeslogic com>
- To: xml gnome org
- Subject: [xml] HTMLparser: UTF-8 byte order mark
- Date: Thu, 29 Dec 2005 15:12:48 +1100 (EST)
Hi,
The HTMLparser chokes on HTML files that begin with a UTF-8 byte order
mark (the bytes EF BB BF). This is unfortunate, because files edited with
Notepad easily end up with a byte order mark at the start when saved with
UTF-8 encoding.
Any tips on what would be the best way to handle this in HTMLparser.c?
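In the meantime, one possible workaround on the calling side is to strip
the mark before the buffer reaches the parser. The sketch below is only an
illustration: skip_utf8_bom() is a hypothetical helper, not a libxml2
function, and it assumes the whole file has already been read into memory.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not part of libxml2): returns a pointer just past a
 * leading UTF-8 byte order mark (EF BB BF) if one is present, otherwise the
 * original pointer. The length of the remaining data is stored in *out_len. */
static const char *skip_utf8_bom(const char *buf, size_t len, size_t *out_len)
{
    static const unsigned char bom[3] = { 0xEF, 0xBB, 0xBF };

    /* Only skip when all three BOM bytes are present at the very start. */
    if (len >= 3 && memcmp(buf, bom, 3) == 0) {
        buf += 3;
        len -= 3;
    }
    *out_len = len;
    return buf;
}
```

The returned pointer and length could then be passed to htmlReadMemory(),
probably with "UTF-8" given as the encoding, since the mark itself implies
it. That works around the problem without touching HTMLparser.c.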
Best regards,
Michael
--
Print XML with Prince!
http://www.princexml.com