[xml] HTMLparser: UTF-8 byte order mark




Hi,

The HTMLparser chokes on HTML files that begin with a UTF-8 byte order
mark. This is unfortunate, as files edited with Notepad can easily end up
with a byte order mark at the start if saved with UTF-8 encoding.

Any tips on what would be the best way to handle this in HTMLparser.c?

Best regards,

Michael

-- 
Print XML with Prince!
http://www.princexml.com



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]