[xml] HTMLparser enhancements

As recently discussed, I attach a proposed patch.  At Daniel's suggestion,
I've added knowledge of REQUIRED attributes to my original proposal.

This is largely cut-and-paste from libhtnorm, which stores the HTML info
as a hash.  It's also untested, and likely to contain bugs.  If you're
happy with it in principle, I'll take a bit more time to check it and
plug it into some real-life software.

The attachments are diff -u output against 2.4.30.


Nick Kew

