Commandline tool html2pdf



I have scraped the Internet (a lot) looking for a tool that fits my needs.
So far the things I've come up with are programs like xhtml2pdf, htmldoc
and html2ps.
Of these html2ps (latest update from 2005) still seems to be the best choice.
Sadly, it's a gross hack as a Perl script that doesn't really parse anything,
it just transforms the text using appropriate sloppy regexp scanners, and
then uses things like TeX to actually generate part of the output.

It's not even that bad (I guess that's due to TeX's amazing capabilities), but
it lacks proper CSS support.  Further extending this hack to actually to
start support CSS seems to be the wrong avenue.

So I tried approaching it from a different angle, and started looking for
as simple as possible webbrowsers, hoping to find one which has a commandline
option to actually load an HTML page and subsequently render it into
multipage Postscript or PDF without ever opening up a GUI.

Suffice it to say, that doesn't seem to exist.
Now the question is, can epiphany be coerced into performing this task?
I can't imagine I'm the first person to actually have this problem.
What would it take to support something like this?
I'm willing to contribute code, if I know it is likely to be accepted.
-- 
Sincerely,
           Stephen R. van den Berg.
"Technology is stuff that doesn't work yet."   --  Bran Ferren
"We no longer think of chairs as technology."  --  Douglas Adams


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]