Re: New Indexer



> 3) Infrastructure for bayesian relevance processing, but I haven't coded
> all of that yet.

You could also try wrapping ots (Open Text Summarizer). AbiWord uses it
to summarize text, and you could hack it to report how important it
finds each block of text (possibly as XML markup).

Right now, at least, you can try it out on the command line by running
ots -k doc.txt, and it should output the keywords it thinks are important.
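
If the indexer wants to call that from code rather than by hand, a rough
sketch of a wrapper might look like the following. It assumes ots is on
the PATH and that the keywords come back on stdout separated by commas
or newlines; I haven't checked the exact output format, so the parsing
is a guess. The names parse_keywords and ots_keywords are made up for
illustration.

```python
import re
import subprocess


def parse_keywords(raw):
    """Split raw keyword text on commas/newlines and drop empty entries.

    The separator choice is an assumption about what `ots -k` prints.
    """
    return [w.strip() for w in re.split(r"[,\n]+", raw) if w.strip()]


def ots_keywords(path, run=subprocess.run):
    """Run `ots -k <path>` and return the keywords as a list of strings.

    `run` is injectable so the parsing can be exercised without ots installed.
    """
    result = run(["ots", "-k", path], capture_output=True, text=True, check=True)
    return parse_keywords(result.stdout)
```

Calling ots_keywords("doc.txt") would then give the indexer a list it
can weight however it likes, and the subprocess call can be swapped out
later if ots gets hacked to emit per-block scores.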

A Bayesian processor would probably be better, but this might provide a
decent short-term solution.


-- Matt Jones



