Re: what files does beagle index?



> Its hard to extract data from binary files.

How about "strings" them first?

Please! Strings will output all kind of junk. Probably not even meant
to be found. I am in favour of indexing any data worthy to be indexed
and meant to be indexed. If anyone wants to write a filter for some db
file, he/she has to find/write a filter for it which will traverse its
tables and find columns with string data and extract them.

OTOH, if you want to index db files and all you care is the output of
'strings', check the last paragraph of
http://beagle-project.org/Supported_Filetypes
There is something called ExternalFilters in beagle, which allow users
to write custom command based filters. Might be just what you want.

> Similar to the way you examine Google's index and see what webpages
> are in the index :)

Maybe beagle does something similar but if not, i think my touche is
coming...

http://www.google.ca/search?hl=en&q=site%
3Abeagle-project.org&btnG=Google+Search&meta=

haha :-) That only gives you the number of web-pages. There is no way
you (as a web user or a google api user) can find out the URLs of the
pages. (Hint: google doesnt allow you go past 1000 in search results)

Yeah, cool!  So a manual search does indicate that a given file I am
thinking of is indeed in a dot-dir.  :-(  And it is a .db file which
file says:

$ file .icq.old/history/6000006.db
.icq.old/history/6000006.db: GNU dbm 1.x or ndbm database, little endian

$ strings .icq.old/history/6000006.db
b ssage throug
rver
Hi! I found them!
URL: http://artistic.device.sh

(Apart from being spam) Obviously useful information in those files,
even filtered through "strings".  Aside from being in a dot-dir it would
be nice that Beagle could give me this.

Read above. External Filter is what you need. Should be easy. Good luck!

- dBera

--
-----------------------------------------------------
Debajyoti Bera @ http://dbera.blogspot.com
beagle / KDE fan
Mandriva / Inspiron-1100 user



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]