[Tracker] Indexing mbox files



Hi,

I am in the process of writing an extractor for mbox files. My purpose
is to use Tracker as an internal search engine for another application,
and this application stores conversations in mbox files with specific
headers I need to parse.

Additionally, it could be used to successfully search within e-mails
when you don't use evolution.


And, I have the following problems, I thought you guys could help me.


First (but this isn't important at all), libmagic report mbox files as
text/plain instead of application/mbox. So I have to make my extractor
for the two mime types and filter afterwards.


Then, I started looking at ontologies, and I had a hard time figuring
everything out. I want to register both the mailbox and the messages
within it.

I started out reading the NMO (Nepomuk Message Ontology) to find that
there ware no relations between a nmo:Mailbox and a nmo:Email. I had to
look at the NIE (Nepomuk Information Element) to construct the relations
between my ontologies.

So, if I understood everything well, I should have something like:


nmo:Mailbox -> nie:hasPart -> nmo:MailboxDataObject -> nie:interpretedAs
-> nmo:Email

and of course

nmo:Email -> nie:isStoredAs -> nmo:MailboxDataObject -> nie:isPartOf ->
nmo:Mailbox

It seems rather complex to me and I wanted to know if there was a
simpler way to doing things.

Thanks,


Mildred


-- 
Mildred Ki'Lya
ââââââââââ ïïïïïïïïïïïïïïïïïïïï ââââââââââ
â Jabber, GoogleTalk: <mildred jabber fr>
â Website: <http://ki.lya.online.fr>           GPG ID: 9A7D 2E2B
â Fingerprint: 197C A7E6 645B 4299 6D37 684B 6F9D A8D6 9A7D 2E2B

Attachment: 0x9A7D2E2B.asc
Description: application/pgp-keys

Attachment: signature.asc
Description: OpenPGP digital signature



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]