Re: [Tracker] Tracker to do list



Le jeudi 07 septembre 2006 Ã 12:48 +0100, Jamie McCracken a Ãcrit :
Im posting some to do items in case any of you lot have some spare time 
and want to use it hacking on tracker and help speed up development :)

...

C programming:

To pave the way for email indexing we will need  mail/mbox handling 
utilities.

Suggest use GMime
more info at http://spruce.sourceforge.net/gmime/ and tutorial at 
http://spruce.sourceforge.net/gmime/tutorial/

We will need utility functions to :


1) parse entire mbox file - extracting message ID and all other fields 
into a GHashTable.

2) As (1) but parse only new mails (given a file offset of the last 
known email). All new mails are always appended to an mbox file.

3) work out whether a mail is marked as deleted or junk (evo and 
thunderbird use different flags in the email headers to determine this - 
google for the exact flags)

4) Extract plain text (we have an html filter in tracker already for html)

5) extract and decode mime attachments

All the above should be easy to implement using GMime.

Hum, it seems interesting. I would like to take a look at that. :-)


But before, I will continue to read and clean code.

I wonder whether the use of strlen() on UTF-8 is correct, it
shouldn't... If I remember correctly, unicode can use arrays filled that
way:
'\0' 'H' '\0' 'E' '\0' 'L' '\0' L '\0' 'O'      ("HELLO")
where a '\0' can be replaced by a value to stock characters on 2 bytes.
But I don't remember if it happens with UTF-8. I'll have to check what
happen with strlen() and funky characters.
strlen() counts number of characters that precede a null byte.
In Glib, there are functions like: g_utf8_strlen(), g_utf8_strncpy(),
g_utf8_strchr(), etc.


Laurent.



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]