Re: [Evolution] Recognizing Arbitrary Message Quotations (was Re: [Evolution]
- From: Dave Euser <dave_euser yahoo com>
- To: Evolution Mailing List <evolution ximian com>
- Subject: Re: [Evolution] Recognizing Arbitrary Message Quotations (was Re: [Evolution]
- Date: 15 Nov 2001 16:53:42 -0500
Just a few thoughts on this:
You could assume that any characters marking a quotation will be non
alphanumeric (makes some things simpler).
But this brings up other issues:
1) how to recognize a single line being quoted (ie the original
message has only one line). I would assume that you would have to have
2+ lines to make a valid assumption that the character(s) you're
looking at are a quotation marker, and not actually part of the text
2) what happens if the original message has some kind of list:
eg - item 1
- item 2
- item 3
This could easily be interpreted as a quotation marker. How to
distinguish?
Definitely not a simple problem....it might come to the point of
a) not doing anything at all....leave things as-is
b) make certain assumptions, causing some mis-quoted messages
c) create a list of standard quotation markers and work from that list
On Thu, 2001-11-15 at 14:36, Brett Johnson wrote:
On Thu, 2001-11-15 at 11:50, NotZed wrote:
Or, we could just use '> ' like most everyone else, and the problem
isn't there.
?? I don't see how this makes the problem go away. Just because
Evolution decides to use '> ' for quoting doesn't mean that every other
mail client in the world will suddenly fall into line. So, we still
need a way to recognize quoted text that doesn't conform to the '> '
convention.
Jim Meyer wrote:
It appears that an poorly worded query on my part spawned an interesting
topic: how can we recognize arbitrarily quoted messages for purposes of
colorization, requoting, and rewrapping?
Personally, I'm partial to the regexp solution (this shouldn't be a
surprise ;o). Simply use a regular expression to describe the various
ways that a string can be quoted. Every line that matches said regular
expression is considered to be an attribution. This works very well in
my experience (using Balsa and VM under XEmacs). It's even a fairly
simple process to recognize different levels of quotation (simply remove
the first part of the line that matched the regexp, then try the match
again - recurse until it doesn't match anymore).
It feels like a potentially worthwhile approach is to spend some time
trying to recognize attribution strings; if we could learn to do that,
we can infer that the next line is quoted somehow, and start to unravel
which quoting strings relate to which attributions.
Wow, that's ambitious. Seems like it could easily be broken as well. Take
this message for example: NotZed's reply to your message didn't contain an
attribution at all, so I added one. How would you distinguish between
NotZed's comments and your own, if I hadn't added the attribution? For
that matter, how do you distinguish between them now?
Ideally, we could completely unroll any number of arbitrarily quoted
messages, which would allow us to then requote or rewrap tidily.
However, this is no more easy than the first problem, and perhaps
considerably harder. I'm thinking about this a bit in my spare moments,
but to paraphrase ESR, many eyes make light work. ;]
Cheers!
--
Brett Johnson <brett_johnson hp com>
- i n v e n t -
_______________________________________________
evolution maillist - evolution ximian com
http://lists.ximian.com/mailman/listinfo/evolution
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]