Re: [Tracker] [PATCH] Improve oasis extractor to handle embedded tabs and line breaks



On 09/07/14 19:26, Karl Relton wrote:
> The following patch improves the oasis extractor on odt documents so
> that it keeps extracting plain text content even when there are embedded
> tab and line-break xml tags. Without this patch the extractor stops when
> such a tag is encountered, and resumes typically at the next paragraph
> or style/format change. This means extractable text is missed.

Thank you for the patch, Karl. I've just committed¹ it to master! :)
We appreciate the work, and if you have any other patches you want to submit, we welcome them!

¹ https://git.gnome.org/browse/tracker/commit/?id=77994c3397c576ad468a6be0d2925727689e1932
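
For anyone reading this thread without the patch at hand, here is a rough Python sketch of the behaviour described in the quoted summary: text extraction that carries on across embedded tab and line-break tags instead of stopping at them. This is not the Tracker code (the real extractor is written in C), and the helper names, the `<doc>` wrapper element and the sample content are all made up for illustration. Only the ODF text namespace and the `text:tab`, `text:line-break`, `text:p` and `text:span` element names come from the OpenDocument format itself.

```python
import xml.etree.ElementTree as ET

# ODF "text" namespace; the element names below are standard OpenDocument.
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"
TAB = f"{{{TEXT_NS}}}tab"
LINE_BREAK = f"{{{TEXT_NS}}}line-break"
PARAGRAPH = f"{{{TEXT_NS}}}p"


def paragraph_text(elem):
    """Hypothetical helper: gather all text under elem.

    Tab and line-break elements are mapped to whitespace, and the text
    that follows them (the element's tail) is kept rather than dropped.
    """
    parts = [elem.text or ""]
    for child in elem:
        if child.tag == TAB:
            parts.append("\t")
        elif child.tag == LINE_BREAK:
            parts.append("\n")
        else:
            # Any other child (for example a styled span): recurse into it.
            parts.append(paragraph_text(child))
        # Text after the child's closing tag still belongs to this element.
        parts.append(child.tail or "")
    return "".join(parts)


def extract_text(xml_source):
    """Hypothetical helper: return one string per text:p paragraph."""
    root = ET.fromstring(xml_source)
    return [paragraph_text(p) for p in root.iter(PARAGRAPH)]


# Made-up sample; a real content.xml has far more structure around it.
SAMPLE = (
    f'<doc xmlns:text="{TEXT_NS}">'
    "<text:p>Name<text:tab/>Value<text:line-break/>next line</text:p>"
    "<text:p>Second <text:span>styled</text:span> paragraph</text:p>"
    "</doc>"
)

if __name__ == "__main__":
    for line in extract_text(SAMPLE):
        print(repr(line))
```

The point of the sketch is the handling of each child's tail text: an extractor that ignores what follows a `text:tab` or `text:line-break` would lose "Value" and "next line" in the first sample paragraph, which is the kind of missed text the patch summary describes.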

--
Regards,
Martyn

Founder & Director @ Lanedo GmbH.
http://www.linkedin.com/in/martynrussell

