Re: [Tracker] Text extraction on text formats



Le vendredi 17 novembre 2006 Ã 15:00 +0100, Luca Ferretti a Ãcrit :
Il giorno gio, 16/11/2006 alle 22.53 +0100, Laurent Aguerreche ha
scritto:
-----------------
Questo ?? un semplice esempio delle potenzialit?? di OO.o

         ^^ it was "Ã"                          ^^ it was Ã


A question: what is encoding of this string? UTF8, ISO-something,
Win-something, etc.? Now, I can see that with libGSF and a RTF file:

Dunno. I simply wrote in OO.o (Italian locale). It should be UTF-8. 

But are you asking about the original ODT or the exported RTF?

Both... But I saw the same problem on some of my DOC files with
different encodings.

Doc.Comment="Questo file altro non \303\250 che un esempio di modello di
file per OO.o Writer per testare l'indicizzazione di Tracker";

\303\250 was "Ã" (Ã in Italian === is in English)

Ok, but I would like to directly have a UTF8 string and not escaped
characters.  :-/


Laurent.

Attachment: signature.asc
Description: Ceci est une partie de message =?ISO-8859-1?Q?num=E9riquement?= =?ISO-8859-1?Q?_sign=E9e?=



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]