Re: [Tracker] C file not detected as nfo:SourceCode



-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi Ralph,

That's probably because there's no extractor for source code yet.

pvanhoof lenny:~/repos/gnome/tracker/src/tracker-extract$ grep SourceCode *
pvanhoof lenny:~/repos/gnome/tracker/src/tracker-extract$

It should not be very hard to develop one. Good luck.
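
As a rough illustration of the decision such an extractor would have to make, here is a minimal Python sketch. It is not Tracker's extractor API (real extract modules are written in C), and the extension table and the function name `describe_source_file` are hypothetical. It only shows the idea: recognise a source file and describe it as `nfo:SourceCode` with `nfo:programmingLanguage` (both terms come from the NFO ontology) instead of as a plain text document.

```python
import os

# Hypothetical extension table for illustration only; a real extractor
# would more likely be selected by MIME type through an extract rule.
LANGUAGE_BY_EXTENSION = {
    ".c": "C",
    ".h": "C",
    ".py": "Python",
    ".java": "Java",
}


def describe_source_file(path):
    """Return a SPARQL-style property list for a recognised source file.

    Returns None when the extension is unknown, in which case the file
    would still fall through to the generic text handling.
    """
    ext = os.path.splitext(path)[1].lower()
    language = LANGUAGE_BY_EXTENSION.get(ext)
    if language is None:
        return None
    return f'a nfo:SourceCode ; nfo:programmingLanguage "{language}"'


print(describe_source_file("/Volumes/spotlight/text/server.c"))
# prints: a nfo:SourceCode ; nfo:programmingLanguage "C"
```

Keying on the file extension keeps the sketch short; whether extension or MIME type is the better trigger inside Tracker is left open here.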

Kind regards,

Philip

Ralph Böhme wrote on 11/05/2014 9:48:
Hi all,

I noticed a sample C source code file not being detected as
'nfo:SourceCode':

# file /Volumes/spotlight/text/server.c 
/Volumes/spotlight/text/server.c: C source, ASCII text

# tracker-info /Volumes/spotlight/text/server.c | grep rdf:type
'rdf:type' = 'http://www.w3.org/2000/01/rdf-schema#Resource'
'rdf:type' = 'http://www.semanticdesktop.org/ontologies/2007/01/19/nie#DataObject'
'rdf:type' = 'http://www.semanticdesktop.org/ontologies/2007/01/19/nie#InformationElement'
'rdf:type' = 'http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#Document'
'rdf:type' = 'http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#FileDataObject'
'rdf:type' = 'http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#TextDocument'
'rdf:type' = 'http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#PlainTextDocument'
#

The file in question is source3/smbd/server.c from the Samba source
tree. :)
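
The check above can also be scripted. The following is a minimal Python sketch, assuming the `tracker-info` output has the `'rdf:type' = '<uri>'` shape shown above; the function names `rdf_types` and `is_source_code` are made up for this example. The pattern tolerates a value that wrapped onto the next line.

```python
import re

# Matches "'rdf:type' = '<uri>'", also when the URI wrapped onto the next line.
RDF_TYPE_RE = re.compile(r"'rdf:type'\s*=\s*'([^']+)'")


def rdf_types(tracker_info_output: str) -> list[str]:
    """Return the local name (the part after '#') of every rdf:type value."""
    uris = RDF_TYPE_RE.findall(tracker_info_output)
    return [uri.rsplit("#", 1)[-1] for uri in uris]


def is_source_code(tracker_info_output: str) -> bool:
    """True if the resource was classified as nfo:SourceCode."""
    return "SourceCode" in rdf_types(tracker_info_output)


sample = """
'rdf:type' = 'http://www.w3.org/2000/01/rdf-schema#Resource'
'rdf:type' =
'http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#PlainTextDocument'
"""

print(rdf_types(sample))       # ['Resource', 'PlainTextDocument']
print(is_source_code(sample))  # False
```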

# /usr/libexec/tracker-extract -v 3 -f /Volumes/spotlight/text/server.c
Locale 'TRACKER_LOCALE_LANGUAGE' was set to 'en_US.UTF-8'
Locale 'TRACKER_LOCALE_TIME' was set to 'en_US.UTF-8'
Locale 'TRACKER_LOCALE_COLLATE' was set to 'en_US.UTF-8'
Locale 'TRACKER_LOCALE_NUMERIC' was set to 'en_US.UTF-8'
Locale 'TRACKER_LOCALE_MONETARY' was set to 'en_US.UTF-8'
Initializing Storage...
Mount monitors set up for to watch for added, removed and pre-unmounts...
No mounts found to iterate
Setting priority nice level to 19
Loading extractor rules... (/usr/share/tracker/extract-rules)
Loaded rule '10-abw.rule'
Loaded rule '10-dvi.rule'
Loaded rule '10-epub.rule'
Loaded rule '10-gif.rule'
Loaded rule '10-html.rule'
Loaded rule '10-ico.rule'
Loaded rule '10-jpeg.rule'
Loaded rule '10-mp3.rule'
Loaded rule '10-msoffice.rule'
Loaded rule '10-oasis.rule'
Loaded rule '10-pdf.rule'
Loaded rule '10-png.rule'
Loaded rule '10-ps.rule'
Loaded rule '10-svg.rule'
Loaded rule '10-tiff.rule'
Loaded rule '10-xmp.rule'
Loaded rule '10-xps.rule'
Loaded rule '11-iso.rule'
Loaded rule '11-msoffice-xml.rule'
Loaded rule '15-gstreamer-guess.rule'
Loaded rule '15-playlist.rule'
Loaded rule '90-gstreamer-generic.rule'
Loaded rule '90-text-generic.rule'
Extractor rules loaded
Setting memory limitations: total is 1.0 GB, minimum is 256 MB, recommended is ~1 GB
Virtual/Heap set to 805.3 MB (50% of total or MAXLONG)
Guessing mime type as '(null)'
Extracting...
Using /usr/lib64/tracker-0.16/extract-modules/libextract-text.so...
Starting to read 'file:///Volumes/spotlight/text/server.c' up to 1048576 bytes...
Read 41858 bytes from file, 1006718 bytes remaining until configured threshold is reached
Done (2 items)
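
The three facts in this log that matter for the question are which rules were loaded, what MIME type was guessed, and which module handled the file (here `libextract-text.so`, with the MIME type guessed as '(null)'). A minimal Python sketch for pulling them out of such a log follows; the function name `summarize_extract_log` and the dictionary keys are made up for this example, and the patterns assume only the message wording visible above. They allow arbitrary whitespace between words, so a line-wrapped log works too.

```python
import re


def summarize_extract_log(log: str) -> dict:
    """Extract loaded rules, guessed MIME type and extractor module from a log."""
    rules = re.findall(r"Loaded\s+rule\s+'([^']+)'", log)
    mime = re.search(r"Guessing\s+mime\s+type\s+as\s+'([^']*)'", log)
    module = re.search(r"Using\s+(\S+\.so)", log)
    return {
        "rules": rules,
        "mime_type": mime.group(1) if mime else None,
        "module": module.group(1) if module else None,
    }


log = """Loaded rule '10-xps.rule' Loaded rule
'90-text-generic.rule' Extractor rules loaded Guessing
mime type as '(null)' Extracting... Using
/usr/lib64/tracker-0.16/extract-modules/libextract-text.so...
"""

summary = summarize_extract_log(log)
print(summary["rules"])      # ['10-xps.rule', '90-text-generic.rule']
print(summary["mime_type"])  # (null)
print(summary["module"])     # /usr/lib64/tracker-0.16/extract-modules/libextract-text.so
```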

SPARQL pre-update: -- --

SPARQL item: -- a nfo:PlainTextDocument ; nie:plainTextContent
"/*\n ...

Any idea what may be missing?

-Ralph
_______________________________________________
tracker-list mailing list
tracker-list gnome org
https://mail.gnome.org/mailman/listinfo/tracker-list


-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.20 (MingW32)
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/

iQEcBAEBAgAGBQJTb18YAAoJEEP2NSGEz4aDZugIAI7+fltepV/+Uym3oL4EngQF
9AlOl0tkhnyIIBM2mgYGG3IK79WAlymAQkw7moA7kSNAYKkgYvvYM9a9SpYNWLSa
9k1wFT/wc16XUTXiH/+CyS68oCkB0WetvoCDebi7vP4DbhQeNE5lrpqQwrb76vmz
RAxj3IZ1JaxRqExMRoUtA5PL3piRTpqLDaXEON7axZo6YppwvMD7xl8f7KC6Ww2U
Ct3+61SSZwsE9HZbxTiqiz+mMwiUnzVmK5GVJPJ1yL5ZVGFZC9ApWaJ7QOCX1iMT
OOwcuYMG5DLz/MHs4j+4pKNYj1KlxFoJhcFMY3akMNY+IEItf345Kx5MXj46q0k=
=7vjA
-----END PGP SIGNATURE-----

