Re: [Tracker] tracker 0.10: could not process <file>, creating minimal info [WAS: some files aren't indexed]





On 14 October 2011 10:55, Ivan Frade <ivan frade gmail com> wrote:

On Fri, Oct 14, 2011 at 10:56 AM, Mildred Ki'Lya

> Do you have an idea how I could get a list of those files which couldn't be
> harvested but only had minimal info created for them? Currently, I'm only
> looking at a certain type of files and I am looking for specific metadata.
> But I can't do that for any kind of file reliably.

Maybe this query can help you:
$ tracker-sparql -q "SELECT ?u WHERE { ?u a nfo:FileDataObject. FILTER
(NOT EXISTS { ?u a nie:InformationElement. })}"

It gives you everything that is a File but doesn't have any
interpretation (the extractor didn't tell us what it is).

Well, this doesn't help. There is no such files.

Here is the output of tracker-info which have only basic metadata because the extractor failed:

 Querying information for entity:'file:///home/tracker/gnatbugs/62/6225-004/comment'
  'urn:uuid:8fc0f308-a9ac-481e-1cde-0e56f482fc13'
Results:
  'http://purl.org/dc/elements/1.1/date' = '2009-03-29T04:09:34Z'
  'http://purl.org/dc/elements/1.1/date' = '2011-09-06T17:01:24Z'
  'http://purl.org/dc/elements/1.1/source' = 'urn:nepomuk:datasource:9291a450-1d49-11de-8c30-0800200c9a66'
  'tracker:added' = '2011-09-21T01:17:55Z'
  'tracker:modified' = '74744'
  'rdf:type' = 'http://www.w3.org/2000/01/rdf-schema#Resource'
  'rdf:type' = 'http://www.semanticdesktop.org/ontologies/2007/01/19/nie#DataObject'
  'rdf:type' = 'http://www.semanticdesktop.org/ontologies/2007/01/19/nie#InformationElement'
  'rdf:type' = 'http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#FileDataObject'
  'nie:byteSize' = '3800'
  'nie:dataSource' = 'urn:nepomuk:datasource:9291a450-1d49-11de-8c30-0800200c9a66'
  'nie:isPartOf' = 'urn:uuid:a41a1e3b-b09f-d7e3-c2f0-f3a47b49c453'
  'nie:url' = 'file:///home/tracker/gnatbugs/62/6225-004/comment'
  'nfo:belongsToContainer' = 'urn:uuid:a41a1e3b-b09f-d7e3-c2f0-f3a47b49c453'
  'tracker:available' = 'true'
  'nie:isStoredAs' = 'urn:uuid:8fc0f308-a9ac-481e-1cde-0e56f482fc13'
  'nie:mimeType' = 'text/plain'
  'nfo:fileLastAccessed' = '2011-09-06T17:01:24Z'
  'nfo:fileLastModified' = '2009-03-29T04:09:34Z'
  'nfo:fileName' = 'comment'
  'nfo:fileSize' = '3800'

It is indeed a nfo:FileDataObject

My current command only check for some files that I know to have the property nie:hasPart. it's what I am checking, but that wouldn't do for any kind of files.

I would really like a tracker:error field so I could get the list of those files easily

Mildred





[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]