Re: [Tracker] Tracker used on a webserver to index uploaded documents

From: jamie <jamiemcc blueyonder co uk>
To: raphael slinckx net
Cc: tracker-list gnome org
Subject: Re: [Tracker] Tracker used on a webserver to index uploaded documents
Date: Wed, 25 Apr 2007 20:33:25 +0100

On Wed, 2007-04-25 at 17:16 +0200, RaphaÃl Slinckx wrote:

Hi !

(I'm not subscribed, so if you could please keep me CC-ed)

I was wondering if it makes sense to use tracker on a webserver to
index documents (pdf, ppt, doc, images,..) uploaded by users. The
web-app could then have a search box and query tracker to return the
results in the webpage.


I dont see why not - a web service could be built that interfaces to
tracker

This raises some questions:
* Can we use dbus in conjunction with a web-app. I'm using the
turbogears framework in python, i guess i can use dbus without glib
mainloop if i don't need to listen to signals


I would build a web service for this (in python if you like). I think
the python bindings dont require glib mainloop? If not you should be ok
(assuming python has its own mainloop)

* Do the results get back quickly, since i guess i'll have to make
sync-calls ?


just do async. If you dont have a mainloop then spawn tracker-search and
collect the results from stdout

* Is this just a crazy idea, and i should instead use the extraction
libraries directly, and..
* What would i gain by using tracker instead of the extraction
libraries directly (beside the advantage of having a ready-made
solution) 
* What about security ? feeding tracker with more or less random
queries from the web could be dangerous ?


all string values in sql are quoted so we should be immune to sql
injection

If your web service is running as root then trackerd will need tor un as
root (as its uses the dbus session bus). I dont know about the security
aspects of running trackerd as root (we always make use of the safer
glib functions in the code so fingers crossed it should be ok but I
would not like to guarantee that)

References:
- [Tracker] Tracker used on a webserver to index uploaded documents
  - From: =?UTF-8?Q?Rapha=C3=ABl_Slinckx?=

[Date Prev][Date Next] [Thread Prev][Thread Next] [Thread Index] [Date Index] [Author Index]