[Tagdb] Tags and data storage
Nitin Borwankar
nitin at borwankar.com
Thu Mar 23 06:01:37 GMT 2006
OK, so this is clearly quite powerful.
Erik, Otis, thanks very much for the mini tutorial.
Nitin
Erik Hatcher wrote:
>
> On Mar 22, 2006, at 7:29 PM, Nitin Borwankar wrote:
>
>> a) How does a Lucene based app perform at the lower ends of the
>> scale - is there an overhead and a threshold above which Lucene makes
>> sense ?
>
>
> Lucene works well at all scales, actually. Certainly at the higher
> ends of scalability more sophisticated management needs to be
> considered such as distributed index servers, etc.
>
> I think even at lower scales, Lucene makes great sense because of its
> querying capability. I'm no SQL expert, but formulating a query such
> as "show me all objects tagged with _foo_, _bar_, but not _baz_....
> OR _baz_ and not _bar_" is likely to be tricky, but with an inverted
> index such as Lucene it is trivial. Maybe that particular query is
> actually not too bad in SQL, but I'm also combining tag queries with
> full-text searches such as "this phrase" in the body of a document
> that has been tagged, along with other filters (for date, genre, etc).
>
>> b) How do I hook up my web app to a Lucene-tag-backend when my web
>> app is not written in Java ?
>> c) Are there commonly used JSON/XML-RPC etc. wrappers around the
>> backend so I can call it from Python/PHP/Ruby ?
>
>
> Lucene by itself is just a JAR file, and by itself there is no
> "server" as such. However, many projects have built web services
> around Lucene. The most interesting of these is the newly donated
> Solr project:
>
> http://incubator.apache.org/solr/
>
> I'm starting to prototype with it to replace my current tag system
> for my University research project (tagging and annotating library
> archives). My current system is partly in Kowari, and partly my own
> custom Lucene search server which is a very rudimentary version of
> the more sophisticated faceted capabilities that Solr provides. My
> front-end is in Ruby on Rails, using SOAP and XML-RPC respectively.
> Solr uses HTTP GET/POST, and it is driving CNETs faceted search
> system, where it is deployed in a distributed fashion.
>
> Erik
>
> _______________________________________________
> Tagdb mailing list
> Tagdb at lists.tagschema.com
> http://lists.tagschema.com/mailman/listinfo/tagdb
More information about the Tagdb
mailing list