[Tagdb] RDBMS, Lucene or both?

Erik Hatcher esh6h at virginia.edu
Tue Feb 6 11:27:10 GMT 2007


On Feb 5, 2007, at 10:46 PM, Ace Jayz wrote:
> I've been grappling with a design of a bookmarking/tagging system  
> and I'm leaning towards storing the tag data in an RDBMS so that I  
> can answer fairly complex relational queries efficiently (will  
> probably use a 3-table schema similar to that of Toxi), but I'm  
> thinking about storing some data in a Lucene index for efficient  
> free text searching.  I looked back in the list archives and  
> noticed that a system based on a combination of a DB and Lucene has  
> been suggested here before.  Has anyone on the list implemented  
> such a system?  If so, care to share your experiences?  I've read  
> some posts from Otis G. about his Simpy system and I'm curious if  
> it fits into this mold or if it uses Lucene exclusively.  If Simpy  
> is not an example of a tagging system solely based on Lucene, does  
> anyone know of a full-featured tagging system that is?

I've been working on Collex <http://www.nines.org/collex>, which is  
Solr-based (Lucene underneath).  All tags/annotations as well as  
facets and full-text searching of objects are driven by Lucene.

We are working on distilling the heart of Collex's faceted browsing  
as well as tagging/annotating into a Ruby API and Rails plugins.   
This is called Solr Flare (and the underlying solrb library):
<http://wiki.apache.org/solr/Flare>.

And perhaps most relevant to your question is the attention that the
Solr team is giving to tagging, with use cases and implementation
possibilities being brainstormed here:
<http://wiki.apache.org/solr/UserTagDesign>.

	Erik



