[Tagdb] RDBMS, Lucene or both?

Ace Jayz fourtlove at gmail.com
Tue Feb 6 18:43:15 GMT 2007


> And perhaps most relevant to your question is the consideration that
> the Solr team is paying to tagging, with use case and implementation
> possibilities being brainstormed here: <http://wiki.apache.org/solr/
> UserTagDesign>.
>
>         Erik



Erik,

Thanks for the pointer to the Solr Wiki discussion on tagging.  I've been
thinking along similar lines and convinced myself that Lucene could
efficiently handle many of the queries required of a tagging system.  I
would like to avoid an RDBMS and all of its complexities if possible, but
there are some operations that I think would be somewhat difficult with
Lucene alone.  For example, gathering the data to build a tag cloud would
seem to be problematic, without a lot of data duplication in the text
index.   It boils down to trying to choose the right tool for the right
job.  IMHO, Lucene is the right choice for the text search portion of a
tagging system.  But an RDBMS seems like the right tool to represent the
relationships between the tags, users, bookmarks, etc.  My one concern is
that I go down the Lucene-only path and get most things to work well but
that I fail to anticipate a future requirement that would be easily handled
by an RDBMS and very difficult, if not impossible, with Lucene.  My gut
tells me that some combination of the two is the right answer...

-- Ace.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.tagschema.com/pipermail/tagdb/attachments/20070206/ef52682d/attachment.htm 


More information about the Tagdb mailing list