Hi,
We have a Drupal intranet for our organization and would like to employ an enterprise search using Lucene and/or Solr (with Nutch crawler). We are interesting in searching a structured Greenstone document repository (www.greenstone.org) and an unstructured "forums" site (a social software site using Clearspace www.jivesoftware.com) and probably more down the line.

I'm wondering if anyone has undertaken a similar project and can give us some insight as to:

* Which modules you used, and which version of Drupal you were using.

* How long it took to set up (programming hours)

* Setbacks or problems we should look out for (e.g., related to searching both structured and unstructured data, or anything else that you wish someone had warned you about :)

* How successful you feel the project has been for you. Are your users finding what they need?

We deeply appreciate any responses to this thread. As a non-profit, we're interested in going with open source if at all possible. Feel free to email me directly.

Thanks and best regards,
Jeff

Comments

robertDouglass’s picture

Hi, I'd like to suggest the ApacheSolr module as a starting point. There are extant patches for Lucene/Solr that let it work with Nutch, and this would be an interesting area to research. Solr is very good at searching structured data because of it's excellent schema and dynamic fields. There is out-of-the box integration for CCK fields (of certain sorts), and this feature can be easily extended.

- Robert Douglass

-----
my Drupal book | Twitter

Francewhoa’s picture

I tried a lot of Drupal search modules. The "Apache Solr Search Integration" module is the most powerful for large project http://drupal.org/project/apachesolr

It can be install in the same day. I wrote a how-to handbook for absolute beginners at http://drupal.org/node/504558

Loving back your Drupal community result in multiple benefits for you