Truckloads of non-node, ancillary data in Drupal + Apache Solr?

yountod - October 1, 2009 - 18:45

I'd like to take gobs and gobs of data and dump it onto a filesystem or a separate MySQL DB, without necessarily importing it as nodes in Drupal. However, I'd like the ApacheSolr module to pick up on this data; that is, I'd like my separate Solr installation to index it and make those search results available through the Drupal search results page.

Does anyone have suggestions as to how I might colocate this data, and/or create new indexes in Solr for Drupal to see?

I'm thinking of something on the order of tens of millions of database entries and a few hundred thousand documents in a file system, all indexed by Lucene. So if I search for "Joe Blow" in my Drupal ApacheSolr search box, I can find Joe simultaneously in nodes, external DBs, external files, and perhaps even other people's Web content I might have crawled with Nutch.
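
To make the idea concrete, here is a rough sketch (Python, not tried against a live Solr) of how rows from an external DB might be turned into Solr's XML <add> update messages in batches. The "source" field and the "crm/7"-style id scheme are placeholders I made up to keep external records from colliding with node ids; they are not anything the ApacheSolr module defines, and the Solr schema would need matching fields.

```python
import xml.etree.ElementTree as ET


def rows_to_solr_add_xml(rows, source):
    """Build a Solr <add> update message from an iterable of dict rows.

    `source` is a hypothetical label for where the rows came from
    (e.g. an external DB name); it is used to namespace the ids.
    """
    add = ET.Element("add")
    for row in rows:
        doc = ET.SubElement(add, "doc")
        # Prefix the id with the source so records from different
        # systems cannot overwrite each other in a shared index.
        fields = {"id": f"{source}/{row['id']}", "source": source}
        fields.update({k: v for k, v in row.items() if k != "id"})
        for name, value in fields.items():
            field = ET.SubElement(doc, "field", name=name)
            field.text = str(value)
    return ET.tostring(add, encoding="unicode")


def batches(rows, size):
    """Yield lists of at most `size` rows, so tens of millions of
    entries are sent as many small updates instead of one huge one."""
    batch = []
    for row in rows:
        batch.append(row)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch
```

Each string returned by rows_to_solr_add_xml() would then be POSTed to the Solr update handler, followed by a commit. How the Drupal search results page should display hits that are not nodes is the part I still don't know.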

Pointers, anyone? Thanks...

Forums are a good place to get lost

robertDouglass - October 10, 2009 - 14:10

This would make a good support issue in the Apache Solr queue - that is the best place to get in touch with some of the other developers who have actually done what you're talking about: http://drupal.org/project/issues/apachesolr?categories=All

The other place, which you've found, is the Lucene, Nutch and Solr group.

Good luck, and let us know the details of your solution when you get something set up. I've not done anything similar so I don't have any pointers to share.

- Robert Douglass
