ok, this is a stub, another one, for Nutch love
anyone out there working on Nutch 1.4 and Drupal 6 or 7 ?
I am going to take another crack at some kind of integration with BOA Aegir LEMP stack over the Christmas break
some relevant news below that may give some focus
it seems to me, Nutch could do with a HUGE Drupal integration boost
26 November 2011 - Apache Nutch 1.4 Released
The 1.4 release of Nutch is now available. This release includes several improvements including allowing Parsers to declare support for multiple MIME types, configurable Fetcher Queue depth, Fetcher speed improvements, tigther Tika integration, and support for HTTP auth in Solr indexing. Please see the list of changes made in this version. The release is available here.23 September 2011 - Apache Nutch focuses on 1.x series for main development
After some discussion and a vote about the issue, the Nutch development community decided to focus their efforts on maintaining and releasing the 1.x series of Nutch, and to branch the now former Nutch trunk based on Gora, allowing others to try and improve it, while the mainline development goes on.
Comments
Comment #1
niccolox commentedjust found this package from Australia's CSIRO, kind of like the National Science Foundation in Oz http://www.atnf.csiro.au/computing/software/arch/
Comment #2
niccolox commentedlooks like the CSIRO is moving towards using Drupal for its own sites and so the Arch (Nutch/Solr) package will *have* to integrate with Drupal
http://lucene.472066.n3.nabble.com/Drupal-Integration-with-Nutch-via-CSI...
http://lucene.472066.n3.nabble.com/Drupal-Integration-with-Nutch-via-CSI...
Comment #3
dstuart commentedNew Years resolution: Make Nutch module kick ass and get D7 version up to speed!
Comment #4
dstuart commentedYea, I saw thoses posts looks very interesting. With the latest patches in 1.4 I think we can get full integration without having to change the schema.xml in solr. Also with the new work by @ygerasimov in Apache Solr Views 7 we have a good option for proper D7 integration also
Comment #5
niccolox commentedgday dstuart, great new years resolutions
I think that the Nutch project could get a lot of fresh interest and energy with an easy-to-use and documented package for Drupal integration
I guess, if a network provider started implementing Nutch AND Solr we would also see it take off
Comment #6
niccolox commentedOk. Its been a long time between drinks. Any movement on Nutch Drupal?
Comment #7
naeluh commentedoops thats is from almost a year ago haha my bad ? also I am interested in helping test anything thanks
Comment #8
niccolox commentedCheck the lucene nutch solr group for a major nutch module sandbox release
Comment #9
niccolox commentedSolr Nutch Search Sandbox Project Updated to Integrate with Common Schema
http://groups.drupal.org/node/273813
Comment #10
niccolox commentedand the good news is the Solr Nutch sandbox works using Solr 3.6 and Nutch 1.6
http://groups.drupal.org/node/273813#comment-869623
can we fold this work into the Nutch module ?
Comment #11
avpadernoI am closing this issue, since Drupal 6 isn't supported anymore.