Fuzzy Search

BlakeLucchesi - July 9, 2007 - 10:40

This module provides Drupal sites with a fuzzy search engine that allows broader keyword matches.

Features:

  • Misspellings and typos still return relevant results.
  • Scoring factor hooks are exposed so that contrib modules can give administrators additional scoring options.
  • A reindex function lets modules flag a specific node for reindexing at the next cron run.
  • Indexing of CCK text fields and taxonomy terms.
  • Implements the 'update index' op of hook_nodeapi, so modules that already integrate with search.module work the same way.
  • Better search performance than search.module, because no temporary tables are created during a search.

NOTE: There are open issues with node access and with the query that gets rewritten when I wrap the search function in db_rewrite_sql. I am working to fix this as soon as possible.

NOTE: Please download the development snapshot, as it is the more accurate working release. Also, this module works independently of search.module.

Version 1.2 will be the final version of the module to use the older result-fetching method, which was a bit too fuzzy. I have since reworked much of the code so that the completeness of individual words is checked before a match is returned: when searching for a 10-letter word, a 3-character match on a 4-letter word is not considered a match, and vice versa. I hope to release this newer version as 2.0, but I need feedback on the dev snapshot before I commit it as 2.0. Thanks for all the community support; I hope this module serves everyone well.
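
To illustrate the completeness check described above, here is a minimal sketch in Python. It is not the module's code. The 3-character chunk size comes from the example above, but the scoring formula, the 0.5 threshold, and the function names (ngrams, completeness, is_match) are assumptions made for illustration only.

```python
def ngrams(word, n=3):
    """Split a word into overlapping n-character chunks (hypothetical helper)."""
    word = word.lower()
    if len(word) < n:
        return [word]
    return [word[i:i + n] for i in range(len(word) - n + 1)]


def completeness(query_word, candidate, n=3):
    """Share of chunks the two words have in common, measured against
    the word with more distinct chunks. This formula is an assumption."""
    query_chunks = set(ngrams(query_word, n))
    candidate_chunks = set(ngrams(candidate, n))
    shared = len(query_chunks & candidate_chunks)
    return shared / max(len(query_chunks), len(candidate_chunks))


def is_match(query_word, candidate, threshold=0.5, n=3):
    """Accept a candidate only if enough of the whole word matched.
    The 0.5 threshold is an arbitrary value chosen for this sketch."""
    return completeness(query_word, candidate, n) >= threshold


# A 10-letter query against a 4-letter word sharing one 3-character chunk.
print(is_match("management", "mane"))       # False
# The same pair the other way round.
print(is_match("mane", "management"))       # False
# A misspelling that still covers most of the word.
print(is_match("managment", "management"))  # True
```

Dividing by the larger chunk count keeps the score symmetric, which is one way to get the "and vice versa" behaviour: a short word cannot match a long one just because all of its few chunks appear inside it.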

Releases

Official releases      Date          Size       Status
5.x-1.2                2007-Aug-19   12.88 KB   Recommended for 5.x. This is currently the recommended release for 5.x.

Development snapshots  Date          Size       Status
5.x-1.x-dev            2008-Jun-28   15.08 KB   Development snapshot. Development snapshots are automatically regenerated and their contents can frequently change, so they are not recommended for production use.
