Spam

Jeremy - September 26, 2004 - 22:43

The Spam module provides numerous tools to auto-detect and deal with spam content that is posted to your site, without having to rely on third-party services.

The Spam module provides a trainable Bayesian filter, automatic learning of spammer URLs, flagging of content with an excessive number of links, the ability to create custom filters, and more.

Features:

  • Can be used completely independently of any third-party service.
  • Automatically learns and blocks spammer URLs and IPs.
  • Detects repeated postings of the same identical content, or content containing too many links.
  • Can notify the user and/or administrator that content was determined to be spam, preventing confusion over why their content doesn't show up.
  • Allows filtered users to provide feedback when their postings are incorrectly flagged as spam.
  • Provides comprehensive logging to offer an understanding as to how and why content is determined to be or not to be spam.
  • Language-independent: automatically learns to detect spam in any language using Bayesian logic.
  • Supports the creation of custom filters using powerful regular expressions.
  • Written in PHP specifically for Drupal.
  • Highly configurable and extendable (includes hooks for writing custom
    filters).

Details:

The spam api module includes several spam filter modules, all of which work together to determine if a given piece of content is spam. Each module will review the content and return a score between 1 and 99, where 1 means there is a 1% chance that the scanned content is spam and 99 means there is a 99% chance that the scanned content is spam. The spam api module takes a weighted average of all of these scores and assigns a final overall score for the content. Based on this final score, the content may or may not be allowed to be posted on your website.

Further documentation can be found in the module's INSTALL and README files, and on the Spam module handbook page.

Included filters:

  • Basyesian filter - auto-learns, performing statistical analysis on the words in new content
  • Custom filter - regexp/plain text matching.
  • URL limiter - auto-learns spammer websites and blocks content linking to these URLs
  • SURBL - blacklist of urls that commonly occur in spam (Third Party).
  • Node age filter - treats comments on old content as likely spam
  • Duplicate filter - blocks duplicate posts and bans associated IPs

Releases

Official releasesDateSizeLinksStatus
6.x-1.02009-Aug-1051.98 KBRecommended for 6.xThis is currently the recommended release for 6.x.
5.x-3.0-beta12008-Sep-2946.74 KBRecommended for 5.xThis is currently the recommended release for 5.x.
4.7.x-1.02007-Jan-0338.58 KBRecommended for 4.7.xThis is currently the recommended release for 4.7.x.
Development snapshotsDateSizeLinksStatus
6.x-1.x-dev2009-Sep-0151.77 KBDevelopment snapshotDevelopment snapshots are automatically regenerated and their contents can frequently change, so they are not recommended for production use.
5.x-3.x-dev2009-Feb-1447.1 KBDevelopment snapshotDevelopment snapshots are automatically regenerated and their contents can frequently change, so they are not recommended for production use.


 
 

Drupal is a registered trademark of Dries Buytaert.