Scraper

This project is not covered by Drupal’s security advisory policy.

For a modern, working example, see the Example Web Scraper (built on Feeds and Feeds XPath Parser).

@TODO: For Drupal 6, use SimpleXML as done by Nick Lewis.

The currently maintained portion of this module is a very simple scraper that takes a URL plus beginning and ending code, and displays the content found between them in a block. (This could easily be extended to display on a page.)
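The begin/end approach described above can be sketched roughly as follows. This is an illustration in Python, not the module's actual code; the function names extract_between and scrape are hypothetical, and the sketch assumes the beginning and ending code are plain strings matched literally.

```python
from typing import Optional
from urllib.request import urlopen


def extract_between(html: str, start_marker: str, end_marker: str) -> Optional[str]:
    """Return the text between the first start_marker and the next end_marker.

    Returns None if either marker is missing. (Hypothetical helper.)
    """
    start = html.find(start_marker)
    if start == -1:
        return None
    start += len(start_marker)  # skip past the beginning code itself
    end = html.find(end_marker, start)  # search only after the start marker
    if end == -1:
        return None
    return html[start:end]


def scrape(url: str, start_marker: str, end_marker: str) -> Optional[str]:
    """Fetch url and extract the fragment between the two markers.

    Assumes the page is UTF-8; undecodable bytes are replaced.
    """
    with urlopen(url, timeout=10) as response:
        html = response.read().decode("utf-8", errors="replace")
    return extract_between(html, start_marker, end_marker)
```

Literal string matching like this is fragile: any change to the markup around the markers on the scraped page breaks the extraction, which is one reason an XPath-based parser is the more robust choice.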

It does not currently store the scraped data, so you need to use the blockcache module and replace the scraped_content block with its cached equivalent. Otherwise, the page you are scraping is requested every time the scraped_content block is viewed.
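To show why caching matters here, the following is a minimal time-based cache sketched in Python. It is not how the blockcache module is implemented; the class name TTLCache, the fetch callable, and the 300-second default lifetime are all assumptions made for illustration.

```python
import time
from typing import Callable, Dict, Tuple


class TTLCache:
    """Cache fetched content per URL for a fixed number of seconds (hypothetical)."""

    def __init__(
        self,
        fetch: Callable[[str], str],
        ttl_seconds: float = 300.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch
        self._ttl = ttl_seconds
        self._clock = clock  # injectable so expiry can be tested without sleeping
        self._entries: Dict[str, Tuple[float, str]] = {}

    def get(self, url: str) -> str:
        now = self._clock()
        entry = self._entries.get(url)
        if entry is not None and now - entry[0] < self._ttl:
            return entry[1]  # still fresh: the remote page is not requested
        content = self._fetch(url)  # missing or expired: request the page again
        self._entries[url] = (now, content)
        return content
```

With a wrapper like this, repeated views within the lifetime reuse the stored copy, so the remote site is hit at most once per interval instead of once per block view.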

4.7 only: This module also contains code from the Legacy scraper, an industrial-strength scraping module written for 4.7, which is not maintained.

Project information

Unsupported
Not supported (i.e. abandoned), and no longer being developed.
Obsolete
Use of this project is deprecated.
Project categories: Content display, Import and export
Created by dado on 26 June 2006, updated 13 February 2024
This project is not covered by the security advisory policy.
Use at your own risk! It may have publicly disclosed vulnerabilities.