Posted by mikesmullin on January 17, 2008 at 5:06pm
An API for scraping the Internet via cURL, HTMLTidy, and SimpleXML.
Makes scraping sites, parsing XML, and following hyperlinks a snap. A number of other scraper-type modules depend on it, so it's easier to update code in one place than in many, which saves developers a lot of time. It can be extended to implement proxy rotation, delayed hits, user-agent rotation, etc.
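To give a rough idea of what hyperlink following, delayed hits, and user-agent rotation involve, here is a minimal sketch. It is written in Python with only the standard library, not in PHP, and it does not show this module's actual API. The names LinkCollector, extract_links, and PoliteFetcher are hypothetical and exist only for illustration. Python's lenient HTML parser stands in for the HTMLTidy and SimpleXML steps.

```python
import itertools
import time
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import Request, urlopen


class LinkCollector(HTMLParser):
    """Collects absolute URLs from <a href="..."> tags (hypothetical helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page they came from.
                self.links.append(urljoin(self.base_url, href))


def extract_links(html, base_url):
    """Return every hyperlink target found in html, as absolute URLs."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


class PoliteFetcher:
    """Rotates user agents round-robin and spaces out requests (a sketch)."""

    def __init__(self, user_agents, delay_seconds=1.0):
        if not user_agents:
            raise ValueError("at least one user agent is required")
        self._agents = itertools.cycle(user_agents)
        self.delay_seconds = delay_seconds
        self._last_hit = None

    def next_headers(self):
        # Each call hands out the next user agent in the rotation.
        return {"User-Agent": next(self._agents)}

    def wait(self):
        # Sleep only for whatever is left of the delay since the last hit.
        if self._last_hit is not None:
            remaining = self.delay_seconds - (time.monotonic() - self._last_hit)
            if remaining > 0:
                time.sleep(remaining)
        self._last_hit = time.monotonic()

    def fetch(self, url):
        """Fetch url after the delay, using the next user agent."""
        self.wait()
        request = Request(url, headers=self.next_headers())
        with urlopen(request, timeout=10) as response:
            return response.read().decode("utf-8", errors="replace")
```

A crawl loop would call fetch() on a page, pass the result to extract_links(), and queue the returned URLs. Proxy rotation could be added in the same round-robin way, but it is left out here to keep the sketch short.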
Check out these cool DataMiner API-related projects:
- Import Contacts - import email addresses from external sites.
For now, this module requires PHP 5.
Sponsored by Punim.com.
Developed by Smullin Design.
Project Information
- Module categories: Developer
- Maintenance status: Unknown
- Development status: Unknown
- Last modified: March 2, 2008