FeedAPI Scraper

This project has been abandoned: the maintainers of Feed Element Mapper have launched a successor project, Feeds. Read more about the future of FeedAPI and Feed Element Mapper in "Good bye FeedAPI, hello Feeds".

Add-on module for Feed Element Mapper that extracts (scrapes) content from HTML embedded in syndication feed items and allows it to be mapped to CCK fields. To extract HTML content, it ships with XPath and regular expression parsers out of the box; the module can also be extended with custom parsers.

Usage Example

The module can be used, for example, to extract an image URL from raw HTML and map it to a FileField image field.
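
The idea behind the two built-in parser types can be illustrated with a minimal, standalone sketch. It is written in Python rather than as Drupal code and is not the module's implementation: the sample HTML, the URL, and the function names are all hypothetical, and the XPath variant assumes the item HTML is well-formed, which real feed content often is not.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical HTML body of a feed item (assumed well-formed for the XPath variant).
ITEM_HTML = (
    '<div><p>Story text</p>'
    '<img src="http://example.com/photo.jpg" alt="Photo" /></div>'
)


def image_url_regex(html):
    """Return the src of the first <img> tag matched by a regular expression, or None."""
    match = re.search(r'<img\b[^>]*\bsrc=["\']([^"\']+)["\']', html, re.IGNORECASE)
    return match.group(1) if match else None


def image_url_xpath(html):
    """Return the src of the first <img> element matched by an XPath expression, or None.

    ElementTree implements only a subset of XPath and requires well-formed markup.
    """
    root = ET.fromstring(html)
    img = root.find('.//img')
    return img.get('src') if img is not None else None


if __name__ == '__main__':
    print(image_url_regex(ITEM_HTML))
    print(image_url_xpath(ITEM_HTML))
```

Both functions return the same URL for this sample. The regular expression tolerates malformed markup but is easy to fool with unusual attribute quoting, whereas the XPath lookup is more precise but depends on parseable input.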

Module Dependencies

The module depends on:

Credits

This project has been sponsored by Nuvole and Youth Agora.
