Posted by macho on June 23, 2011 at 5:34pm
The NewsArchiver module is for people who
- find that web news items they'd like to refer to have disappeared from the sites where they read them;
- are sometimes unable to find news items they once remember reading; or
- would like to be able to tag news items for future reference.
Features
- Downloads a copy of the content at the url, in case it disappears later.
- Allows tagging of archived items.
- Suggests tags from the content, based on the ones you've already entered.
- Reads data inside html, pdf, doc, jpg, and tiff files.
- Stores away the original document's url
Requirements
Known problems
- Alpha release.
Current maintainer
Recommended software
- pdftotext
- antiword
- pdftotext