I recently had an issue with a large feed (~400 items) which is creating nodes and running through some external processing (calais). It usually works well but I think it encountered a timeout or network error during the last cron refresh which I'm guessing may have been during node_save. The result was that feedapi thought it had processed the feed and nothing happened on manual refresh. The only way to refresh was to manually reset the hash field in the db (I tried most of the other fields). This might be treating the symptoms but I'm wondering if it's worth having some kind of admin-only 'hard reset' link for each feed which will clear the hash and last updated fields.
Comments
Comment #1
alex_b commentedThis is actually a bug. If feedapi does not finish, it should not set the hash. Can you confirm this behavior for the latest version? I'm pretty sure you will.
Comment #2
adub commentedI think it will be pretty difficult for me a) to change version (as this is on a tightly controlled environment) b) repeat the error. I could scan the code and look for a suspect area if that helps but I can't really test this unfortunately.
Comment #3
aron novakOf course FeedAPI alters hash at the very last step of this process.