
Drupal 4.6 aggregator module not reading/importing a lot of RSS feeds

Hi,

On my site I'm using the aggregator module to retrieve a few RSS feeds, but recently a lot of them are not being imported any more.
Does anyone have an idea as to why?

Here's an example of an RSS feed that is not being imported anymore:
http://noorderlicht.vpro.nl/themasites/rss/magazine.jsp?rssnr=21789719

The RSS feed does seem to exist.

I'm lost.

The Phoenix

Comments

Anybody?

Does someone know what causes the problem and perhaps even how to fix it?

Some feeds not importing into 4.7.3

I'm using the latest version and this problem of some feeds not being imported or displayed is still there. I'm doing fine with most of the other feeds but I couldn't understand why others just keep having 0 items in them even though Drupal tells me there are new items from such and such source. These feeds are read and displayed well by other readers. They are also from a publisher whose other feeds otherwise display well in Drupal. [I do know how to use the block or menu system to display feeds.]

Here are some of the feeds that have contents in them but show as having 0 items in Drupal:

I would appreciate any help on this.
-- Glenda (http://softtester.org)

RSS Problem

Hello People,

I am using Drupal 4.7, and when I add an RSS feed and then click on "update items" it shows 0 items.
But if I run the same RSS feed in any RSS reader, it works fine, so I can't work out the exact reason. Is Drupal unable to parse the data in the feed, or is there some other cause? I don't get any errors in the logs. I also read somewhere on the forums that the problem could be caused by special characters that the PHP XML parser does not understand, so it fails to parse the data.
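
One way to test the special-characters theory outside Drupal is to hand the raw XML to PHP's own XML parser and see whether it reports an error. A minimal sketch, assuming the feed body has already been fetched into a string; the function name feed_xml_error is made up for illustration and is not part of Drupal:

```php
<?php
// Hypothetical helper, not part of Drupal: returns NULL when PHP's XML
// parser accepts the feed body, or an error description when it does not.
function feed_xml_error($data) {
  $parser = xml_parser_create();
  // TRUE marks $data as the final (and only) chunk of the document.
  $ok = xml_parse($parser, $data, TRUE);
  $error = NULL;
  if (!$ok) {
    $error = sprintf('%s at line %d',
      xml_error_string(xml_get_error_code($parser)),
      xml_get_current_line_number($parser));
  }
  unset($parser);
  return $error;
}
```

A bare & inside a title, for instance, should come back as an error message with a line number, while a well-formed feed returns NULL. If the feed parses cleanly here, the special characters are probably not the cause.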

Regards,

Hitesh


Reason found

Hello People,

OK, I found the reason for the RSS feed showing 0 items. Drupal parses the data successfully but does not enter it into the database. The INSERT query in aggregator.module includes the term edit['timestamp']; when we remove it from the query, the data is inserted into the database successfully, i.e. without the timestamp. With the timestamp, the data is not inserted. The module does get the timestamp (if we print it we can see it), but the insert still fails.

I don't know exactly why the timestamp is not being entered into the database, but for my purposes I do not need the timestamp, so the query without it kind of works for me.
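
The thread does not establish why the timestamp breaks the insert. One untested possibility is that the value reaching the query is not a plain integer for some feeds. Under that assumption, a guard like the sketch below would keep the column in the query instead of dropping it. The function name feed_item_timestamp and its $now parameter are hypothetical, not aggregator.module code:

```php
<?php
// Hypothetical helper, not aggregator.module code: turn a feed date string
// into an integer Unix timestamp, falling back to the current time.
function feed_item_timestamp($date, $now = NULL) {
  $now = ($now === NULL) ? time() : $now;
  $timestamp = strtotime($date);
  // strtotime() returns FALSE (or -1 before PHP 5.1) when it cannot
  // parse the string, so treat anything non-positive as "unknown".
  if ($timestamp === FALSE || $timestamp <= 0) {
    return (int) $now;
  }
  return (int) $timestamp;
}
```

Printing the result for a feed that shows 0 items would at least show whether its dates are being parsed at all.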

Regards,

Hitesh


RSS stopped updating feeds for no reason....

Hi,

My 3 blog feeds were coming in, then nothing. Site is here:

http://www.movingaheadcommunications.com/drupal/

3 feeds that stopped:

Moving Ahead Blog
Friday Biz Opp Report
Gift Gallery

Cron job set & was working fine.
Can't even update manually via admin panel - it times out for each feed. What's up with this????

My Solution

In my case, I had a self created feed that was working with all aggregators etc - but showing up as zero items in drupal. After investigating, I found the problem. On line ~904 of aggregator.module it does a check of the item to figure out if it should be adding it as a new item - or updating an existing item. IMHO, the check is flawed.

To determine if a news item already exists, it first checks for items with the same link and assumes that if the items have the same link, they are the same item. If it can't find any items within the feed with the same link, it then looks for items with the same title within the feed and again assumes that the item is the same. Both of these checks are a little dubious imo - neither is specified as being required to be unique within the specs.

Basically, it assumes that item links and/or item titles are unique within the feed. The RSS 2.0 spec doesn't say this for sure.. not sure about 1.0 or Atom. All of these feed types do have id's (GUID for RSS 2.0, dc:identifier for 1.0, id for Atom) that are globally unique identifiers for the feed item.

Unfortunately, within the aggregator module, these actual item id's are not stored - so there is no way later to compare them to figure out what is / is not new.
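
For illustration, a check that prefers the item id and only falls back to link and then title could look roughly like this. It is a sketch, not aggregator.module code: the function names, the array keys (guid, link, title) and the in-memory $seen_keys array are assumptions standing in for an id column that the module does not currently store.

```php
<?php
// Hypothetical helpers, not aggregator.module code.

// Build the key used to decide whether a parsed item is already stored:
// prefer the item's own id, then its link, then its title.
function feed_item_key(array $item) {
  foreach (array('guid', 'link', 'title') as $field) {
    if (isset($item[$field]) && $item[$field] !== '') {
      return $field . ':' . $item[$field];
    }
  }
  return NULL;
}

// Return the items whose key is not in $seen_keys (keys already stored).
// Items with no usable key are always treated as new.
function feed_new_items(array $items, array $seen_keys) {
  $new = array();
  foreach ($items as $item) {
    $key = feed_item_key($item);
    if ($key === NULL || !isset($seen_keys[$key])) {
      $new[] = $item;
      if ($key !== NULL) {
        // Remember the key so a repeat within the same feed is skipped.
        $seen_keys[$key] = TRUE;
      }
    }
  }
  return $new;
}
```

With this approach, two items that share a link but carry different ids are both kept, which is the case that produced zero items for my feed.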

For a quick and dirty solution, I removed the check entirely - so each item is treated as new regardless of whether it is an update; however, I suppose you could leave in the check for identical titles if you like.

Cheers,

toddz

toddz - that sounds great

toddz - that sounds great that we can easily get that aggregator up and working. could you possibly show exactly which part of the code to delete?

thanks

-nik

First, I am working with

First, I am working with this aggregator module - the id is at the top of the file:
$Id: aggregator.module,v 1.278.2.6 2006/07/03 08:09:31 killes Exp $

I strongly recommend you don't do this if you don't know what you are doing. This didn't break my system - your mileage may vary.

I commented out the following lines (lines 905 to 910):

 

    if ($link && $link != $feed['link'] && $link != $feed['url']) {
      $entry = db_fetch_object(db_query("SELECT iid FROM {aggregator_item} WHERE fid = %d AND link = '%s'", $feed['fid'], $link));
    }
    else {
      $entry = db_fetch_object(db_query("SELECT iid FROM {aggregator_item} WHERE fid = %d AND title = '%s'", $feed['fid'], $title));
    }

Cheers,

Todd

Doesn't Work

I tried commenting out the code you suggested and this doesn't work. Perhaps I am commenting it out incorrectly. Please show the entire code again, including the comment tags.

re: Doesn't Work...

for you ;-)

As mentioned in my original post - this _only_ solves the problem iff the feed was using the same link or title on multiple posts which was the case in my situation. I would manually check to see if that was the case in your situation.

If you are unsure how to comment out code in PHP, I'm not entirely sure I would suggest that you edit the php directly without first learning a bit about it.

My guess is that you are running into another problem.

Only Importing 1 item for valid RSS 2 feed

I'm having a similar problem with the Aggregator and Aggregation modules. They seem to do the same thing: import only the first of many items in a (as far as I can tell) perfectly valid RSS 2 feed. The only difference I can visually detect between the entries is that the links in the un-imported items have dynamic URLs. I'm not sure that has anything to do with the issue, but those checks mentioned above, by excessory, do seem dubious as potential failure points. I tried commenting them out, but it doesn't seem to have changed anything.

hmmm...
Anyone else having this type of problem?

Cheers,
dafreak
