Hi, I like the idea of this module, but after reading and re-reading the documentation I can't get it to work.

I keep on getting the errors like:

warning: DOMDocument::loadHTML(): Tag rss invalid in Entity, line: 2 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Tag channel invalid in Entity, line: 4 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Unexpected end tag : link in Entity, line: 7 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Tag description invalid in Entity, line: 8 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Tag lastbuilddate invalid in Entity, line: 9 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Tag language invalid in Entity, line: 11 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Tag sy:updateperiod invalid in Entity, line: 12 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Tag sy:updatefrequency invalid in Entity, line: 13 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Tag generator invalid in Entity, line: 14 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Tag atom10:link invalid in Entity, line: 16 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.
warning: DOMDocument::loadHTML(): Tag feedburner:info invalid in Entity, line: 16 in /Library/WebServer/Documents/newsitorium/sites/all/modules/feeds_xpathparser/FeedsXPathParser.inc on line 419.

...

There was an error during the XPath query: //div[@class="article"].
Libxml returned the message: Tag description invalid, with the error code: 801.

Basically, I don't understand what I am doing wrong.

Here's the feed that I am trying to parse: http://feeds.macrumors.com/MacRumors-All

And here are the XPath settings:

Context: //div[@class='article']
title: h1
body: div[@class='content']

Thanks in advance for your help.

Comments

costinius’s picture

Forgot to mention, I am using XPath HTML Parser with Node Processor

costinius’s picture

Title: Grabbing full HTML code of an article » warning: DOMDocument::loadHTML(): Tag rss invalid in Entity, line: 2 in /FeedsXPathParser.inc on line 419.
Status: Closed (works as designed) » Active

Oups, I was wrong in expecting something this module wasn't designed to deliver.

I was thinking it would use an RSS feed and run through all the individual URLs of the items listed in that feed and fetch HTML content from each URL. Apparently, this is not the case. I just tried to use not the RSS feed above, but an actual URL of one of the items, and it worked (with minor errors).

costinius’s picture

Title: warning: DOMDocument::loadHTML(): Tag rss invalid in Entity, line: 2 in /FeedsXPathParser.inc on line 419. » Grabbing full HTML code of an article
Status: Active » Closed (works as designed)

Ok, found another way to solve the problem by modifying the image grabber module instead. Issue is closed.

Status: Active » Closed (works as designed)