Is it possible with crawler and/or xpath to get data from deeper pages.

Lets say i crawl tables here http://bit.ly/gXJRmr
but i want to get data to my nodes from deeper pages that would come trough links, so i would like to have data from http://bit.ly/fPudvG and http://bit.ly/gx7ayB for first 2 items etc. Is this possible with crawler?

Comments

danny_joris’s picture

Version: 6.x-1.0-beta3 » 7.x-1.x-dev

Currently that's not possible. It would be very useful, but I'm not sure how a Feeds UI interface for that would look like as you're fetching multiple pages. Potentially more than one level deep too.

twistor’s picture

Status: Active » Fixed

Check out Feeds Spider.

There's a good chance I will roll that into this project soon.

Status: Fixed » Closed (fixed)

Automatically closed -- issue fixed for 2 weeks with no activity.