I am using the QueryPath module 1.x-dev. I am new to the QueryPath library and cannot set up the CSS selectors properly to extract content from an HTML page.
I am trying to parse http://blogs.yandex.ru/search.xml?text=%22%D0%B5%D0%B4%D0%B8%D0%BD%D0%B0... (comments from livejournal.com)
Using Firebug in Firefox, I identified the HTML structure of the elements I want to extract:
root repeated pattern (i.e. the context in the Feeds settings for the QueryPath parser): div[class='b-item Ppb-c-ItemMore SearchStatistics-item']
comment title: h3[class='title']
comment author: ul[class='info b-hlist b-hlist-middot'] li a
journal author: ul[class='info b-hlist b-hlist-middot'] li a (with offset 1)
All of these settings work well with the SimpleHTMLDOM Parser module I used recently (http://drupal.org/project/simplehtmldom_parser). I understand that QueryPath is a different library and needs a different syntax (I have read http://www.ibm.com/developerworks/opensource/library/os-php-querypath/in...).
I tried several variants, following the manual, such as:
div."b-item Ppb-c-ItemMore SearchStatistics-item" for the context
.title for the title
ul.'info b-hlist b-hlist-middot'>li:first for the comment author
ul.'info b-hlist b-hlist-middot'>li:second for the journal author
(Of course I also set up the mappings, etc.)
I also tried variants without quotes, without the leading dots, without the leading div, and so on. In every case nothing was imported.
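For comparison, standard CSS addresses an element that carries several classes by chaining the class names with dots (no quotes, no spaces), and it has no `:second` pseudo-class. Below is a minimal sketch of the same four selectors in that notation. It assumes that QueryPath 1.x-dev follows standard CSS3 selector syntax here and that each author link sits in its own `li`; neither assumption is confirmed by the page or the manual.

```css
/* Context: one repeated item; all three classes chained, no quotes or spaces */
div.b-item.Ppb-c-ItemMore.SearchStatistics-item

/* Comment title */
h3.title

/* Comment author: assumes the link is inside the first li of the list */
ul.info.b-hlist.b-hlist-middot > li:nth-child(1) a

/* Journal author: assumes the link is inside the second li of the list */
ul.info.b-hlist.b-hlist-middot > li:nth-child(2) a
```

If both links turn out to share one `li`, the `:nth-child` part would have to move to the `a` elements instead.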
What am I missing?
| Comment | File | Size | Author |
|---|---|---|---|
| | querypath_settings_in_importer.jpg | 105.63 KB | joomlerrostov |
| | firebug_dom_structure.jpg | 250.22 KB | joomlerrostov |