I am importing a site with 1500 files. I am getting a lot of warnings of the form:
Warning: DOMDocument::loadXML() [<a href='domdocument.loadxml'>domdocument.loadxml</a>]: Failed to parse QName 'background:' in Entity, line: 152 in parse_in_xml_string() (line 175 of /var/customers/webs/example.com/sites/all/modules/contrib/import_html/coders_php_library/xml-transform.inc).
Missing here is the path to the document it failed to parse. How can I determine which one it was?

Comments

dman’s picture

In the advanced settings, you can turn up the debug volume, that'll tell you more than you ever wanted to know about the steps the process is going through, including (at high levels) full dumps of the text in question.

Lots of debug messages means LOTS of messages, so don't try to do thousands of files with the volume up that high, take it in batches.

Post the example up here if you find it.

spflanze’s picture

Title: Failed to parse QName » Errors from parse_in_xml_string()
Category: support » feature

Feature Request:

When parse_in_xml_string() creates an error there is no indication in the output what file caused the error at lower debug levels. This is needed. Higher debug levels would cause too much output.

I have a site of 1500 files producing many such errors. Now I have to do trial and error over many sections of the site to find which section of the site will reveal the file name at higher debug levels.

spflanze’s picture

I turned up the debug volume to the highest level that field allows, a value of three. I also checked the "Keep Temp Files" box and did an import within an area I found to be creating some of the error messages. No additional information appeared on screen within the pink box after importing. The error messages remained the same there. I also could not find a "files/import" path anywhere in the Drupal installation. I used the "find" command to do a search for an "import" directory in the entire Drupal installation and it did not get any hits. Where are the temp files supposed to be?

The "files/import" path seems to be where it says to look for the temp files in the description under the "Keep Temp Files" check box.

dman’s picture

on admin/build/import_html/results/ you should be able to show 'more' details.
These logs are also put into the watchdog log and you can find them there.
The temp folder is 'files/imported' not files/import

dman’s picture

Status: Active » Closed (cannot reproduce)

Clearing the old 6.x issues from the issue queue for a cleanup.
The UI and filtering of error messages has been improved in the meantime however.