I'm trying to learn ImportHTML with the simplest file I can think of:





Evergreene Digest



but when I try to import it (or anything more complicated, for that matter) I don't understand this output:

List files to import

0 total files found. (Some may be hidden or skipped)
: All Pages : All Images : All Resources : All Other http://localhost/importTest/index.html

Any debugging suggestions would be appreciated. Many thanks.

Comments

dman’s picture

That eg file is simple alright. Even simpler would be a zero-length file :)

At least add something between the body tags...

Check that 'html' is a supported content type. I'm pretty sure it is, but I may have typo-ed. Also, 'index' is (understandably) a special case file. That's not a reason why you should get any problem, but may be a factor that needs investigation.

The '0 files found' may be an off-by-one arithmetic error. I had to count all files and directories and hidden files found, then subtract to get the number of displayed files.

For fun debugging, just uncomment and tweak the DEBUGLEVEL at the top of the import_html.module.
2 should give you explanations of the process. 4 will dump the full source of the page at every one of the dozen transformation stages.

.dan.

dman’s picture

Status: Active » Closed (fixed)

no more info/troubles?