It would be great to be able to import an HTML page from a URL (or local file), which is not too dissimilar from CSV... it just uses different delimiters.
I know that my page title is in the <title>...</title> tags, and that the body of my page starts at <table><tr><td class=start> and ends at </table></table>.
Ideally, I'd like to be able to specify a list of URLs, which is probably easier than spidering a site.
Unfortunately the Import HTML module requires XML/XSLT support on the server, which is not always available, or in my case, is not recognised.
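To illustrate what I mean by treating the tags as delimiters, here is a rough sketch in Python. It uses only plain string matching, so nothing in it depends on XML/XSLT support. The function names (extract_page, import_pages) are made up for this example, and the delimiters are the ones from my own pages, so they would need to be configurable in a real module. It also assumes the pages are UTF-8.

```python
import re
from urllib.request import urlopen

# Delimiters from the page layout described above (assumed; adjust per site).
BODY_START = "<table><tr><td class=start>"
BODY_END = "</table></table>"


def extract_page(html, body_start=BODY_START, body_end=BODY_END):
    """Return (title, body) cut out of html using plain string delimiters."""
    # The title is whatever sits between <title> and </title>.
    match = re.search(r"<title>(.*?)</title>", html, re.IGNORECASE | re.DOTALL)
    title = match.group(1).strip() if match else ""

    # The body starts right after the start delimiter...
    start = html.find(body_start)
    if start == -1:
        return title, ""
    start += len(body_start)

    # ...and ends at the first end delimiter that follows it.
    end = html.find(body_end, start)
    body = html[start:end] if end != -1 else html[start:]
    return title, body.strip()


def import_pages(sources):
    """Fetch each URL in sources and return a list of (source, title, body)."""
    pages = []
    for source in sources:
        with urlopen(source) as response:
            # Assumption: pages are UTF-8; undecodable bytes are replaced.
            html = response.read().decode("utf-8", errors="replace")
        title, body = extract_page(html)
        pages.append((source, title, body))
    return pages
```

Calling import_pages with a list of URLs would give one (source, title, body) entry per page, which covers the "list of URLs" case without any spidering. Local files should work the same way if they are given as file:// URLs.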