Install

Works with Drupal: 7.x

Using dev releases is not recommended, except for testing.

Using Composer to manage Drupal site dependencies

Downloads

Download tar.gz 225.49 KB
MD5: 482171a40520dda60234e6cd1bf18ba3
SHA-1: 99359b559d59f2f12a88c91b31403313e3a9d4d3
SHA-256: 942cf2ed8ddeadb9449ae6b7392fb50bf4e07528277e2349dbcad9d2a05c9ddc
Download zip 267.77 KB
MD5: 7728f47423f5aebbdad9f00b5a60555d
SHA-1: 6e09703925fdcfc1d7d1cbc452acb5688aeb2c08
SHA-256: 0666a6ba9b944515a76e7ef6d99493effac5c873aefdfffc1110a5a78ffbfc3b

Release notes

So the D7 branch is doing most of the things it is supposed to now.
The UI and folder browser works as before, importing pages now supports
* path
* title
* body
* text fields (in general)
* even menus again.
* Url rewrites and file storage also works as before (though url relinking may be refactored at a later date)

Processing either in single pass of batch job seems to work.
The debug results screen works again - turn on debug and see it under the 'results' tab.

CHANGES from D6 include:
* debug flag is moved from per-preset configs to a global flag (no big chenge)
* the UI for settings is moved around a lot, but most previous items are there - they just may have shifted on the page.
* A large rewrite for scraping logic was made to support D7 fields
* There is the beginnings of a UI-based element mapper - we should soon be able to abandon the XSL templates entirely as logic that they used to perform is now easier to control via the settings, specifically:
* you can select elements (via xpath) to be discarded from the input (an add-on called preprocess-cleanups)
* The 'content tag ID' selector is still supported, but will be phased out as the node 'body' field is no longer such a special case, it's just another 'textarea'
* Preliminary support for direct RDF-based data field mappings, though that needs work to explain.
* Behavior of htmltidy PHP extension changed again in some versions of PHP, and the API is unreliable. Added a switch (admin/structure/import_html/setup) to let you choose to use the stable commandline version even if there is a php extension available. (previously it would use the php extension by choice if available

CURRENTLY NOT WORKING
* the 'demo' page
* additional field types like date and file attachment. Seldom used anyway, but can be tested sometime soon
* some 'meta' fields - there was an add-on for nodewords that would pull page 'meta' into Drupal. that data is still being captured - but is not currently retained anywhere yet.
* taxonomy - need to revisit

Created by: dman
Created on: 21 Mar 2012 at 21:51 UTC
Last updated: 12 Mar 2015 at 20:48 UTC
New features

Other releases