Install
Works with Drupal: 7.xUsing dev releases is not recommended, except for testing.
Using Composer to manage Drupal site dependencies
Downloads
Release notes
So the D7 branch is doing most of the things it is supposed to now.
The UI and folder browser works as before, importing pages now supports
* path
* title
* body
* text fields (in general)
* even menus again.
* Url rewrites and file storage also works as before (though url relinking may be refactored at a later date)
Processing either in single pass of batch job seems to work.
The debug results screen works again - turn on debug and see it under the 'results' tab.
CHANGES from D6 include:
* debug flag is moved from per-preset configs to a global flag (no big chenge)
* the UI for settings is moved around a lot, but most previous items are there - they just may have shifted on the page.
* A large rewrite for scraping logic was made to support D7 fields
* There is the beginnings of a UI-based element mapper - we should soon be able to abandon the XSL templates entirely as logic that they used to perform is now easier to control via the settings, specifically:
* you can select elements (via xpath) to be discarded from the input (an add-on called preprocess-cleanups)
* The 'content tag ID' selector is still supported, but will be phased out as the node 'body' field is no longer such a special case, it's just another 'textarea'
* Preliminary support for direct RDF-based data field mappings, though that needs work to explain.
* Behavior of htmltidy PHP extension changed again in some versions of PHP, and the API is unreliable. Added a switch (admin/structure/import_html/setup) to let you choose to use the stable commandline version even if there is a php extension available. (previously it would use the php extension by choice if available
CURRENTLY NOT WORKING
* the 'demo' page
* additional field types like date and file attachment. Seldom used anyway, but can be tested sometime soon
* some 'meta' fields - there was an add-on for nodewords that would pull page 'meta' into Drupal. that data is still being captured - but is not currently retained anywhere yet.
* taxonomy - need to revisit