import_html 7.x-2.x-dev

Refactoring the settings (import profiles) to be more modular.
Settings should be able to be imported exported and shared better.
Phase 1 is removing all the globals and other shortcuts that remain, as well as cleaning up the naming conventions.
Then we will get to the point of saying "here is an import profile for a squarespace site" or something.
Ongoing - move more of the individual processes into smaller functions, with a view to eventually making them available to Feeds infrastructure as cleanup processes.

import_html 7.x-2.0-alpha1

Unsupported

The import_html settings profiles are being rearranged to be exportable and shared.
The way settings are saved therefore is incompatible between versions, so there is not upgrade path between 7.x-1.x and 7.x-2.x . Seeing as this is mostly run-once anyway, there probably won't be one added.

import_html 7.x-1.0

Unsupported

This has been stable enough for a long time.
I just left it without a tag accidentally.

import_html 7.x-1.x-dev

New features

So the D7 branch is doing most of the things it is supposed to now.
The UI and folder browser works as before, importing pages now supports
* path
* title
* body
* text fields (in general)
* even menus again.
* Url rewrites and file storage also works as before (though url relinking may be refactored at a later date)

Processing either in single pass of batch job seems to work.
The debug results screen works again - turn on debug and see it under the 'results' tab.

CHANGES from D6 include:
* debug flag is moved from per-preset configs to a global flag (no big chenge)
* the UI for settings is moved around a lot, but most previous items are there - they just may have shifted on the page.
* A large rewrite for scraping logic was made to support D7 fields
* There is the beginnings of a UI-based element mapper - we should soon be able to abandon the XSL templates entirely as logic that they used to perform is now easier to control via the settings, specifically:
* you can select elements (via xpath) to be discarded from the input (an add-on called preprocess-cleanups)
* The 'content tag ID' selector is still supported, but will be phased out as the node 'body' field is no longer such a special case, it's just another 'textarea'
* Preliminary support for direct RDF-based data field mappings, though that needs work to explain.

import_html 7.x-0.x-dev

New features

DO NOT USE yet (2012-03)
Only the broken API bits from the D6-D7 upgrade have been done so far.
The pages do their thing, but data is not yet being absorbed.
Hoping to have an actual working thing together soon though

import_html 6.x-1.1

New features
Unsupported

Absolutely dozens of minor updates. Lots happened in the last year, nothing huge, but lots small.
- Better support for cck filefield attachments
- support for Dublin core meta tas (and nodewords)
- A new option to avoid over-writing hand-edited content.
- More work on scaling
- drush support
- features support - export/import the import_htm rules as a feature - also import CONTENT pages from raw HTML as a feature.
- testing in strange corners
- takes more care when adding placeholder items for menus

import_html 6.x-1.0

New features
Unsupported

This has been working pretty well for a while now, nothing big outstanding. Time for a release, or I won't be seeing any more feedback.
Includes heaps of differences from the DRUPAL-5 version, notably bigger better batch support.
I even tried a unit test or two.
UI needs love, but functions are there.

import_html 6.x-1.x-dev

New features

DO NOT USE unless you can debug a bit. (Debug logging is an option under the profiles advanced settings)
MAY require the devel.module to be enabled as debug messages and scaffolding code may be scattered throughout the code.
REQUIRES PHP5. PHP4 support is being removed.

import_html 5.x-2.x-dev

New features

This release is a rewrite of the internal structure, including
- a better use of FAPI and form theming to really use the features that were unavailable in D4 when this was written
- Abstraction of the internal logic into per-module callbacks to add their own fields instead of doing it all in one huge function
- Abstraction of the functions into inc libraries for better code partitioning and CVS

import_html 5.x-1.2

Unsupported

Testing from the 5.0 upgrade threw up some issues with the multiple-page support.
All these are getting pretty stable now.
This release is pretty solid.
No extra special new functionality recently.
Some work on multiple CCK fields yet to come.

Subscribe with RSS Subscribe to Releases for Import HTML