Postponed
Project:
Import HTML
Version:
7.x-1.x-dev
Component:
Code
Priority:
Normal
Category:
Support request
Assigned:
Unassigned
Reporter:
Created:
27 Nov 2010 at 15:30 UTC
Updated:
8 Nov 2012 at 10:41 UTC
Some old html dont have proper Tag ID.
Is it possible to choose content before and after text? (similar to Yahoo Pipes)
For example: import content after text text1 and before text2
and also: import content start from text text1 until text2
Comments
Comment #1
dman commentedClearing the old 6.x issues from the issue queue for a cleanup.
The very first D4 version DID have token-based and regular-expression-based data extraction, but Only DOM-based methods have gone forward since. What you want CAN be done in a custom module that uses the HOOK_import_html() callback, where you can do your own process on the raw text and add it to the new $node->body
It would be an OK extension and I'd support someone who had a go at it, but won't be built on the current roadmap.