Download & Extend

Title - Importing the actual Title and making sure it does not convert to ASCII

Project:Import HTML
Version:6.x-1.0
Component:Code
Category:support request
Priority:normal
Assigned:Unassigned
Status:active

Issue Summary

I saw another thread on tweaking the XSL to get this module to import the actual title of the page rather than the h1 tag. What I am needing to know is how do I get the module to do that, plus make sure that it imports the actual title text instead of converting it to ASCII. When importing pages that have Cyrillic, the module does a very bad thing, converting the pages to ASCII and then this code is show in the title rather than the Cyrillic text. Has anyone else experienced this and how did you resolve it? (I don't want the titles transliterated, through the transliteration module.)

Comments

#1

The supplied XSL templates contain examples for how to select different titles.

For extended character support, we need someone who knows how to do this correctly in XML to help. I don't and have spent a few days trying. Hopefully someone else out there has tried working with non-ASCII character sets before.
The only reliable way I've seen is to convert difficult characters to XML codes early in the process, and keep them that way through the processing.

#2

Thank you for your answer! And thank you for checking into it and trying to work with it. Hopefully someone else with this experience will contribute information as to how they did it, to make it work.

nobody click here