Closed (fixed)
Project:
Node import
Version:
5.x-1.x-dev
Component:
Code
Priority:
Normal
Category:
Support request
Assigned:
Unassigned
Reporter:
Created:
7 Aug 2007 at 22:20 UTC
Updated:
4 May 2008 at 01:31 UTC
In spanish we use "´" for words like "canción", "llevó", etc.
Problem is that this module seems to cut every part, since "´" is used
For instance, a node whose title is
"Thompsom según Thays"
would have a cuted titled:
"Thompsom seg"
Is there anyway to solve this?
In the file attach you can see how both title and content have been cuted after import.
Cheers
David.
| Comment | File | Size | Author |
|---|---|---|---|
| drupal-import-node.png | 10.73 KB | dsanchez |
Comments
Comment #1
jantzen commentedI experience this problem too. It happens after these accent characters such as 'é' as well as at umlauts as in "ü".
Comment #2
jantzen commentedI think this problem is related to character encoding. I was able to get the data to import correctly when it was saved in utf-8 format. utf-16 didn't work, nor did the default Excel CSV format output.
Comment #3
davemybes commentedYou should always try and use UTF-8 as your default text format for importing into Drupal, as all its tables are UTF-8. So, when preparing your data, ensure that the text editor you use is saving the files in UTF-8 (without BOM if you have that option). Excel has some issues with UTF-8 and I would recommend that you rather use OpenOffice for this. When saving CSV files in OpenOffice, you get the option for selecting not only the delimiters and separators, but also what text encoding to use - select Unicode (UTF-8) here. I've had those little trademark and registered symbols in node titles, and if I don't do this, they all get cut at that symbol.
So I wouldn't call this a bug - its actually a file format error on the user's side. Changing tags to reflect this.
Comment #4
Robrecht Jacques commentedDid you try node_import-5.x-1.5 ?
Comment #5
rgammon commentedI can confirm this. Importing Chinese dictionary data, all UTF-8. Standard line has many diacritics in transliteration field. Sample entry:
"安靜","ānjìng","calm". It imports fine.
Comment #6
Anonymous (not verified) commentedAutomatically closed -- issue fixed for two weeks with no activity.