When I import users with names or addresses including the danisk letters 'æ', 'ø', and 'å' the user is importet, but the string is truncated from the danish letter.
Names as "Peter Jensen", "Poul Hansen" is ok and will be handled correct, but names as "Peter Jørgensen" results in "Peter J". (Same with alle strings).
I have tried to use quotes " or ' like "Peter Jørgensen" or 'Peter Jørgensen', but it alså results in Peter J.
If it was only in the username I could live with it and change the trouble letters before import, but when it is in users fullname and address it is a great problem.
I think it is the same with other non-english names.
| Comment | File | Size | Author |
|---|---|---|---|
| #2 | drupal_6_user_import_handle_non_utf8_files.patch | 442 bytes | bathizte |
Comments
Comment #1
Anonymous (not verified) commentedNo solution yet?
I'm having exactly same problem...
Comment #2
bathizte commentedHello, I had the same problem.
This one-line patch solved the issue by converting csv read data into utf-8.
Comment #3
fellbrade commentedHello, I had met same problem too for Japanese.
Your one-line patch seems not works well for Japanese.
I had added "setlocale(LC_ALL , 'ja_JP.UTF-8');" to user_import.module and it seems works well in my environment.
I think this is fgetcsv locale setting problem.
Comment #4
robert castelo commentedI've added this line to the README.txt:
File needs to be saved as "Character Set: Unicode (UTF-8)".I'm not going to mess around with converting other formats to UTF-8 in the module, there are too many variations, and it's something that can be done fairly easily in any modern text editor.
Feel welcome to re-open this issue if you're still experiencing problems when trying to import a UTF-8 encoded csv file.
Comment #6
gisleThere is now a patch to fix this in code. Please see #3058492: Add support for Windows-1252 character encoding (and review).