I have a CSV file with names that contain accents.
The terms with accents are not imported.

Example of the CSV file:


BURGUNDY, Narrower Terms,CÔTE-D'OR
CÔTE-D'OR, Narrower Terms,Châtillon sur Seine
CÔTE-D'OR, Narrower Terms,Semur-en-Auxois
Semur-en-Auxois, Narrower Terms,L'abbaye de Fontenay
CÔTE-D'OR, Narrower Terms,Dijon
Dijon, Narrower Terms,Côte de Nuits
Dijon, Narrower Terms,Côte de Beaune
Dijon, Narrower Terms,Route des Grands Crus

The terms without accents are imported successfully, creating the hierarchy.
The terms with accents are not imported.

Comments

ppblaauw:

I also tried the latest 2.x dev version, but importing terms with accents does not work in that version either.

Suggestions on where to look in the code to add support for importing terms with accents are welcome, so that I can try to make a patch.

dman:

csv_format.inc includes the main function, taxonomy_xml_csv_parse(), so that's where to look.
Internally it uses the function csv_string_to_array() to do the splitting. If there is a character-encoding issue, I guess that is where you would have to look.

You may have to convert to UTF-8 in some way or another; I'm not too clear about what would be needed there.
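
A minimal sketch of what such a conversion step might look like, assuming the file arrives as ISO-8859-1 (a guess; spreadsheet exports are often Windows-1252 instead) and that the mbstring extension is available. The helper name example_csv_to_utf8() is hypothetical and not part of taxonomy_xml, and PHP's built-in str_getcsv() stands in here for whatever splitting the module does internally:

```php
<?php
/**
 * Hypothetical helper (not part of taxonomy_xml): normalise raw CSV bytes
 * to UTF-8 before they are split into fields.
 *
 * $fallback is an assumption about the source encoding of non-UTF-8 input.
 */
function example_csv_to_utf8($raw, $fallback = 'ISO-8859-1') {
  // Strip a UTF-8 byte order mark if one is present.
  if (substr($raw, 0, 3) === "\xEF\xBB\xBF") {
    $raw = substr($raw, 3);
  }
  // Input that is already valid UTF-8 passes through unchanged.
  if (mb_check_encoding($raw, 'UTF-8')) {
    return $raw;
  }
  // Otherwise treat the bytes as the fallback encoding and convert.
  return mb_convert_encoding($raw, 'UTF-8', $fallback);
}

// One line of the sample file, encoded as ISO-8859-1 ("\xD4" is "Ô").
$latin1_line = "BURGUNDY, Narrower Terms,C\xD4TE-D'OR";

// Convert first, then split on commas.
$utf8_line = example_csv_to_utf8($latin1_line);
$fields = str_getcsv($utf8_line, ',', '"', '\\');

// The third field now holds the accented term as UTF-8 bytes.
print $fields[2] . "\n";
```

If the file turns out to be UTF-8 already, the helper returns it untouched, so the problem would then lie elsewhere (for example in the splitting or in the database layer) rather than in the encoding of the input.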

ppblaauw:

I have not looked at the import in the last few days, but will soon.
I also found another import module (http://drupal.org/project/taxonomy_csv), which says it can import UTF-8.

With your information and this module, I can probably find out how to import UTF-8.

I will keep you informed of what I find out.