Hi!

I searched the issue queue and didn't find such issue. When I export webform results in non-latin language (for example russian) there are encoding problems. Excel (both 2002 and 2007) doesn't show up letters correctly. I tried to change encoding to Unicode(UTF-8) it didn't help. However everything works in Open Office in Unicode.

Thanks!

CommentFileSizeAuthor
#4 webform_export_encoding.patch784 bytesquicksketch

Comments

Elian’s picture

By the way, same problem with webform-3.x-dev.

quicksketch’s picture

What are the settings for your download (CSV, TSV, Excel, etc?) I've successfully imported files with all kinds of characters (Arabic, Chinese, French). Often times this is a matter getting Excel to read your files properly, as opposed to something wrong with the files. More recent versions of Office give you trouble when dealing with Webform's xls files, I suggest using TSV for everything.

Whatever you do, do NOT use CSV. That format by standards does not support non-latin characters. It it is impossible to get them working with CSV. TSV is definitely the safest way to go (which is why it is the default).

Elian’s picture

Thank You for Your answer!

Yes, I tried all options. I don't think that's because of Excel settings, because webform 2 exports everything fine. I tried French, Hebrew, Russian, only English is displayed correctly.

I can attach images of what I see and .tsv files if it helps.

Thanks!

quicksketch’s picture

Version: 6.x-3.0-beta2 » 6.x-3.0-beta3
Status: Active » Fixed
StatusFileSize
new784 bytes

Well this is slightly embarrassing, everything would be working if we were doing a simple comparison correctly. Webform encodes everything perfectly, but is forgetting the infamous "BOM" marker beginning the file. This small change makes XLS and TSV files download and open in Excel properly. I tested with French and Hebrew and they downloaded just fine.

quicksketch’s picture

These changes are included in the beta 4 version which will be available shortly.

Status: Fixed » Closed (fixed)

Automatically closed -- issue fixed for 2 weeks with no activity.

avner’s picture

Version: 6.x-3.11 » 6.x-3.0-beta3
Priority: Normal » Minor
Status: Active » Closed (fixed)

Sorry for reopening this but this issue seems to come back
I am using 6.x-3.11 and when trying to export a TSV (or CSV for this manner) file with non Lating chars i get gibberish when on Linux machines
on Windows this works fine, however on linux machines the non Latin chars are unreadable

avner’s picture

Status: Closed (fixed) » Active
avner’s picture

Version: 6.x-3.0-beta3 » 6.x-3.11
Priority: Minor » Major
quicksketch’s picture

Priority: Major » Normal

I can't reproduce any problems when exporting TSV/Excel files. As noted, you shouldn't use CSV files for exports because they specifically do not support UTF8 characters.

avner’s picture

Version: 6.x-3.0-beta3 » 6.x-3.11
Priority: Minor » Normal
Status: Closed (fixed) » Active

sorry,
didnt see your reply

As i said i tested on TSV as well

the issue is on Linux
On windows its working fine

avner’s picture

Version: 6.x-3.11 » 6.x-3.x-dev
Status: Active » Closed (fixed)

ok, the issue is with my Linux configuration
I checked it with two other Linux machines and it seems fine

Sorry of this

Avner