I'm importing some text and when that text has an em dash in it; when I do so the resulting text cuts off where the em dash is. So the text:

Forcier on the edge with a ponderous-looking Fleming—foreshadowing! He jukes past the LB; the LB makes a diving tackle attempt that trips him up. Would have been a few more otherwise.

Ends up being "Forcier on the edge with a ponderous-looking Fleming".

CommentFileSizeAuthor
#6 ndash.JPG11.7 KBsaratt
#5 dash.txt43 bytesalex_b
#5 ndash.txt45 bytesalex_b

Comments

alex_b’s picture

What parser are you using?

Funkymoses’s picture

CSV.

alex_b’s picture

Status: Active » Postponed (maintainer needs more info)

Could you post a minimal CSV file that breaks the import?

saratt’s picture

I had the same problem too. I was trying to import a CSV file which had fields like "Post-award – Ongoing Grant Management" with an ndash (–) and it wouldn't import the entire field(it was just importing Post-award). I then changed the ndash to a normal dash and imported the file again and it worked. I was actually mapping this field to node title, if it makes any difference.

Thanks,
Sarat.

alex_b’s picture

Version: 6.x-1.0-alpha15 » 6.x-1.x-dev
Status: Postponed (maintainer needs more info) » Active
StatusFileSize
new45 bytes
new43 bytes

Ok. We should go after this. Just compiled two test files, one with ndash, other one with dash. Haven't used them yet. Can you confirm that ndash.txt breaks parsing and dash.txt does?

saratt’s picture

StatusFileSize
new11.7 KB

Yes, ndash.txt breaks and dash.txt does not break. The first node in the attached image is the ndash title and the second is the dash title
FYI: The url for the node created with ndash.txt has the full title (/post-award-ongoing-grant-management).

alex_b’s picture

Thank you for quick confirmation. I will take a closer look at this issue as soon as I can grab some time, should be before Monday.

alex_b’s picture

Ha, I actually can't confirm that.

PHP 5.2.6, MySQL 5.0.77 - what are your versions?

saratt’s picture

PHP 5.1.6
MySQL 5.0.45

saratt’s picture

Just now noticed that the character ' ’ ' is not being parsed as well.

alex_b’s picture

st455: I don't have the bandwidth to investigate this issue further, I hope that others can jump in here and report from their PHP versions.

- Specifically, can anybody else with PHP 5.1.x confirm that #5 breaks?
- st455 - can you upgrade to 5.2.x ?

saratt’s picture

alex: Thank you for your time and help. I will try to upgrade and see if it breaks.

alex_b’s picture

Status: Active » Postponed (maintainer needs more info)

Great - thanks.

David Goode’s picture

Status: Postponed (maintainer needs more info) » Closed (fixed)
hejeva’s picture

Status: Closed (fixed) » Needs work

Any suggestions on this issue. I have ran into also
I am seeing parsing not import any text after it encounters ’ and –

illeace’s picture

The problem might be the character encoding of your CSV file. See this related issue: http://drupal.org/node/988620

twistor’s picture

Issue summary: View changes
Status: Needs work » Closed (outdated)