Issue with character encoding.

Gidgidonihah - May 13, 2008 - 18:54
Project:Aggregation
Version:5.x-4.3
Component:Code
Category:support request
Priority:normal
Assigned:Unassigned
Status:postponed (maintainer needs more info)
Description

Maybe I've missed this somewhere, but I'm having a problem with character encoding. The atom feed I'm pulling from is encoded in utf-8 and uses some characters that should probably be html entities.
For example this phrase has been used:

industry’s “Brightest and Most Talented 35-and-Under Artisans.”

which contains the characters, not the entities for:

’ “ and ”

When this item is aggregated and published, it appears as:

industry’s “Brightest and Most Talented 35-and-Under Artisans.”Â

#1

sinmao - May 14, 2008 - 16:09

I have the same problem with rss feeds. Would appreciate a fix.

#2

mistknight - September 19, 2008 - 12:14
Status:active» postponed (maintainer needs more info)

I'm a little surprised they're being processed because they're not valid XML. Providing a URL would be helpful.

#3

sinmao - September 19, 2008 - 17:16

I see it a lot from feeds from Google News or Aint It Cool News. I ended up using php to convert the characters from the titles in the node template back to html entities.

#4

carvalhar - February 17, 2009 - 17:58

can't delete?

 
 

Drupal is a registered trademark of Dries Buytaert.