Issue with character encoding.
Gidgidonihah - May 13, 2008 - 18:54
| Project: | Aggregation |
| Version: | 5.x-4.3 |
| Component: | Code |
| Category: | support request |
| Priority: | normal |
| Assigned: | Unassigned |
| Status: | postponed (maintainer needs more info) |
Jump to:
Description
Maybe I've missed this somewhere, but I'm having a problem with character encoding. The atom feed I'm pulling from is encoded in utf-8 and uses some characters that should probably be html entities.
For example this phrase has been used:
industry’s “Brightest and Most Talented 35-and-Under Artisans.”
which contains the characters, not the entities for:
’ “ and ”When this item is aggregated and published, it appears as:
industryâs âBrightest and Most Talented 35-and-Under Artisans.âÂ

#1
I have the same problem with rss feeds. Would appreciate a fix.
#2
I'm a little surprised they're being processed because they're not valid XML. Providing a URL would be helpful.
#3
I see it a lot from feeds from Google News or Aint It Cool News. I ended up using php to convert the characters from the titles in the node template back to html entities.
#4
can't delete?