Closed (fixed)
Project:
FeedAPI
Version:
6.x-1.x-dev
Component:
Code feedapi (core module)
Priority:
Normal
Category:
Bug report
Assigned:
Unassigned
Reporter:
Created:
29 Dec 2008 at 08:15 UTC
Updated:
13 Feb 2011 at 18:10 UTC
Jump to comment: Most recent file
Comments
Comment #1
aron novakThis bug is a nightmare, i occasionally kill it and it appear again and again :)
I'm sure you use parser common syndication, don't you?
Please provide me some example feed URLs.
Comment #2
wmostrey commentedThis one has the "Don't want your stinking hook" post: http://arancaytar.ermarian.net/news/technology/web/drupal/feed
This one has the q&a tag in the "The Deal With Nodes" post: http://feeds.learnbythedrop.com/learnbythedrop?format=xml
I'm using the SimplePie parser and no other modules that might do an html replace.
Comment #3
wmostrey commentedComment #4
aron novakI could not reproduce it with the arancaytar.... feed and simplepie. Which version of simplepie.inc do you use? Please double check that you really use the 1.5 version.
Comment #5
wmostrey commentedI'm using FileAPI 6.x-1.5 and simplepie.inc 1.1.1.
Comment #6
likewhoa commentedtry using 1.1.3 or development version of simplepie as it works for me.
Comment #7
wmostrey commentedOK I'm using 1.1.3. I'm marking this as fixed for now. I'll reopen when this behavior pops up again. Thanks!
Comment #8
wmostrey commentedUnfortunately the problem still happens.
Post title:
"Drupal For Education And E-Learning" Book ReviewFeed: http://www.civicactions.com/taxonomy/term/55
Node: http://www.drupaldigest.net/all
Node title:
"Drupal For Education And E-Learning" Book ReviewComment #9
wmostrey commentedComment #10
gsnedders commentedIs FeedAPI assuming the output of SP be text/plain for the title? It should be expected text/html. As far as I can tell, SP is doing what it should.
Comment #11
aron novakWell, i could not reproduce the problem at http://civicactions.com/taxonomy/term/55/0/feed using -dev.
Comment #12
likewhoa commentedtry this feed http://thinkmoult.com/?feed=rss2 that one and other feeds which bring in quotes in node titles fail, this on drupal-6.11 and -dev modules. My issue is not that those quotes are converted to their html entities but that links are broken because they contain quotes which normally should fail. I think this could be an issue with pathauto not converting or removing those characters for you.
Comment #13
dwightaspinwall commented@wmostrey: I have had the exact same problem. I'm using feedapi 6.x-1.6 and simplepie.inc 1.1.3. After several hours digging through the code I finally gave up and implemented a kludge to fix titles on their way into nodes. I put the following snippet in a hook_nodeapi function:
And the title fixing function:
Of course it would be far preferable to fix the code.
Comment #14
alex_b commentedThe feeds contain entity encoded characters, when they're rendered in Drupal and thus run through check_plain() to avoid XSS attacks, they're double encoded. The solution is to run html_entity_decode on feed items before storing them.
This should have been addressed with the introduction of _feedapi_process_text(). Apparently some items are still not being properly decoded. I'd love to see more digging by those affected on
- where exactly there are still HTML encoded characters stored to the database
- with which feed
- and what parser
Please only report on 6.x dev issues.
Comment #15
apennington commentedI'm running into a similar problem.
I've setup news feeds from a UAE newspaper called The National (http://www.thenational.ae/section/rsslist). Many of the RSS feeds look fine on their servers. The feed source has this encoding at the top: <?xml version="1.0" encoding="ISO-8859-1"?>. Our our Drupal database character encoding is UTF-8.
Here is one example. The feed shows two special characters (‘ and ’).
http://www.thenational.ae/article/20090515/NATIONAL/705149825/1010/rss More than 80 per cent of unemployed Emiratis are young people between 15 and 24, according to a report. Fri, 15 May 2009 16:57:00 +0400Job hunt ‘is toughest for the young’
When it is parsed by FeedAPI (via ) the title comes over as
Job hunt is toughest for the young
I'm using FeedAPI as-is with no additional add-ons. The parser the feeds use is Common syndication parser.
Attached is a small screen capture to display what I see onscreen. Hope this information helps you find a solution to the problem. Thanks!
Comment #16
robertdjung commentedsubscribe.
Comment #17
TimG1 commentedI think I have a related problem.
I'm trying to display
<img>tags that are in in a text field of a node created by feedapi and feed mapper. I'm trying to display them in a View and< >are being displayed instead of <>. On the page /admin/settings/feedapi I have the "Allow all HTML tags" checkbox checked.I'm using...
Drupal 6.13
FeedAPI 6.x-1.7-beta3
Feed Element Mapper 6.x-1.0-beta12
SimplePie 1.2 from (www.simplepie.org)
Am I overlooking something obvious?
Thanks!
-Tim
Comment #18
ben610 commentedHaving a similar issue with FeedAPI turning apostrophe's into HTML special chars in the node title.
Using:
Drupal 6.13
FeedAPI 6.x-1.8
Common syndication parser 6.x-1.7
FeedAPI Node 6.x-1.7
Some examples:
http://eyebeam.org/reblog/09-08-13/how-big-is-it-how-big-you-want-it-awe...
http://eyebeam.org/reblog/09-08-13/bbc-news-europe-dutchman-builds-moder...
Help?
Thanks.
Comment #19
TimG1 commentedI solved my problem by placing html_entity_decode() around all the variables in my node/view template files when displaying them. This is on the site that has the mapped content.
Ben 610, try doing html_entity_decode($title) in your node.tpl.php
-Tim
Comment #20
aron novakhttp://newsrss.bbc.co.uk/rss/newsonline_world_edition/europe/rss.xml - I tried that feed URL w/ common syndication parser and the apostrophes just appear normal. Is this the feed url what you use?
Comment #21
alex_b commentedRelated: #61456: Aggregator titles display quotes and other characters with HTML entity equivalents badly (write tests)
Comment #22
bcobin commentedThank you - thank you, TimG. #19 solved the problem I was having with imported tags brilliantly.
Comment #23
wmostrey commentedFixed with the comment in #19.
Comment #25
iceous commentedI have a similar but different character parse errors.
char '=' and '&' are rendered into '%3D' and '%26'.
I have no clue where to begin...
please help....
Thanks,
Joko