Weird start, but seen as valid by W3C!
AlexisWilke - January 18, 2009 - 19:42
| Project: | Aggregation |
| Version: | 6.x-1.5 |
| Component: | Miscellaneous |
| Category: | bug report |
| Priority: | minor |
| Assigned: | Unassigned |
| Status: | active |
Description
Hi guys,
I checked out this feed:
http://www.articlesnatch.com/rss2/computers-and-technology
trying to get it into my http://linux.m2osw.com site, but the aggregator fails.
So I looked at the content and it starts like this:
407c
<?xml version="1.0" encoding="utf-8"?>
...
The declared encoding is a lie, since the content actually uses ISO-8859-1. But you fail on the 407c (which, if you ask me, is normal!). What surprised me is that http://validator.w3.org/feed/ says the feed is valid except for a few "bad characters".
So in other words it sounds like you should skip the 407c and accept the file as legal XML anyway.
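To make the suggestion concrete, here is a minimal sketch in Python (not the module's actual code, which is PHP) of what "skip the 407c" could look like. It drops whatever precedes the XML declaration, then decodes as UTF-8 with a fallback to ISO-8859-1 when the bytes do not match the declared encoding. The function names and the sample feed body are hypothetical, and the sketch only handles junk at the very start of the body.

```python
import xml.etree.ElementTree as ET


def strip_leading_junk(raw: bytes) -> bytes:
    """Drop any bytes that precede the XML declaration (or the first tag)."""
    start = raw.find(b"<?xml")
    if start == -1:
        start = raw.find(b"<")  # no declaration: fall back to the first tag
    if start == -1:
        raise ValueError("no XML markup found in feed body")
    return raw[start:]


def decode_feed(raw: bytes) -> str:
    """Decode as UTF-8; fall back to ISO-8859-1 if the bytes are not valid UTF-8."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return raw.decode("iso-8859-1")


def parse_feed(raw: bytes) -> ET.Element:
    """Clean up the raw body, then hand it to the XML parser as text."""
    return ET.fromstring(decode_feed(strip_leading_junk(raw)))


# Hypothetical feed body, modelled on the start of the feed quoted above:
# a stray "407c" line, a UTF-8 declaration, and one ISO-8859-1 byte (\xe9).
sample = (
    b"407c\r\n"
    b'<?xml version="1.0" encoding="utf-8"?>\n'
    b'<rss version="2.0"><channel><title>Caf\xe9</title></channel></rss>'
)
root = parse_feed(sample)
print(root.tag, root.find("channel/title").text)
```

Decoding to text before parsing is a deliberate choice here: the parser then works from the decoded characters, so the wrong encoding label in the declaration does not trip it up.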
What do you think?
Thank you.
Alexis Wilke
