Closed (fixed)
Project:
Link checker
Version:
7.x-1.x-dev
Component:
Code
Priority:
Normal
Category:
Bug report
Assigned:
Unassigned
Reporter:
Created:
6 May 2011 at 20:14 UTC
Updated:
7 Feb 2017 at 16:44 UTC
Jump to comment: Most recent
Broken links with space are not recognized, i.e. <a href="/one two">Link</a> or <a href="one two">Link</a>.
This applies to the dev-version as well.
Other than that, nice module.
Comments
Comment #1
pawel_r commented+1
Comment #2
hass commentedIs one of You guys able to fix the regexes in
_linkchecker_extract_links()and provide patch, please? This should be a D6 bug only.Comment #3
hass commentedFunction
valid_url()returns FALSE for links with spaces... http://www.example.com/foo%20bar/foo works. Not sure how we can fix this.Comment #4
hass commenteddrupal_encode_path() rawencode double points, ankers, question marks and others. url() does not work, too.
This may work, but I expect more than only spaces and side effects... damn hacks... I hate them. Current design may be wrong.
Line 1356 ff:
Comment #5
hass commentedFunny, drupal_http_request() can request URLs with spaces...
Comment #6
hass commentedComment #7
hass commentedDecided to go with the hack for now.
Comment #8
hass commentedComment #10
drupal_jon commentedHi,
Not sure whether to reopen this issue, or refer to the similar D6 issue #1525146: Space in filename triggers 404. But although links with spaces in them are added to the report now, the linkcheck itself seems to return an incorrect 404 if the link contains a space.
Comment #11
hass commentedComment #12
rob230 commentedStill experiencing the problem reported by drupal_jon: files with a space in them incorrectly appear in the link report as 404, even though the link works. You can click on one of the 404 links in the link report and it successfully loads the page.
Comment #13
rob230 commentedActually, this is a content problem. Drupal's valid_url() function correctly returns FALSE if the URL has a space in it. The URLs in the content should use %20 instead of a space, even though most browsers handle a space without issue.