Enhancement request to search404 providing ability to set meta noindex tag to "noindex, nofollow"

earthandeconomy - July 25, 2007 - 16:16
Project:Search 404
Version:5.x-1.x-dev
Component:Code
Category:feature request
Priority:normal
Assigned:zyxware
Status:closed
Description

Currently, we use Google Webmaster Tools to manage URL removals and Google's policy for removal states:

To remove content from the Google index, do one of the following:

Ensure requests for the page return an HTTP status code of either 404 or 410.
Block the page using a robots.txt file.
Block the page using a meta noindex tag.

http://www.google.com/webmasters/sitemaps/

The issue with search 404 is that it does not return a 404 message nor does it contain a meta noindex tag set to "noindex, nofollow" and defining all the pages in the robots.txt for removal would be tedious.

Currently, a search for a page will redirect Google's bot to the search404 results and return a successful search results page (with no 404 error) with a meta noindex tag set to index, follow. The web page removal is then denied.

If the search404 page could substitute the existing meta noindex tag with:

Then Google's bot should obey the rules and mark the page for removal. This functionality would need to be tested.

If there is SEO benefit to indexing search404 pages then consider making the meta noindex tag editable so users can specify which tags they would like to set. Maybe it is possible to integrate search404 module with node words module (aka Meta tags).

http://drupal.org/project/nodewords

Hope this makes logical sense,

E&E

#1

earthandeconomy - July 25, 2007 - 16:20

Forgot to add code tags around meta tags. The following should read:

If the search404 page could substitute the existing meta noindex tag with:

<meta name="robots" content="noindex,nofollow" />

#2

forngren - August 5, 2007 - 11:58
Version:5.x-1.1» 5.x-1.x-dev

My apologies for the delay.

This has been bugging for a while, but I can't come up with any solution that doesn't increase the page load on every page (i.e. hook_init). I suppose you use the "jump"-feature, 'cause the search results page should generate a proper 404 header. One temporary solution could be to disable jumping until the pages are removed from the index.

The root to the problem is that 404 is not a valid redirecting code. Right now it passes a 302/Temporary redirect instead of the 404. http://api.drupal.org/api/function/drupal_goto/5

#3

wwwoliondorcom - December 27, 2007 - 20:24

Hello,

Google bot and other search engines boats still visit hundreds of pages that have been created by a translation script even though I disabled the script and deleted the translated pages on the server,

do you think it is because of search 404 that the bots still visits the translated pages, causing high load on the server ?

I do not use the jump feature.

Thanks.

#4

wwwoliondorcom - January 6, 2008 - 06:42

Any help ? I still have the same problem. Thanks.

#5

zyxware - April 1, 2008 - 19:07
Assigned to:Anonymous» zyxware
Status:active» fixed

This feature seems to be working fine and search404 is returning a 404 error it is going to be assumed that this problem has been fixed. Also please upgrade to the latest version of the module. Regarding the meta noindex nofollow - since the 404 error code is returned there is no need for meta tags.

#6

Anonymous (not verified) - April 15, 2008 - 19:11
Status:fixed» closed

Automatically closed -- issue fixed for two weeks with no activity.

 
 

Drupal is a registered trademark of Dries Buytaert.