Redirect url with a bad/spammed query string?

jpfle - February 28, 2009 - 19:22
Project:Global Redirect
Version:6.x-1.x-dev
Component:Code
Category:feature request
Priority:normal
Assigned:Unassigned
Status:postponed
Description

Hi.

I would like to know if it would be possible to redirect an url that contains a bad GET page variable. For example, say we have a node:

mysite.com/a-test

If this page doesn't have pagination and we add a GET page variable:

mysite.com/a-test?page=4

There's no redirection 404 to mysite.com/a-test

Is it possible to know if a page has a pagination, and if not or if the GET page value isn't valid, doing a 404 redirection?

Thanks.

#1

nicholasThompson - March 13, 2009 - 10:00
Title:Redirect url with a bad GET page variable?» Redirect url with a bad/spammed query string?
Version:6.x-1.2» 6.x-1.x-dev
Category:support request» feature request
Status:active» postponed

Technically this is possible.

The problem is "how do you define a bad entry in the query string". Maybe a module on the page requires it? It could be anything...

#2

giorgio79 - April 15, 2009 - 04:01

Good idea!

I just noticed in Google Webmasters one of my simple node pages was reported for duplicate title tage, when I checked it was like this

mysite.com/mynode
mysite.com/mynode?page=1
mysite.com/mynode?page=2
mysite.com/mynode?page=1205

Weird. No clue how Google picked that up as those page variables do not exist, it is just simply "mynode"

#3

kiamlaluno - April 15, 2009 - 04:06

Google Webmaster Tools will always report those pages like duplicated, whenever the passed query string is used or not.

The only solution to that problem is to add a meta tag to those pages.

#4

giorgio79 - April 15, 2009 - 05:34

Thanks Kiam, I think I understand but the problem is that those ?page=xxx dont exist!

I have no idea how Google picked those up, as they all show the same page!
mysite.com/mynode?page=1
mysite.com/mynode?page=2
mysite.com/mynode?page=1205

is the equivalent of mysite.com/mynode

This is not a views page with paging, it is a simple node page :)

#5

kiamlaluno - April 15, 2009 - 09:37

That is really oddy. Google should pick up links used in Drupal nodes, not attach random strings to the URLs.

 
 

Drupal is a registered trademark of Dries Buytaert.