Posted by mikeytown2 on December 9, 2008 at 10:05am
| Project: | Views |
| Version: | 6.x-2.1 |
| Component: | exposed filters |
| Category: | support request |
| Priority: | normal |
| Assigned: | Unassigned |
| Status: | closed (fixed) |
Issue Summary
I'm fairly certain that Google will reindex the same page once for every value in a drop-down list if you have one as an exposed filter. I'm suggesting that
# URL Variables
disallow: /*list
disallow: /*of
disallow: /*drop
disallow: /*down
disallow: /*url
disallow: /*variables
be added to robots.txt, or at least that a warning be added somewhere so web admins know how to handle the issue. Spiders might try other values in other URL variables as well? I'm guessing at the robots.txt syntax; is it correct?
Similar yet different issue
#280281: Views pager duplicates content
Comments
#1
Or would you do something like this:
Disallow: /path/of/view?
Allow: /path/of/view
Allow: /path/of/view?page=*
Or this:
Disallow: /*?
Allow: /*?page=
using clean URLs.
The option of having views insert a "noindex" meta tag might be the answer.
EDIT:
http://www.google.com/support/webmasters/bin/answer.py?answer=76329&hl=en
#2
Actually now that I think about it, I'm disabling anything that makes a url variable until we get robots.txt figured out. Duplicate content is a killer.
?page= is fine, everything else isn't.
#3
https://www.google.com/webmasters/tools/dashboard
Once I get some feedback on this, I'll add it to the handbook page
This is what I came up with
# URL Variables
Disallow: /*?
Allow: /*?page=
Disallow: /*?page=*&*
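To check how these three rules interact, here is a minimal Python sketch of a wildcard matcher. It assumes the precedence Google documents for its own crawler (the longest matching rule wins, and Allow wins a tie); other crawlers may resolve conflicts differently. The names rule_to_regex and is_allowed are made up for this illustration and are not part of Views or Drupal.

```python
import re

# The rule set proposed above, as (directive, path pattern) pairs.
RULES = [
    ("Disallow", "/*?"),
    ("Allow", "/*?page="),
    ("Disallow", "/*?page=*&*"),
]


def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$' the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if `path` (path plus query string) may be crawled.

    Assumption: longest matching pattern wins, Allow wins a tie,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for directive, pattern in rules:
        if rule_to_regex(pattern).match(path):
            allow = directive.lower() == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow


for path in ["/view", "/?page=1", "/?page=0", "/?page=1&sort=asc", "/view?sort=asc"]:
    print(path, is_allowed(RULES, path))
```

Under those assumptions, /view, /?page=1 and /?page=0 come out as crawlable, while /?page=1&sort=asc and /view?sort=asc are blocked. Google's webmaster tools remain the authoritative check for how Googlebot actually reads a given robots.txt.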
#4
Here's the handbook page...
SEO (Search Engine Optimization) Guidelines When Using Views
#5
I have a question.
If I use
Disallow: /*?
Allow: /*?page=
Disallow: /*?page=*&*
http://www.mysite.com/?page=1 is ok
http://www.mysite.com/?page=2 is ok
but what about
http://www.mysite.com/?page=0
It is the same as http://www.mysite.com/, so you are effectively duplicating the front page.
Is there a way to eliminate this problem?
#6
I updated the handbook page
http://drupal.org/node/345620
Another way is to not link to page=0, or to 301-redirect it.
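As a rough illustration of the 301 option, the sketch below works out where a request carrying page=0 should be redirected. It is plain Python rather than Drupal code, and the function name redirect_target is hypothetical; a real site would do the equivalent in its web server configuration or in a module.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def redirect_target(url):
    """Return the URL to 301 to, or None if no redirect is needed.

    Only the first-page marker (page=0) is dropped; every other
    query parameter is kept in its original order.
    """
    parts = urlsplit(url)
    query = parse_qsl(parts.query, keep_blank_values=True)
    kept = [(key, value) for key, value in query
            if not (key == "page" and value == "0")]
    if len(kept) == len(query):
        return None  # nothing to strip, serve the page as usual
    return urlunsplit(parts._replace(query=urlencode(kept)))


print(redirect_target("http://www.mysite.com/?page=0"))
print(redirect_target("http://www.mysite.com/?page=1"))
```

Here http://www.mysite.com/?page=0 maps to http://www.mysite.com/, and http://www.mysite.com/?page=1 needs no redirect.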
#7
How is this a bug? Views has no control over the robots.txt. Looks like you guys have it sorted, anyhow.
#8
Automatically closed -- issue fixed for 2 weeks with no activity.