I've noticed that basically all nodes that allow comments have two URLs:

site.com/node/45 and
site.com/comment/reply/45

that show identical content and both get indexed by search engines (which is no good).
This doesn't make sense to me. Is there a reason for it?

Comments

johnhanley

The first URL points to the node and any associated comments. The second is the comment form itself. They're related by the node number, but serve two different functions.

Sorry if this is a simplistic explanation, but it's not much more complicated than that.

kkobashi

To avoid duplicate content penalties in this case, you can add the following to your robots.txt file:

User-agent: *
Disallow: /comment/
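
To sanity-check a rule like this before deploying it, here is a minimal Python sketch using the standard library's urllib.robotparser. It assumes the two example URLs from the original question; the helper name is_crawlable is made up for illustration and is not part of Drupal.

```python
from urllib.robotparser import RobotFileParser

# The rules suggested above, as they would appear in robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /comment/
"""


def is_crawlable(url: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `url` under ROBOTS_TXT.

    Hypothetical helper for illustration only.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)


# The node page stays crawlable; the comment form URL does not.
print(is_crawlable("http://site.com/node/45"))           # True
print(is_crawlable("http://site.com/comment/reply/45"))  # False
```

The Disallow value is matched as a path prefix, so everything under /comment/ is covered by the one rule.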

If you find more, let me know. I'm very interested in this and would like to see what you come up with.

Kerry Kobashi

pixelpope

Hey Kerry, seems like you understand my point.
Blocking the comment links from being indexed via robots.txt is a good idea and quick to do.
Still, I'll see if there's a way to completely remove/change that link.

It's even more absurd that, once a node has one or more comments, the "add comment" link changes to point at the real URL of that page. So why should it be any different before that?

bradlis7

One fix would be to add rel="nofollow" to the "add comment" link, e.g.:

<a href="/comment/reply/45" rel="nofollow">add comment</a>
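
To check whether a rendered page already marks its comment links this way, a rough Python sketch with the standard library's html.parser could look like the following. The class and function names are hypothetical, and the /comment/reply/ prefix is taken from the URLs discussed in this thread.

```python
from html.parser import HTMLParser


class CommentLinkAudit(HTMLParser):
    """Collect /comment/reply/ links that lack rel="nofollow" (illustrative only)."""

    def __init__(self):
        super().__init__()
        self.missing = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attr = dict(attrs)
        href = attr.get("href") or ""
        # rel may hold several space-separated tokens, e.g. "nofollow noopener".
        rel_tokens = (attr.get("rel") or "").lower().split()
        if href.startswith("/comment/reply/") and "nofollow" not in rel_tokens:
            self.missing.append(href)


def links_missing_nofollow(html: str) -> list:
    """Return the hrefs of comment-reply links without rel="nofollow"."""
    audit = CommentLinkAudit()
    audit.feed(html)
    audit.close()
    return audit.missing


page = (
    '<a href="/comment/reply/45" rel="nofollow">add comment</a>'
    '<a href="/comment/reply/46">add comment</a>'
)
print(links_missing_nofollow(page))  # ['/comment/reply/46']
```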

I submitted a bug report: http://drupal.org/node/48539
