Closed (duplicate)
Project:
Drupal core
Version:
7.x-dev
Component:
base system
Priority:
Normal
Category:
Feature request
Assigned:
Unassigned
Reporter:
Created:
16 Dec 2008 at 01:32 UTC
Updated:
17 Jul 2012 at 04:35 UTC
Find attached patches against latest D6 and D7 checkouts and give your opinion ;)
| Comment | File | Size | Author |
|---|---|---|---|
| robots-txt-wildcard-paths-for-multilanguage-d6.patch | 852 bytes | eMPee584 | |
| robots-txt-wildcard-paths-for-multilanguage-d7.patch | 860 bytes | eMPee584 |
Comments
Comment #1
catchMarking as duplicate of
#180379: Fix path matching in robots.txt which also tries to reduce duplicate content.
Comment #2
eMPee584 commentedWell but that one was outdated and didn't apply while so i thought maybe if i open a clean issue and attach a clean patch that actually can be applied now we could go from there? The other issue is mixing several things which also were still in discussion, while this patch just duplicates the existing exclusion paths to apply for multi-language sites.. that would save a lot of sites a lot of unnecessary server load, for free, now!
Comment #3
owen barton commentedThis is invalid syntax - from http://www.robotstxt.org/robotstxt.html:
Unless we are going to add every possible language, or make robots.txt autogenerated by Drupal I don't think there is any technical solution to this. I guess the best approach is to update the documentation to explain to people how to add these themselves.
Comment #4
owen barton commentedOK, reading the "fixing" issue I guess "*" is pretty commonly accepted.
Comment #5
sun.core commentedMarking as duplicate of #180379: Fix path matching in robots.txt. You can follow up on that issue to track its status instead. If any information from this issue is missing in the other issue, please make sure you provide it over there.
However, thanks for taking the time to report this issue.
Comment #6
eMPee584 commentedOk, but imho this easily could and should have been committed to the D6 branch long ago, to fix hammering of all the i18n sites in the wild...anyways, at least not a problem anymore for my site.
Comment #7
j0rd commentedSame problem. There appears to be very little discussion about the limitation of Drupal's default robots.txt when it comes to multi-language sites.
For others who have this problem and find this issue on Google, here's the best Drupal 6 robots.txt I've found.
#1317338: Improvements to the core file robots.txt in Drupal
https://wiki.koumbit.net/DrupalRobots