My site, www.forniture-alberghiere.net, is growing slowly, but its XML sitemap is growing faster than the real number of pages. I suspect it has generated more than 5000 URLs that do not exist; they are attached here.
What can I do? When I was using xmlsitemap 6.x-1 the site was at PR4; a few weeks after moving to 6.x-2 it has dropped to PR3. Would it be better to downgrade to 6.x-1?

[Edited by KiamLaLuno]

Comments

avpaderno

Title: xmlsitemap is generating false urls? » Could xmlsitemap generate false URLs?
dave reid

Well it looks like you're using the 6.x-2.x version now, and I don't see any of these bad links in your site's current sitemap.xml (http://www.forniture-alberghiere.net/sitemap.xml). So my guess would be either these links were in the 6.x-1.x version sitemap, or you have bad internal links leading to 404s on your own site (most likely).
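
A check like the one described above could be scripted roughly as follows. This is only a sketch in Python, not part of the XML sitemap module: the function names and the sample document are hypothetical, and the real file would first have to be downloaded from /sitemap.xml. It lists the `<loc>` entries of a sitemap and reports which suspect URLs are actually present in it.

```python
import xml.etree.ElementTree as ET

# Namespace used by <urlset> documents in the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(xml_text):
    """Return every <loc> value found in a sitemap document."""
    root = ET.fromstring(xml_text)
    return [
        loc.text.strip()
        for loc in root.findall(".//sm:loc", SITEMAP_NS)
        if loc.text
    ]


def suspects_in_sitemap(xml_text, suspects):
    """Return the suspect URLs that are actually listed in the sitemap."""
    listed = set(sitemap_urls(xml_text))
    return [url for url in suspects if url in listed]


# Hypothetical sample content, made up for illustration only.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.forniture-alberghiere.net/</loc></url>
  <url><loc>http://www.forniture-alberghiere.net/node/1</loc></url>
</urlset>"""

if __name__ == "__main__":
    bad = ["http://www.forniture-alberghiere.net/ar/taxonomy/term/1309/0"]
    print(suspects_in_sitemap(SAMPLE, bad))
```

If the suspect URLs do not show up in the current sitemap, they are more likely coming from internal links or from an older sitemap, as suggested above.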

matteoraggi

The strange thing is that the number of these links keeps growing week after week.

avpaderno

I checked some of the links reported in the sitemap, and all of them lead to existing pages. I am not sure what you mean by false URLs, as the module cannot invent URLs that don't exist on your site.

Froggie-2

@matteoraggi
Are you using the Pathauto module? I do not think that the XML sitemap 6.x-2.0 version is responsible for generating false URLs (URLs that lead to "Page Not Found" errors).
On my site, thousands of false URLs were generated after I started using the Pathauto module.
I hope this information is of some help to you.

matteoraggi

Yes, I'm using Pathauto. I checked the issue queue of that module and found nothing about this, so I opened a new issue: http://drupal.org/node/622916

avpaderno

Status: Active » Fixed

In general, XML sitemap uses the path aliases set from other modules; it doesn't create new path aliases. That is true for both the Drupal 6 branches.

Thanks to Froggie for making clear what a false URL should be (at least it was not clear to me).

As the question has gotten an answer, I am marking this support request as fixed.

As a side note, if more than one language is enabled on the site, pathauto.module probably has problems creating path aliases when an alias already exists for another language. At least that was one of the problems with the module, which was really caused by the Drupal core module that handles path aliases.

matteoraggi

99% of my wrong URLs look like this:
http://www.forniture-alberghiere.net/ar/taxonomy/term/1309/0
where only the language code (ar) and the numbers change, and all the rest is always the same text.
At admin/build/domain/taxonomy I found this pattern
taxonomy/term/%t
in the Term link patterns,
but
1) I don't know what to do in this case
2) I don't understand whether it comes from the Domain Access module or from the taxonomy domain module
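
The URL shape described above can be picked out of a longer list with a regular expression. This is a minimal sketch in Python: it assumes the language prefix is always two lowercase letters and the path always ends in /0, which matches the single example given but may not cover every case. The names `SUSPECT_PATH` and `match_suspect` are hypothetical.

```python
import re
from urllib.parse import urlparse

# Assumed shape: /<two-letter language>/taxonomy/term/<term id>/0
SUSPECT_PATH = re.compile(r"^/(?P<lang>[a-z]{2})/taxonomy/term/(?P<tid>\d+)/0$")


def match_suspect(url):
    """Return (language, term id) if the URL has the suspect shape, else None."""
    m = SUSPECT_PATH.match(urlparse(url).path)
    if m is None:
        return None
    return m.group("lang"), int(m.group("tid"))


if __name__ == "__main__":
    print(match_suspect("http://www.forniture-alberghiere.net/ar/taxonomy/term/1309/0"))
    print(match_suspect("http://www.forniture-alberghiere.net/sitemap.xml"))
```

Grouping the matches by language code or term ID may help show which module setting produces them.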

matteoraggi

This was an old error that has already been patched in some other module. The sitemap.xml is now OK, and I have worked around the old URLs by disallowing them in robots.txt. We can consider this issue closed.
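
The exact robots.txt rules are not given in this thread. As an illustration only, a rule of the following kind could be checked with Python's standard `urllib.robotparser` before relying on it. The `Disallow` line is an assumption, not the rule actually used on the site, and `is_blocked` is a hypothetical helper. `urllib.robotparser` does plain prefix matching, so this sketch uses one rule for one language prefix.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules; the real robots.txt of the site is not shown in the thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /ar/taxonomy/term/
"""


def is_blocked(url, robots_txt=ROBOTS_TXT, agent="*"):
    """Return True if the given rules forbid this user agent from fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(agent, url)


if __name__ == "__main__":
    print(is_blocked("http://www.forniture-alberghiere.net/ar/taxonomy/term/1309/0"))
    print(is_blocked("http://www.forniture-alberghiere.net/sitemap.xml"))
```

A prefix rule this broad would also block any legitimate term pages under the same path, so a narrower rule may be preferable.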

Status: Fixed » Closed (fixed)

Automatically closed -- issue fixed for 2 weeks with no activity.