How to change http://ipkfks.ru/content/%D0%BE-%D0%BD%D0%B0%D1%81 to http://ipkfks.ru/content/о-нас ?
my sitemap http://ipkfks.ru/sitemap.xml
------------
Drupal 7.14
Pathauto 1.1

Comments

Anonymous’s picture

Component: Translations » Other
Category: bug » support
Status: Active » Postponed (maintainer needs more info)

The links we store are given us by the Drupal core module. Do you have any i18n modules installed and enabled? This isn't a Translation component, that has to do with translating the strings of this module, and not the content or links on your site.

keha3912’s picture

I installed and enabled: i18n, variable, XML sitemap internationalization
applied /admin/config/search/xmlsitemap/rebuild
clean cache admin/config/development/performance
update cached files /admin/config/search/xmlsitemap
result: page not found http://ipkfks.ru/sitemap.xml

keha3912’s picture

I changed "english" to "russian" in admin/config/search/xml sitemap - http://ipkfks.ru/sitemap.xml return
but the Cyrillic Letters in the URL is not

Anonymous’s picture

Assigned: keha3912 » Unassigned

The links work correctly. I think the problem for you is a presentational one for the xsl. I'll try to do some research later this week to see if I can find some related material.

keha3912’s picture

any news?

Anonymous’s picture

Sorry, no, I didn't find any time for it. Other projects kept me busy. It shouldn't bother the search engines, it is just the user representation that is bonkers. Ping me again after about 72 hours if I haven't responded.

Anonymous’s picture

Ok, let's try the following. In the xmlsitemap module directory is a file named xsl/xmlsitemap.xsl. In the file is a line 30 that reads

  <xsl:output method="html" version="1.0" encoding="utf-8" indent="yes"/>

change the encoding="utf-8" to read encoding="unicode".

Will it help? I don't know but I do know the Russian character set is specified in UNICODE. I wouldn't change any other utf-8 value though, all of Drupal is UTF-8.

Dave Reid’s picture

Status: Postponed (maintainer needs more info) » Closed (works as designed)

You cannot have "nice" looking URLS in sitemaps. They *must* be URL-encoded for special characters. This works as designed.

Anonymous’s picture

Status: Closed (works as designed) » Postponed (maintainer needs more info)

Yes, Dave, as far as the sitemap.xml file goes, it is correct. It is the presentation of the URL to the user via the xsl that needs adjusted. Let's leave it open until keha3912 has a chance to give us feed back on my suggested change. My suggested change doesn't change the output of sitemap.xml as the search engine sees it; it only changes the presentation in the browser.

keha3912’s picture

change the encoding="utf-8" to read encoding="unicode" not helped :(

PS Dave Reid - I know, it's work's browser, but nothing changed ("воз и ныне там" - russian proverb)

Anonymous’s picture

Title: Cyrillic Letters in URL » URL decode the URI display string via xmlsitemap.xsl
Category: support » feature
Status: Postponed (maintainer needs more info) » Postponed

Hmm... Yea, that won't help. We could extend our xml stylesheet with the str:decode-uri documented at http://www.exslt.org/str/index.html and decode the display string. I'm modifying this to a feature request and changing the title to match.

keha3912’s picture

str:decode-uri? - it's awesome idea! I'm subscribe for this feature.

adams.garfield’s picture

yea this is a great idea . Thanks a lot

m.schwarzenberg’s picture

I don't know if the issue is fixed now with solution #12. If not - can I do it by myself? - or when it will be implemented?

Anonymous’s picture

Re: #14

You're welcome to provide a patch for it.