Lack of support for extended punctuation ‘ ’
modulist - July 23, 2009 - 23:12
| Project: | Pathauto |
| Version: | 6.x-2.x-dev |
| Component: | I18n stuff |
| Category: | bug report |
| Priority: | normal |
| Assigned: | Unassigned |
| Status: | active |
Jump to:
Description
In the Punctuation Settings menu of Automated Alias settings, there is no way to keep curly quotes and other characters form appearing in the automated URLs.
The short term fix is to have pathauto ignore characters like [’] The long term fix would be have a field that lists characters to exclude from URLs. Thecontent of the text field could then be edited by an advanced user.
In any case, this is a problem for supporting proper typography in publishing environments, where quotes are frequently used within node titles.

#1
In the 6.x-2.x version you can use the transliteration module to handle advanced bits of text like this.
#2
Automatically closed -- issue fixed for 2 weeks with no activity.
#3
I'm using the 6.x-2.x-dev version, with transliteration enabled, and still encountering this problem.
#4
Can you describe your transliteration settings?
#5
I'm guessing you mean within Automated Alias Settings ...
I didn't change any of the punctuation changes after installing it. The checkbox "Transliterate prior to creating alias" is checked. None of the path settings were changed from their defaults, so the Node default pattern is still "content/[title-raw]".
If there is some other place to report on, let me know.
By the way, the site I encountered this on is in EN and ZH-HANS.
#6
I mean that you have to configure the transliteration module. What are your transliteration settings.
#7
I don't think there are any settings for the transliteration module. No option appears on any admin menu that I can find, and according to the module's readme file:
-- CONFIGURATION --
This module has no settings that can be customized.
By the way -- I encountered this punctuation issue on an English article.
#8
#9
I turned on "Reduce strings to letters and numbers from ASCII-96" to have it remove those characters from the URL but now I get URLs with "don-t" or "isn-t" because ’ is being replaced with a dash. Ideally those characters should be in the punctuation list. This module doesn't seem to account for any "fancy" punctuation characters. Just the standard ones.
#10
Looks like the transliterate feature is what is suppose to handle them but you need to add the punctuation in yourself. FYI I'm using 6.x-1.x version.