Missing character in glossary menu [#327699]

Comment	File	Size	Author
	Picture 1.png	36.69 KB	snowball43

Comment #1

merlinofchaos commented 29 October 2008 at 17:37

I believe that the collation on the database makes the A and Å appear to be the same character to SQL, even though they are not the same character to PHP. I'm not actually sure how to deal with issues like this.
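
To illustrate the mismatch described above, here is a minimal Python sketch. It does not reproduce MySQL's collation rules. The `fold` helper is a hypothetical stand-in that strips accents and case, which is roughly what an accent-insensitive, case-insensitive collation does when it compares characters.

```python
import unicodedata


def fold(text: str) -> str:
    """Hypothetical stand-in for an accent- and case-insensitive collation key."""
    # Decompose each character into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop the combining marks so that only the base letters remain.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


# As plain strings (the application's view) the two letters differ.
print("A" == "\u00c5")
# Under the folded key (the collation's view) they compare equal.
print(fold("A") == fold("\u00c5"))
```

Any code that groups or compares on the folded key will therefore treat A and Å as one letter, even though the application sees two.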

Comment #2

snowball43 commented 29 October 2008 at 18:01

So the glossary mode is generated via SQL?

Comment #3

merlinofchaos commented 29 October 2008 at 18:12

Yes, it uses a SQL SUBSTR command on the field, which I assume is the node.title field in this case.

Comment #4

karens commented 29 October 2008 at 18:34

This is just a shot in the dark, totally untested, but I ran into an issue like this on translated dates and it was the LENGTH that was the problem, not the SUBSTR. I took a quick look at the MySQL manual and it looks like SUBSTR is multi-byte safe but LENGTH is not: LENGTH counts bytes, while CHAR_LENGTH counts characters. You may be able to swap CHAR_LENGTH into the SQL instead to see if that fixes it.
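
The byte-versus-character difference can be sketched in Python, assuming the text is stored as UTF-8. The helper names `byte_length` and `char_length` are made up for this illustration and only mirror what the two SQL functions measure.

```python
def byte_length(text: str) -> int:
    """Number of bytes in the UTF-8 encoding (what a byte-based length reports)."""
    return len(text.encode("utf-8"))


def char_length(text: str) -> int:
    """Number of characters (what a character-based length reports)."""
    return len(text)


title = "\u00c5lesund"  # starts with the letter Å, which takes two bytes in UTF-8
print(byte_length(title), char_length(title))
```

The two counts differ for any title containing a multi-byte character, so code that mixes byte lengths with character offsets can end up off by one or more.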

Comment #5

snowball43 commented 29 October 2008 at 23:02

The problem appears to be that the GROUP BY doesn't recognize the difference between the characters. So we could group by something more specific, such as ORD.

I've tested this a little by copying the query shown in the preview section of the edit view screen, tweaking it, and running it from Navicat. I added a field to the SELECT that applies ORD to the field being grouped, and changed the GROUP BY value to the name given to that ORD field.

Note: I'm using MySQL, and I'm not sure if ORD is a globally acceptable SQL function.
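
The effect of grouping on a character code rather than on the collated letter can be sketched in Python. This is an analogy only: Python's `ord` returns the Unicode code point, which is not necessarily the number MySQL's ORD returns, and the `fold` helper and sample titles are hypothetical.

```python
import unicodedata
from collections import Counter


def fold(ch: str) -> str:
    """Hypothetical accent- and case-insensitive key for a single letter."""
    decomposed = unicodedata.normalize("NFD", ch)
    return "".join(c for c in decomposed if not unicodedata.combining(c)).casefold()


def glossary_by_collation(titles):
    """Count titles per first letter, merging letters the folded key treats as equal."""
    return Counter(fold(t[0]) for t in titles)


def glossary_by_code_point(titles):
    """Count titles per first letter, keyed on the numeric code of that letter."""
    return Counter(ord(t[0]) for t in titles)


titles = ["Apple", "Avocado", "\u00c5lesund", "Banana"]
print(glossary_by_collation(titles))   # Å is merged into the A bucket
print(glossary_by_code_point(titles))  # Å gets a bucket of its own
```

One trade-off to keep in mind: keying on the character code also separates upper and lower case, so "apple" and "Apple" would land in different buckets.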

Comment #6

wojtha commented 9 November 2009 at 16:47

I had the same issue with Czech letters (ĚŠČŘŽÝÁÍÉŮ etc.). There is a solution: you need to change the comparison function (collation) of the table column in your database.

In MySQL and Drupal, the comparison function of text columns is set to "utf8_general_ci" by default. When I changed it to our language-specific function, "utf8_czech_ci", all the variants of letters present in the data appeared. The binary comparison function, utf8_bin, may be a universal solution, but you lose language-specific sorting.
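
The sorting cost of a binary comparison can be sketched in Python by ordering strings on their UTF-8 bytes. This only approximates what a binary collation does; `binary_sorted` and the sample words are made up for the illustration.

```python
def binary_sorted(words):
    """Sort words by their raw UTF-8 bytes, the way a binary comparison would."""
    return sorted(words, key=lambda w: w.encode("utf-8"))


# Sample words starting with A, Z, Á and Č.
words = ["Zebra", "\u00c1ron", "\u010caj", "Adam"]
print(binary_sorted(words))
```

Every letter stays distinct, which fixes the glossary, but the accented words sort after "Zebra" because their first bytes are larger than any ASCII letter. A language-specific collation would place them next to their unaccented neighbours instead.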

Comment #7

esmerel commented 19 July 2010 at 22:48

Component: Miscellaneous » Translations

This seems like something that should go into the documentation, or we could ask the internationalization teams for help to see if anyone there has a good idea for dealing with this.

Comment #8

esmerel commented 28 August 2010 at 06:29

Issue tags: +special characters

Adding tag.

Comment #9

bojanz commented 30 December 2010 at 16:01

utf8_general_ci sucks for many languages.
Cyrillic or Arabic are pretty broken, for example. In those cases, utf8_unicode_ci is used.
utf8_unicode_ci supports everything, but is slower.

Comment #10

iamjon commented 8 February 2011 at 20:56

Status: Active » Closed (works as designed)

Closing due to a lack of activity. Please feel free to reopen.
