Global usage statistics of drupal.org - Which languages do we speak?

s.Daniel - May 23, 2008 - 15:19
Project:Drupal.org webmasters
Component:Other
Category:feature request
Priority:normal
Assigned:Unassigned
Status:closed
Description

Do you have information about how many % of the drupal.org visitors come from a none English speaking country?
This information would be interesting for a couple of discussions out there at the moment:

  • D.o redesign: Should there be a place to collect Links to language specific sites somewhere else besides /support - how about a single page on de/fr/ru/etc.drupal.org that links to the language server, links to local communities and blogs, a list of translated modules etc. ?
  • One of drupal improvements is i18n - it would be interesting how the user curve looks from people other than native English speakers.
  • There is discussion about drupals terminology (here for example http://www.angrydonuts.com/drupal-terminology-and-users) - should we think about drupal users that don't speak english there?

Greggles has posted information about how many translations have been downloaded from d.o
http://growingventuresolutions.com/blog/drupal-download-statistics-janua...
While this gives a firs impression about the global usage of drupal I doubt that these numbers get us too far.

  • Translations are being provided on other sites than d.o contributed modules not (afaik).
  • Modules are updated far more often than translations.
  • I personally develop all my German sites in English until I present them to the customer.
  • All my test sites are in English and I'm not the only one who administers his site in English language due to more
  • documentation ( http://drupal.org/node/222401 ).

Here is what google tells us:

  • Most popular language where people search for drupal is Czech, English is 4th!?
    http://www.google.com/trends?q=drupal
  • Google results for "drupal" total "21.900.000" in Czech "278.000" German "310.000" English "749.000"

To get good stats about "who uses drupal" we would need to check at several places.

  • IP > Country on d.o on several places. One test purely for the downloads.
  • Where do sites that pingpack come from?
  • Anyone else a good idea?

I don't want to discuss here weather we need de.drupal.org but I think it should be considered by the people working on the drupal.org redesign and it can only be considered after we know if anyone would use it. Knowing your users is one of the most important things in webdesign so I think we should know too.

Would be nice if we could collect some more Information here.

#1

Gerhard Killesreiter - May 23, 2008 - 19:52

Here's some data from drupal.org from March this year.

AttachmentSize
awstats.drupal.org_.alldomains.xml_.tar_.gz 44.41 KB

#2

s.Daniel - May 24, 2008 - 18:11

Thanks, so when I count visitors from United States, Great Britain, Canada, Guam (USA), Virgin Islands (USA), Virgin Islands (British) as native English speakers (I know that’s not 100% correct but I wouldn't know how to calculate better and I think it does the job) and the rest as none native English speakers I get following results.

Pages / Hits
English 7801683 / 32288314
None native English speakers 8646965 / 37428032
total 16448648 / 69716346
That would be in % 52,6 / 53,7 None native English speakers

The numbers are without accesses that can't be mapped to a country (other 2141506 / 9474960).
When I assign those to "none native English speakers" the result look like this:

English 7801683 / 32288314
None native English speakers 10788471 / 46902992
total 18590154 / 79191306
That would be in % 58 / 59,2 None native English speakers

Couldn't calculate the Bandwidth used as the some numbers are in GB and others in MB / KB

Maybe I forgot some important English speaking countries?
Could someone please check if my calculations are correct? If the numbers are correct then I am surprised by the high amount of none native English speakers and would think that maybe those need to be taken into consideration of the redesign and marketing efforts more.

By the way - is "None native English speakers" the correct term? ;)

#3

kbahey - May 25, 2008 - 03:47

Are you sure this is correct?

What about Australia and New Zealand? They are English speaking countries.

For Canada, it is a country of some 33 million people, but more than 7 million are French speakers. So adjust for this.

#4

s.Daniel - May 25, 2008 - 16:44

No I am quite sure that this is wrong in a couple of ways. It's obvious for example that not everyone in the US or Canada has English as a native language and yes to my shame I forgot Australia and New Zealand. Also millions of people in the US wouldn't call English their first language. 14.000 Germans moved to the US in 2007 for example. So please add some more suggestions how I could calculate better then I'll be happy to check once more but I have the feeling that this won't change much on the big picture.

If I haven’t over sighted a major mistake only about 50 % of the drupal.org Users are native English speakers and the other 50 % are not taken into consideration allot at the moment.

/Edit. With Australia and New Zealand as aditional English speaking countries we the above Results change to 50,07% / 51,44% (55,82% / 57,25% with others counted as non native English speakers).

I have attached my Excel sheet if anyone wants to take a look a it.

AttachmentSize
d.o-language stats.zip 123.75 KB

#5

killes@www.drop.org - May 25, 2008 - 19:05

I am not sure what you are trying to tell us. THe non-native-English-speakers sure wouldn't visit d.o i they couldn't read English to some degree.

#6

s.Daniel - May 26, 2008 - 06:58

Tell, like convince? Nothing at all here! As I said I am not looking for further conclusions, in this thread I simply want to ask for the data and make the stats visible, then at other places in discussions about Drupal marketing and the usage of d.o this information can be used as reference.

#7

kbahey - May 26, 2008 - 15:20

14,000 Germans in 300 millions is statistically insignificant.

Here is another thing: crawlers vs. humans.

If you can segment the traffic, using the User Agent in Apache's log, then you can tell which visitors are crawlers and which are humans (roughly). This can tell you if the non-English areas are overrepresented in crawlers or on-par with the English areas.

Here is another thing that we can do, but means a patch for Drupal. Drupal can log the language/locale coming from the browsers (e.g. en-US, en-CA, fr-CA, de-DE), and that would give a more accurate breakdown of language vs. country, and even who has more than one language/locale enabled (e.g. chinese and english, vs. chinese only). That would be a more accurate estimate, coupled with the IP address (for location) and User Agent (human vs. crawler). It will have to be in the future too, can't use existing data.

#8

killes@www.drop.org - May 26, 2008 - 16:37

The data I provided did not include crawlers. Or at least awstats said so.

#9

s.Daniel - May 28, 2008 - 07:13

Dries has just posted some Data about d.o user language at his blog:
http://buytaert.net/drupalorg-country-demographics

Based on Dries numbers here we have around 45,16% native English speakers and 50,20% with English as a second language (The rest speaks Paschtu or Dari ;).

#10

Gerhard Killesreiter - September 27, 2008 - 21:26
Status:active» fixed

Dries has provided info, fixed.

#11

Anonymous (not verified) - October 11, 2008 - 21:31
Status:fixed» closed

Automatically closed -- issue fixed for two weeks with no activity.

 
 

Drupal is a registered trademark of Dries Buytaert.