Needs review
Project:
Tagging
Version:
6.x-2.5
Component:
Backend
Priority:
Normal
Category:
Feature request
Assigned:
Unassigned
Reporter:
Created:
4 Aug 2011 at 10:16 UTC
Updated:
18 Oct 2011 at 10:33 UTC
Jump to comment: Most recent file
Comments
Comment #1
ñull commentedThe stop words are merged from http://www.webconfs.com/stop-words.php and http://www.ranks.nl/resources/stopwords.html
From both sites I received an email in response to my request to allow it to be included in this module under GPL 2 license and they have given me permission.
Comment #2
eugenmayer commentedlooks good to me, thanks for the contribution. No reason to no include this one.
Comment #3
ñull commentedI added better multilingual support of stop words. In my case I have only one language active and as default. This led to all kind of errors. The changes work for me, but would need to be tested in multilingual set ups too (I had no time to set up a test environment for this). In this patch I also added stop words for Dutch, German, English and Spanish.
Comment #4
Thomas_Zahreddin commentedhi,
i think a list of stopword is a good idea.
there exists inside of de_stemmer a module stemmer_api with stopword support, maybe you want to give it a chance:
carve out the stemmer_api to a full module
Comment #5
eugenmayer commentedbe sure to create patches against https://github.com/EugenMayer/tagging, not d.o dev
The API Thomas suggests sound like a pretty good approach so we can include more of those. I case, i want to make it a submodule then. So we create a tagging_stopwords module, depending on those 2 above ( so we dont need to make tagging depending on those 2)