hello
is it possible to index automatically also parts of words?
if i search for "fahrer", it also finds "fahren" - this is nice, but it doesn't find "radfahrer" or "radfahren" ...
thanks
momper
hello
is it possible to index automatically also parts of words?
if i search for "fahrer", it also finds "fahren" - this is nice, but it doesn't find "radfahrer" or "radfahren" ...
thanks
momper
Comments
Comment #1
schildi commentedfinding parts of words is a different job.
The current implementation is based on "stemming" what means building the "Wortstamm". The stem of "radfahren" is not "fahr" (as far as I know. I'm not a linguist). The stemmer is derived from a so called "ported stemmer algorythm" which looks only at the endings of words and modifies them by simple rules. If you want to split up words into components like "Rad" + "fahren" you probably have to use a dictionary. That means setting up a database containing tens of thousands of words plus some logic. Too much for a simple porter stemmer.
But if you have a limited number of words to handle then you might have a look at the source. It contains a list of "exceptions". There you could set the word "radfahren" to be reduced to "fahr". May be that helps a bit. And don't forget that after modifying the source you have to re-index your site.
Regards
Comment #2
momper commentedthanks a lot ...
Comment #3
schildi commented