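Currently the keyword filter doesn't support UTF-8, because the default PHP string functions it uses (strpos, stripos) work on bytes and only fold case for ASCII, and the word-boundary regex is not compiled in UTF-8 mode. Here's a small patch that fixes this. I've tested it on Russian feeds.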

--- keyword_filter.inc.orig     2011-02-01 05:33:13.000000000 +0300
+++ keyword_filter.inc  2011-07-06 15:55:15.000000000 +0400
@@ -41,14 +41,14 @@
   $settings['regex'] = FALSE;

   if (!$settings['word_boundaries'] && $settings['case_sensitive']) {
-    $settings['func'] = 'strpos';
+    $settings['func'] = 'mb_strpos';
   }
   else if (!$settings['word_boundaries'] && !$settings['case_sensitive']) {
-    $settings['func'] = 'stripos';
+    $settings['func'] = 'mb_stripos';
   }
   else {
     foreach ($settings['word_list'] as &$word) {
-      $word = '/\b' . $word . '\b/';
+      $word = '/\b' . $word . '\b/u';

       if (!$settings['case_sensitive']) {
         $word .= 'i';
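
To see the difference outside the module, here is a minimal standalone sketch. The sample text and keyword are made up for illustration and are not taken from keyword_filter.inc; the regex is built the same way as in the patch above. It assumes the mbstring extension is enabled and that PCRE was built with Unicode support.

```php
<?php
// Sample values for illustration only; they do not come from the module.
mb_internal_encoding('UTF-8');  // mb_* functions use this when no encoding is passed

$text    = 'Новости ПОГОДЫ на завтра';
$keyword = 'погоды';

// Byte-oriented search with ASCII-only case folding: the lowercase
// Cyrillic keyword is normally not found in the uppercase text.
$plain = stripos($text, $keyword);

// Multibyte-aware search: case folding covers Cyrillic, and the
// returned offset is counted in characters rather than bytes.
$multibyte = mb_stripos($text, $keyword);

// Word-boundary branch: with the /u modifier the pattern and subject are
// treated as UTF-8, so \b can recognise boundaries around Cyrillic words.
$regex = preg_match('/\b' . $keyword . '\b/ui', $text);

var_dump($plain, $multibyte, $regex);
```

With these inputs mb_stripos reports the match at character offset 8, while stripos should report no match. Note that the mb_* functions fall back to the configured internal encoding when none is passed, so on a site where that is not UTF-8 the patched code would probably need the encoding set explicitly.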

Comments

Assigned: Unassigned » twistor
Category: bug » feature
Status: Active » Fixed

Status: Fixed » Closed (fixed)

Automatically closed -- issue fixed for 2 weeks with no activity.