I have been looking at the module's code, and I noticed that the search is always done within the body, title, etc. fields and not the language-specific fields (body_LANGUAGE, title_LANGUAGE, etc.), which results in problematic searches with non-English languages.

For example, with Greek, body_el uses the text_el field type, which I've manually set to use the GreekStemFilterFactory in my schema.xml. Unfortunately, my change has no effect, because the field being searched is the body field, which uses the text field type (which has the English stemmer). As a result, instead of getting an accent-insensitive search of my Greek documents, I must type accents, because the Greek stemmer is never used.

I am not sure if I am missing something here, but if what I say is true, then what is the point of language-specific fields if they are not being used in search? I am looking at apachesolr_multilingual_apachesolr_modify_query and I don't see anything in there that uses the language-specific fields for searching, only for highlighting comparisons and spell checking.

Comments

mkalkbrenner

Please don't open so many issues. Your basic issue is handled at #1288144: CCK Multilanguage values, which is still waiting for your answer.

The language specific fields get queried by the apachesolr module itself according to your configuration at /admin/settings/apachesolr/query-fields

So we have to solve the issue of why the language-specific fields don't appear on this settings page, nothing else ...

arekanderu

My apologies first of all, I didn't mean to overdo it. I am not sure whether #1288144: CCK Multilanguage values is related to what I am saying, because if I look at the query log, I see the body field being queried instead of body_el. Shouldn't body_el be queried even if I am having the CCK fields issue, or do you think they are related?

Example query I see:

INFO: [] webapp=/solr path=/select params={spellcheck=true&facet=true&facet.mincount=1&spellcheck.q=ξενοδοχειο&qf=ts_cck_field_feature_spa^0.1
&qf=tags_h2_h3^3.0&qf=ts_cck_field_accessibility^0.1&qf=ts_cck_field_restauranttype^0.1&qf=taxonomy_names^2.0
&qf=ts_vid_3_names^0.3&qf=tags_a_en^0.5&qf=tags_a_el^0.5&qf=ts_cck_field_feature_pool^0.1&qf=tm_cck_field_music^0.1
&qf=tags_inline^1.0&qf=ts_cck_field_feature_wifi^0.1&qf=tm_cck_field_meals^0.1&qf=taxonomy_names_el^2.0
&qf=tags_h2_h3_el^3.0&qf=tags_h2_h3_en^3.0&qf=body_el^1.0&qf=title_en^5.0&qf=body_en^1.0&qf=ts_cck_field_alcohol^0.1
&qf=title_el^5.0&qf=tags_inline_el^1.0&qf=ts_cck_field_smoking^0.1&qf=tags_inline_en^1.0&qf=name^3.0
&qf=ts_cck_field_feature_gym^0.1&qf=ts_cck_field_uprofile_gender^1.0&qf=ts_cck_field_uprofile_ethnicity^1.0
&qf=ts_cck_field_gallery_image^1.0&qf=ts_cck_field_uprofile_children^1.0&qf=ts_cck_field_uprofile_facebook^1.0
&qf=ts_cck_field_uprofile_relationship^1.0&qf=ts_cck_field_uprofile_twitter^1.0&qf=ts_cck_field_uprofile_linkedin^1.0
&qf=ts_cck_field_node_offer^1.0&qf=ts_cck_field_node_event^1.0&qf=ts_cck_field_uprofile_friendfeed^1.0
&qf=ts_cck_field_my_teaser^1.0&qf=ts_cck_field_uprofile_occupation^1.0&qf=ts_cck_field_add_reviews^1.0
&qf=ts_cck_field_ddblock_if_image^1.0&qf=ts_cck_field_uprofile_name^1.0&qf=ts_cck_field_uprofile_religion^1.0
&qf=ts_cck_field_event_date^1.0&qf=ts_cck_field_image_cache^1.0&qf=ts_cck_field_uprofile_location^1.0
&qf=ts_cck_field_offer_date^1.0&qf=ts_cck_field_overall_rating^1.0&qf=ts_cck_field_uprofile_political^1.0
&qf=ts_cck_field_ddblock_if_slide_text^1.0&qf=ts_cck_field_uprofile_about_me^1.0&qf=ts_cck_field_ddblock_if_pager_item_text^1.0&qf=ts_cck_field_uprofile_zodiac_sign^1.0&qf=ts_cck_field_uprofile_interests^1.0&qf=ts_cck_field_uprofile_skype^1.0
&qf=ts_cck_field_uprofile_aim^1.0&qf=ts_cck_field_uprofile_google^1.0&qf=ts_cck_field_uprofile_yahoo^1.0
&qf=ts_cck_field_uprofile_icq^1.0&qf=ts_cck_field_listing_type^1.0&qf=ts_cck_field_map^1.0&qf=ts_cck_field_uprofile_website^1.0&qf=ts_cck_field_uprofile_blog^1.0&qf=ts_cck_field_uprofile_favorite1^1.0&qf=ts_cck_field_uprofile_favorite2^1.0
&qf=ts_cck_field_uprofile_favorite3^1.0&qf=ts_cck_field_story_video^1.0&qf=ts_cck_field_latitude^1.0
&qf=ts_cck_field_longitude^1.0&qf=ts_cck_field_logo^1.0&qf=ts_cck_field_add_video^1.0&qf=ts_cck_field_image_gallery^1.0
&qf=ts_cck_field_slide_header^1.0&qf=ts_cck_field_slogan^1.0&qf=ts_cck_field_website^1.0&qf=ts_cck_field_header_image^1.0
&qf=ts_cck_field_price_range^1.0&qf=ts_cck_field_feature_pool^1.0&qf=ts_cck_field_feature_gym^1.0
&qf=ts_cck_field_restauranttype^1.0&qf=ts_cck_field_feature_spa^1.0&qf=ts_cck_field_smoking^1.0
&qf=ts_cck_field_feature_wifi^1.0&qf=ts_cck_field_view_type^1.0&qf=ts_cck_field_accessibility^1.0&qf=ts_cck_field_meals^1.0
&qf=ts_cck_field_reservations^1.0&qf=ts_cck_field_parking^1.0&qf=ts_cck_field_alcohol^1.0&qf=ts_cck_field_music^1.0
&qf=ts_cck_field_massage^1.0&qf=ts_cck_field_outdoor_place^1.0&qf=ts_cck_field_safe_box^1.0
&qf=ts_cck_field_24_reception^1.0&qf=ts_cck_field_airport_shuttle^1.0&qf=ts_cck_field_ambience^1.0
&qf=ts_cck_field_hair_salon^1.0
&qf=ts_cck_field_happy_hour^1.0&qf=ts_cck_field_pet_friendly^1.0&qf=ts_cck_field_meeting_room^1.0
&qf=ts_cck_field_conference_facilities^1.0&qf=ts_cck_field_laundry_facilities^1.0&qf=ts_cck_field_stars^1.0
&qf=ts_cck_field_has_online_booking^1.0&qf=ts_cck_field_online_booking_url^1.0&qf=ts_cck_field_affiliate_url^1.0
&qf=body^40.0&qf=title^5.0&qf=name^3.0&qf=taxonomy_names^2.0&qf=tags_h1^5.0&qf=tags_h2_h3^3.0
&qf=tags_h4_h5_h6^2.0&qf=tags_inline^1.0&hl.fl=ts_cck_field_uprofile_gender,ts_cck_field_uprofile_ethnicity,
ts_cck_field_gallery_image,ts_cck_field_uprofile_children,ts_cck_field_uprofile_facebook,ts_cck_field_uprofile_relationship,
ts_cck_field_uprofile_twitter,ts_cck_field_uprofile_linkedin,ts_cck_field_node_offer,ts_cck_field_node_event,
ts_cck_field_uprofile_friendfeed,ts_cck_field_my_teaser,ts_cck_field_uprofile_occupation,ts_cck_field_add_reviews
,ts_cck_field_ddblock_if_image,ts_cck_field_uprofile_name,ts_cck_field_uprofile_religion,ts_cck_field_event_date
,ts_cck_field_image_cache,ts_cck_field_uprofile_location,ts_cck_field_offer_date,ts_cck_field_overall_rating,
ts_cck_field_uprofile_political,ts_cck_field_ddblock_if_slide_text,ts_cck_field_uprofile_about_me,
ts_cck_field_ddblock_if_pager_item_text,ts_cck_field_uprofile_zodiac_sign,ts_cck_field_uprofile_interests,
ts_cck_field_uprofile_skype,ts_cck_field_uprofile_aim,ts_cck_field_uprofile_google,ts_cck_field_uprofile_yahoo,
ts_cck_field_uprofile_icq,ts_cck_field_listing_type,ts_cck_field_map,ts_cck_field_uprofile_website,ts_cck_field_uprofile_blog,
ts_cck_field_uprofile_favorite1,ts_cck_field_uprofile_favorite2,ts_cck_field_uprofile_favorite3,ts_cck_field_story_video,
ts_cck_field_latitude,ts_cck_field_longitude,ts_cck_field_logo,ts_cck_field_add_video,ts_cck_field_image_gallery,
ts_cck_field_slide_header,ts_cck_field_slogan,ts_cck_field_website,ts_cck_field_header_image,ts_cck_field_price_range,
ts_cck_field_feature_pool,ts_cck_field_feature_gym,ts_cck_field_restauranttype,ts_cck_field_feature_spa,ts_cck_field_smoking,
ts_cck_field_feature_wifi,ts_cck_field_view_type,ts_cck_field_accessibility,ts_cck_field_meals,ts_cck_field_reservations
,ts_cck_field_parking,ts_cck_field_alcohol,ts_cck_field_music,ts_cck_field_massage,ts_cck_field_outdoor_place,
ts_cck_field_safe_box,ts_cck_field_24_reception,ts_cck_field_airport_shuttle,ts_cck_field_ambience,ts_cck_field_hair_salon,
ts_cck_field_happy_hour,ts_cck_field_pet_friendly,ts_cck_field_meeting_room,ts_cck_field_conference_facilities,
ts_cck_field_laundry_facilities,ts_cck_field_stars,ts_cck_field_has_online_booking,ts_cck_field_online_booking_url,
ts_cck_field_affiliate_url,body_el&json.nl=map&wt=json&rows=12&bq=type:directory_club_bar^0.5&bq=type:directory_entry^0.3
&bq=type:directory_hotel^0.8&fl=id,nid,title,comment_count,type,created,changed,score,path,url,uid,name,is_upload_count,
ss_image_relative,ts_cck_field_uprofile_gender,ts_cck_field_uprofile_ethnicity,ts_cck_field_gallery_image,
ts_cck_field_uprofile_children,ts_cck_field_uprofile_facebook,ts_cck_field_uprofile_relationship,ts_cck_field_uprofile_twitter,
ts_cck_field_uprofile_linkedin,ts_cck_field_node_offer,ts_cck_field_node_event,ts_cck_field_uprofile_friendfeed,
ts_cck_field_my_teaser,ts_cck_field_uprofile_occupation,ts_cck_field_add_reviews,ts_cck_field_ddblock_if_image,
ts_cck_field_uprofile_name,ts_cck_field_uprofile_religion,ts_cck_field_event_date,ts_cck_field_image_cache,
ts_cck_field_uprofile_location,ts_cck_field_offer_date,ts_cck_field_overall_rating,ts_cck_field_uprofile_political,
ts_cck_field_ddblock_if_slide_text,ts_cck_field_uprofile_about_me,ts_cck_field_ddblock_if_pager_item_text,
ts_cck_field_uprofile_zodiac_sign,ts_cck_field_uprofile_interests,ts_cck_field_uprofile_skype,ts_cck_field_uprofile_aim,
ts_cck_field_uprofile_google,
ts_cck_field_uprofile_yahoo,ts_cck_field_uprofile_icq,ts_cck_field_listing_type,ts_cck_field_map,
ts_cck_field_uprofile_website,ts_cck_field_uprofile_blog,ts_cck_field_uprofile_favorite1,ts_cck_field_uprofile_favorite2,
ts_cck_field_uprofile_favorite3,ts_cck_field_story_video,ts_cck_field_latitude,ts_cck_field_longitude,ts_cck_field_logo,
ts_cck_field_add_video,ts_cck_field_image_gallery,ts_cck_field_slide_header,ts_cck_field_slogan,ts_cck_field_website,
ts_cck_field_header_image,ts_cck_field_price_range,ts_cck_field_feature_pool,ts_cck_field_feature_gym,
ts_cck_field_restauranttype,
ts_cck_field_feature_spa,ts_cck_field_smoking,ts_cck_field_feature_wifi,ts_cck_field_view_type,ts_cck_field_accessibility,
ts_cck_field_meals,ts_cck_field_reservations,ts_cck_field_parking,ts_cck_field_alcohol,ts_cck_field_music,ts_cck_field_massage,
ts_cck_field_outdoor_place,ts_cck_field_safe_box,ts_cck_field_24_reception,ts_cck_field_airport_shuttle,ts_cck_field_ambience,
ts_cck_field_hair_salon,ts_cck_field_happy_hour,ts_cck_field_pet_friendly,ts_cck_field_meeting_room,
ts_cck_field_conference_facilities,ts_cck_field_laundry_facilities,ts_cck_field_stars,ts_cck_field_has_online_booking
,ts_cck_field_online_booking_url,ts_cck_field_affiliate_url&facet.sort=true&start=0&q=ξενοδοχειο&spellcheck.dictionary=spellchecker_el&bf=recip(rord(created),4,39,39)^200.0&fq=language:el} hits=0 status=0 QTime=45 

If you think they are related, then just close this issue :)

mkalkbrenner

Category: feature » support

"I see the body field being queried instead of body_el."

That's not true. In your log I see

&qf=body_el^1.0
&qf=body_en^1.0

But I also see

&qf=body^40.0

Didn't you "omit" body ("Body text - the full, rendered content:") at "/admin/settings/apachesolr/query-fields"?
In your settings body is 40 times more important than body_el.
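
A quick way to check this kind of thing is to pull the qf boosts out of the logged params string and compare them. The following is only a minimal Python sketch, assuming the params have been copied from the log without the surrounding braces; the helper name qf_boosts is hypothetical and not part of the apachesolr module, and the params string is a short excerpt of the query above, not the full log line.

```python
from urllib.parse import parse_qs


def qf_boosts(params):
    """Collect every boost given to each field via the qf parameters.

    A field can appear more than once in qf (as in the log above),
    so each field maps to a list of boosts.
    """
    boosts = {}
    for entry in parse_qs(params).get("qf", []):
        field, _, boost = entry.partition("^")
        # Solr treats a qf entry without "^boost" as boost 1.0.
        boosts.setdefault(field, []).append(float(boost) if boost else 1.0)
    return boosts


# Excerpt of the logged params (assumption: only a few qf entries are kept).
params = (
    "qf=body_el^1.0&qf=title_el^5.0&qf=body_en^1.0"
    "&qf=body^40.0&qf=title^5.0&fq=language:el"
)

boosts = qf_boosts(params)
for field in ("body", "body_el"):
    print(field, boosts[field])
```

For this excerpt, body carries a boost of 40.0 while body_el carries 1.0, which is the imbalance described above.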

arekanderu

It seems that I forgot to set "Body text inside H4, H5, or H6 tags" to "Omit". Everything now seems to work properly.

Thanks and my apologies again.

mkalkbrenner

Status: Active » Closed (works as designed)

You're welcome.

I know that using this module is not that easy because it has to deal with the design decisions of the apachesolr module itself ;-)

I'm closing this issue, but it would be nice if you could reply to #1288144: CCK Multilanguage values.