Comments

damien tournoud’s picture

Status: Active » Needs review
File (new): 1.53 KB
pwolanin’s picture

I don't think we want this definition as-is. However, it would probably make sense to at least lowercase string fields?

damien tournoud’s picture

Title: schema.xml should use alphaOnlySort, not string, for stitle and sname... » schema.xml should use analysers for "order only" fields
File (new): 4.37 KB

Added LowerCase and MappingCharFilterFactory to those fields to reduce character forms.
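For readers unfamiliar with these factories, an analyzer chain of this kind might look roughly like the fragment below. This is only a sketch, not the contents of the attached patch: the mapping file name is an assumption (it is the one shipped with the Solr example configuration), and the keyword tokenizer is assumed because an order-only field should be indexed as a single term.

```xml
<!-- Sketch only: goes inside a <fieldType class="solr.TextField"> definition. -->
<analyzer>
  <!-- Assumed mapping file; folds accented characters to their base forms. -->
  <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
  <!-- Keep the whole value as one token so it can be used for ordering. -->
  <tokenizer class="solr.KeywordTokenizerFactory"/>
  <filter class="solr.LowerCaseFilterFactory"/>
</analyzer>
```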

damien tournoud’s picture

File (new): 4.6 KB

The same patch, but without mapping on order fields.

pwolanin’s picture

Let's combine proposed schema changes with those from: http://drupal.org/node/370436

pwolanin’s picture

File (new): 5.87 KB

I'm not sure about changing the dynamic fields this way - need to look at how this affects faceting on those fields.

pwolanin’s picture

Status: Needs review » Needs work

So, as I feared, if you specify the lower case factory, the possible facet values are lower-cased in the response. I don't think that's what we want for names or taxonomy terms. For example:

The result in a doc has:

<arr name="smfield_vid_Tags">
  <str>Drupal</str>
  <str>PHP</str>
  <str>Solr</str>
</arr>

but the facet values look like:

<lst name="facet_fields">
  <lst name="smfield_vid_Tags">
    <int name="drupal">2</int>
    <int name="amazon">1</int>
    <int name="php">1</int>
    <int name="solr">1</int>
  </lst>
</lst>
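
The difference arises because the values in a returned doc are the stored values, while facet values are the indexed terms, which have already passed through the analyzer. One possible way around it is to leave the facet field as a plain string and copy it into a separate analyzed field used only for ordering. The fragment below is a sketch under assumptions, not the patch in this issue: the names `ss_*`, `sort_ss_*` and `sortString` are illustrative, and it relies on Solr carrying the wildcard part of `source` over into `dest`.

```xml
<!-- Hypothetical names throughout. Field types belong in <types>,
     the field and copyField declarations in the rest of schema.xml. -->
<fieldType name="string" class="solr.StrField" omitNorms="true"/>
<fieldType name="sortString" class="solr.TextField" sortMissingLast="true" omitNorms="true">
  <analyzer>
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.TrimFilterFactory"/>
  </analyzer>
</fieldType>

<!-- Facet on this field: terms keep their original case. -->
<dynamicField name="ss_*" type="string" indexed="true" stored="true"/>
<!-- Sort on this field: lower-cased, never returned in results. -->
<dynamicField name="sort_ss_*" type="sortString" indexed="true" stored="false"/>
<copyField source="ss_*" dest="sort_ss_*"/>
```

With a layout like this, faceting on the string field would return "Drupal" and "PHP" unchanged, and only the copy used for sorting is lower-cased.
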
pwolanin’s picture

Status: Needs work » Needs review
File (new): 7.14 KB

Here's another go at it after much beating around.

pwolanin’s picture

File (new): 7.5 KB

Played around with wildcard copyField directives after discussion with Damien.

Also, is there any reason at this point we need to use, e.g., ssfield* rather than ss_* or *_ss?

pwolanin’s picture

File (new): 11.59 KB
pwolanin’s picture

File (new): 12.88 KB

missed some changes.

pwolanin’s picture

File (new): 12.97 KB

Discussed with Jacob: we probably don't want punctuation stripping out of the box. That's more of a site-specific requirement.
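
For reference, the punctuation stripping in question is presumably the pattern filter from the alphaOnlySort type in the Solr example schema. That is an assumption about which filter is meant. A site that does want it could add a line like the following to its own sort analyzer, after the lower-case filter:

```xml
<!-- From the Solr example schema's alphaOnlySort type: removes every
     character that is not a-z, so digits and spaces are dropped too. -->
<filter class="solr.PatternReplaceFilterFactory"
        pattern="([^a-z])" replacement="" replace="all"/>
```

Because the pattern also removes digits and whitespace, it changes sort order noticeably, which is one reason to leave it as an opt-in.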

pwolanin’s picture

Title: schema.xml should use analysers for "order only" fields » compact field names, use analysers for "order only" fields in schema.xml

Re-titling the issue since we are also shortening the field names.

pwolanin’s picture

File (new): 12.65 KB

taking out the non-analyzed fields - string fields are not analyzed in any case, so I don't think this really matters.

damien tournoud’s picture

The patch looks good at first sight, but I haven't actually tested it.

pwolanin’s picture

Status: Needs review » Fixed
File (new): 12.65 KB

minor comment cleanup. Committing this.

pwolanin’s picture

Status: Fixed » Needs review
File (new): 757 bytes

one bug in the committed schema

pwolanin’s picture

Status: Needs review » Fixed

committed

pwolanin’s picture

Status: Fixed » Active

per suggestion by Damien and agreement from Jacob, we should go for consistency with sort_name, sort_title, sort_ss_*

pwolanin’s picture

File (new): 2.95 KB

Patch attached. It also changes the ignored field to multi-valued (needs testing), since there have been reports of an error from Solr when trying to post an unknown multi-valued field.

pwolanin’s picture

Status: Active » Needs review
pwolanin’s picture

My testing does show that making the ignored field multi-valued is better at preventing errors.

see also: https://issues.apache.org/jira/browse/SOLR-1022
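
As a sketch of what a multi-valued catch-all looks like, the Solr example schema declares its ignored type and wildcard field roughly as follows. The committed Drupal schema may differ in detail, so treat the attribute set here as an assumption:

```xml
<!-- Type that neither indexes nor stores anything sent to it. -->
<fieldType name="ignored" class="solr.StrField"
           indexed="false" stored="false" multiValued="true"/>

<!-- Catch-all: any field not matched by another declaration is dropped
     silently, whether it arrives with one value or several. -->
<dynamicField name="*" type="ignored" multiValued="true"/>
```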

pwolanin’s picture

File (new): 5.65 KB

oops - need to also change the copyField directives.

pwolanin’s picture

Status: Needs review » Fixed

committed to 6.x

Status: Fixed » Closed (fixed)

Automatically closed -- issue fixed for 2 weeks with no activity.