stripping date from search terms
MacRonin - May 21, 2007 - 18:11
| Project: | Search 404 |
| Version: | 5.x-1.x-dev |
| Component: | Code |
| Category: | feature request |
| Priority: | normal |
| Assigned: | zyxware |
| Status: | closed |
Jump to:
Description
First thanks for the module. It works great on many of my pagse. Unfortunately one specific type drives it crazy?
I would like to see the word/string removal function extended. Many of my pages are of the format
yyyy/mm/dd/title-of-page-and-content
2007/05/02/title-of-page-and-content
Unfortunately it brings in the date info as part of the search data. I would like to exclude that date from the generated search string. Since not everyone might agree, it would probably be best to do it as an optional feature though. It could be implemented as either removing any numeric-only levels, or by just using the last level(since thats where the title is)

#1
Great idea, I'll try figure out a way of doing this. An ignore numbers checkbox perhaps?
I'm afraid that I have to delay this a few weeks, but if someone steps up and provides a patch I'll certainly review that.
#2
Or even better: a regexbox for ignoring stuff
#3
Just as an addition; I'm not sure I fully understand this (nor does it really affect me, so I probably haven't read this thoroughly enough...), but would this have any effect of duplicate content, ie wouldn't it create the duplicate content effect?
If so, that's not a good thing for SEO... sorry if I've missed the point though.
#4
No, it won't affect SEO since a 404 header is returned by drupal core.
#5
I added a PCRE box to 5.x-1.x-dev. It requires you to know the PCRE syntax. I have no time write documentation on on how to use it.
(If you want to see it in stable; please review 5.x-1.x-dev on a non-production site)
#6
Since this feature has already been implemented on the dev branch I am marking this as done.
#7
Sorry I should have marked it as closed. Done.
#8
In the meanwhile you can get your system working the way you want by stripping out the numbers corresponding to the dates eg: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 2006 2007 2008 to get the required functionality.