stripping date from search terms

MacRonin - May 21, 2007 - 18:11
Project:Search 404
Version:5.x-1.x-dev
Component:Code
Category:feature request
Priority:normal
Assigned:zyxware
Status:closed
Description

First thanks for the module. It works great on many of my pagse. Unfortunately one specific type drives it crazy?

I would like to see the word/string removal function extended. Many of my pages are of the format
yyyy/mm/dd/title-of-page-and-content
2007/05/02/title-of-page-and-content

Unfortunately it brings in the date info as part of the search data. I would like to exclude that date from the generated search string. Since not everyone might agree, it would probably be best to do it as an optional feature though. It could be implemented as either removing any numeric-only levels, or by just using the last level(since thats where the title is)

#1

forngren - May 21, 2007 - 18:18
Version:5.x-1.1» 5.x-1.x-dev

Great idea, I'll try figure out a way of doing this. An ignore numbers checkbox perhaps?

I'm afraid that I have to delay this a few weeks, but if someone steps up and provides a patch I'll certainly review that.

#2

forngren - May 21, 2007 - 18:20

Or even better: a regexbox for ignoring stuff

#3

NikLP - January 13, 2008 - 09:00

Just as an addition; I'm not sure I fully understand this (nor does it really affect me, so I probably haven't read this thoroughly enough...), but would this have any effect of duplicate content, ie wouldn't it create the duplicate content effect?

If so, that's not a good thing for SEO... sorry if I've missed the point though.

#4

forngren - June 4, 2007 - 15:22

No, it won't affect SEO since a 404 header is returned by drupal core.

#5

forngren - August 5, 2007 - 12:44

I added a PCRE box to 5.x-1.x-dev. It requires you to know the PCRE syntax. I have no time write documentation on on how to use it.

(If you want to see it in stable; please review 5.x-1.x-dev on a non-production site)

#6

zyxware - January 12, 2008 - 12:51
Assigned to:Anonymous» zyxware
Status:active» fixed

Since this feature has already been implemented on the dev branch I am marking this as done.

#7

zyxware - January 12, 2008 - 12:57
Assigned to:zyxware» Anonymous
Status:fixed» closed

Sorry I should have marked it as closed. Done.

#8

zyxware - January 12, 2008 - 13:05
Assigned to:Anonymous» zyxware

In the meanwhile you can get your system working the way you want by stripping out the numbers corresponding to the dates eg: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 2006 2007 2008 to get the required functionality.

 
 

Drupal is a registered trademark of Dries Buytaert.