First, thanks for the module. It works great on many of my pages. Unfortunately, one specific type of page drives it crazy.

I would like to see the word/string removal function extended. Many of my pages have paths of the format:
yyyy/mm/dd/title-of-page-and-content
2007/05/02/title-of-page-and-content

Unfortunately, the module pulls in the date as part of the search data. I would like to exclude the date from the generated search string. Since not everyone might agree, it would probably be best as an optional feature. It could be implemented either by removing any numeric-only levels or by using only the last level (since that's where the title is).
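To make the two options concrete, here is a minimal sketch in Python. It is only an illustration, not the module's code: the function names are made up, and turning hyphens into spaces is an assumption about how the search string is built.

```python
def keywords_drop_numeric(path):
    """Option 1: drop every path level that consists only of digits."""
    levels = [p for p in path.strip("/").split("/") if p]
    kept = [p for p in levels if not p.isdigit()]
    # Assumption: hyphens separate words in the generated search string.
    return " ".join(kept).replace("-", " ")


def keywords_last_level(path):
    """Option 2: use only the last path level, where the title lives."""
    levels = [p for p in path.strip("/").split("/") if p]
    return levels[-1].replace("-", " ") if levels else ""
```

The two options differ when a non-numeric level precedes the title: for a path like blog/2007/my-post, the first keeps "blog" in the search string and the second does not.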

Comments

forngren’s picture

Version: 5.x-1.1 » 5.x-1.x-dev

Great idea, I'll try to figure out a way of doing this. An "ignore numbers" checkbox, perhaps?

I'm afraid that I have to delay this a few weeks, but if someone steps up and provides a patch I'll certainly review that.

forngren’s picture

Or even better: a regex box for ignoring stuff.

NikLP’s picture

Assigned: zyxware » Unassigned
Status: Closed (fixed) » Active

Just as an addition: I'm not sure I fully understand this (nor does it really affect me, so I probably haven't read it thoroughly enough...), but would this have any effect on duplicate content, i.e. wouldn't it create a duplicate content problem?

If so, that's not a good thing for SEO... sorry if I've missed the point though.

forngren’s picture

No, it won't affect SEO, since Drupal core returns a 404 header.

forngren’s picture

I added a PCRE box to 5.x-1.x-dev. It requires you to know PCRE syntax. I have no time to write documentation on how to use it.

(If you want to see it in stable, please review 5.x-1.x-dev on a non-production site.)
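As a rough idea of what a pattern for the date case could look like, here is a sketch using Python's re module. The pattern uses only syntax that PCRE shares, but what string the module applies the pattern to, and whether it expects delimiters around it, is not stated in this thread, so treat both as assumptions and test on a non-production site.

```python
import re

# Assumption: the pattern is matched against the requested path.
# Matches a leading yyyy/mm/dd/ prefix, with an optional leading slash.
DATE_PREFIX = re.compile(r"^/?\d{4}/\d{2}/\d{2}/")


def strip_date_prefix(path):
    """Remove a leading yyyy/mm/dd/ date from a path, if present."""
    return DATE_PREFIX.sub("", path)
```

Paths without a leading date are returned unchanged, so the pattern should be safe for pages that do not follow the date format.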

zyxware’s picture

Assigned: Unassigned » zyxware
Status: Active » Fixed

Since this feature has already been implemented on the dev branch, I am marking this as done.

zyxware’s picture

Assigned: zyxware » Unassigned
Status: Fixed » Closed (fixed)

Sorry, I should have marked it as closed. Done.

zyxware’s picture

Assigned: Unassigned » zyxware

In the meantime, you can get the required functionality by stripping out the numbers that correspond to the dates, e.g.: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 2006 2007 2008
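A small Python sketch of how such a list could be generated and applied. This illustrates the idea only and is not the module's code. One assumption worth checking: the example path uses zero-padded values such as 05 and 02, and the thread does not say whether the module would match those against 5 and 2, so the sketch includes both forms.

```python
def build_ignore_words(years):
    """Build the set of date-like numbers to strip from the keywords."""
    words = set()
    for n in range(1, 32):
        words.add(str(n))       # e.g. "5"
        words.add(f"{n:02d}")   # e.g. "05" (zero-padded form, an assumption)
    words.update(str(y) for y in years)
    return words


def filter_keywords(keywords, ignore):
    """Keep only the keywords that are not in the ignore set."""
    return [w for w in keywords if w not in ignore]
```

Note that this approach also removes any of these numbers when they appear in a title, which the path-based options avoid.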

Status: Active » Closed (fixed)