First thanks for the module. It works great on many of my pagse. Unfortunately one specific type drives it crazy?
I would like to see the word/string removal function extended. Many of my pages are of the format
yyyy/mm/dd/title-of-page-and-content
2007/05/02/title-of-page-and-content
Unfortunately it brings in the date info as part of the search data. I would like to exclude that date from the generated search string. Since not everyone might agree, it would probably be best to do it as an optional feature though. It could be implemented as either removing any numeric-only levels, or by just using the last level(since thats where the title is)
Comments
Comment #1
forngren CreditAttribution: forngren commentedGreat idea, I'll try figure out a way of doing this. An ignore numbers checkbox perhaps?
I'm afraid that I have to delay this a few weeks, but if someone steps up and provides a patch I'll certainly review that.
Comment #2
forngren CreditAttribution: forngren commentedOr even better: a regexbox for ignoring stuff
Comment #3
NikLP CreditAttribution: NikLP commentedJust as an addition; I'm not sure I fully understand this (nor does it really affect me, so I probably haven't read this thoroughly enough...), but would this have any effect of duplicate content, ie wouldn't it create the duplicate content effect?
If so, that's not a good thing for SEO... sorry if I've missed the point though.
Comment #4
forngren CreditAttribution: forngren commentedNo, it won't affect SEO since a 404 header is returned by drupal core.
Comment #5
forngren CreditAttribution: forngren commentedI added a PCRE box to 5.x-1.x-dev. It requires you to know the PCRE syntax. I have no time write documentation on on how to use it.
(If you want to see it in stable; please review 5.x-1.x-dev on a non-production site)
Comment #6
zyxware CreditAttribution: zyxware commentedSince this feature has already been implemented on the dev branch I am marking this as done.
Comment #7
zyxware CreditAttribution: zyxware commentedSorry I should have marked it as closed. Done.
Comment #8
zyxware CreditAttribution: zyxware commentedIn the meanwhile you can get your system working the way you want by stripping out the numbers corresponding to the dates eg: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 2006 2007 2008 to get the required functionality.