Posted by djudd on June 29, 2009 at 11:57pm
Jump to:
| Project: | Search configuration |
| Version: | 6.x-1.x-dev |
| Component: | Code |
| Category: | bug report |
| Priority: | critical |
| Assigned: | Unassigned |
| Status: | closed (fixed) |
Issue Summary
I have a node for iPaper, where I am uploading PDF's of my publication. These nodes, for some reason, stop the search index from running and stall cron out until it locks up and i have to go in and manually delete the cron_semaphore element to get it running again, and then it stalls back out.
I have installed Search_Config, and checked the iPaper node type for index exclusion, but it is still trying to index these node types. It seems this component is not working.
I am running Drupal 6.12, which I believe has updates to the DB layer? Is there an update for this module forthcoming, or is there a bigger problem here?
Comments
#1
djudd,
Is the iPaper content type provided by CCK and if so what are the field types?
#2
I don't believe it's CCK based. You can take a look at the module here:
http://drupal.org/project/ipaper
Essentially, it uploads a PDF (or other document types) to the Scribd service and then links to that document.
What I discovered is that it tries to place the full text of the document into the search database, which is just way too much information for a large file. I have turned off it's indexing of all text, and that solves my problem.
However, I was never able to use Search Config to exclude the node type.
#3
Same here. Last week, a production site I work on, stopped indexing new content. Turns out there is a bug between Drupal core search module and this module.
Drupal indexes all nodes no matter what. Search config is supposed to tell the indexer not to index certain node types. This doesn't work.
The site in question uses an 'Editions' paradigm for homepage publishing, so there are at least 2 or 3 versions of the homepage every day. The unused editions get Unpublished and are not supposed to even be indexed.
Once I fixed the problem with real content not being indexed, by hacking the max number for the indexer up to 2000 from 500.
Now that the indexer is getting new content, it is also getting all the content that search config says it shouldn't.
This is critical. I'm going to have to hack core node to fix this site by hard-coding what nodes can and cannot be indexed.
What is the problem? Anyone?
#4
I think this is the problem I am having with re-index:
#565232: little bug in validate function
#5
I agree that this issue was related to the re-indexing bug that has since been closed. Please re-open if the problem persists.