Indexing not starting
| Project: | Swish-E Indexer |
| Version: | 5.x-1.x-dev |
| Component: | Miscellaneous |
| Category: | support request |
| Priority: | normal |
| Assigned: | Unassigned |
| Status: | active |
Jump to:
Drupal 5.6 on Debian 4.1
When I check the Begin Swish-E Indexing I get the following:
* .
err: No unique words indexed!
Writing main index...
* Your Swish-E settings has been saved.
in the apache log I see:
Error: Couldn't open file '/var/www/drupal5/files/KnowledgeTreeUserManua\.pdf'
If I invoke swish-e voa the command line:
swish-e -v 3 -i /var/www/drupal5/files/KnowledgeTreeUserManua\.pdf
Indexing Data Source: "File-System"
Indexing "/var/www/drupal5/files/KnowledgeTreeUserManua.pdf"Checking file "/var/www/drupal5/files/KnowledgeTreeUserManua.pdf"...
KnowledgeTreeUserManua.pdf - Using DEFAULT (HTML2) parser - (261340 words)Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 80,634 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
80,634 unique words indexed.
4 properties sorted.
1 file indexed. 1,303,650 total bytes. 261,340 total words.
Elapsed time: 00:00:02 CPU time: 00:00:02
Indexing done!
Any ideas?

#1
My bad. It worked once libxml2-utils was installed.
#2
I have exactly the same problem even with libxml2-utils installed. I still get this error when I try to index from Drupal interface:
err: No unique words indexed!Writing main index...
Although I can successfully run swish-e vi command line:
swish-e -v 3 -i /var/www/drupal_multisite.com/docroot/sites/mysite.com/files/faculty.pdf
Indexing Data Source: "File-System"
Indexing "/var/www/drupal_multisite.com/docroot/sites/mysite.com/files/faculty.pdf"
Checking file "/var/www/drupal_multisite.com/docroot/sites/mysite.com/files/faculty.pdf"...
faculty.pdf - Using DEFAULT (HTML2) parser - (7359 words)
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 3,952 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
3,952 unique words indexed.
4 properties sorted.
1 file indexed. 36,666 total bytes. 7,359 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!
I guess the only difference is that I'm using 5.x-1.1, not 5.x-1.x-dev
Does anyone have any suggestions?
#3
#4
Do you use a symbolic link for the drupal file path? In my case, yes.
I chased the source code. First, if I changed a file path in a configuration file which swish module generates in /tmp directory TO an orignal directory, it worked.
So I put the following code into swish.integration.inc file on line 25:
fwrite($handle, "FollowSymLinks yes\n"); // Follow SymLink
This code adds 'FollowSymLinks yes' option to the configuration file which will be dynamically generated by swish module in /tmp.
Please try and feedback it. HTH
#5
Thank you for your response.
I wonder what do you mean by "symbolic link for the drupal file path"?
Do you mean Drupal multisite setup? I do have multisite Drupal installation, but I don't have symbolic link that would point back to Drupal codebase installation. Instead I have Apache config file for that host which points to Drupal codebase.
If you are talking about my Drupal "File System" setup (admin/settings/file-system) then I don't have any symbolic links there.
in my swish file within /temp directory I have this:
IncludeConfigFile /var/www/drupal-codebase.com/docroot/sites/all/modules/swish/conf/common.confIndexDir /var/www/drupal-codebase.com/docroot/sites/my-site.com/files
I don't know if that is what it supposed to be, but it looks right.