Indexing not starting

gilf - February 2, 2008 - 14:20
Project:Swish-E Indexer
Version:5.x-1.x-dev
Component:Miscellaneous
Category:support request
Priority:normal
Assigned:Unassigned
Status:active
Description

Drupal 5.6 on Debian 4.1

When I check the Begin Swish-E Indexing I get the following:

* .
err: No unique words indexed!
Writing main index...
* Your Swish-E settings has been saved.

in the apache log I see:

Error: Couldn't open file '/var/www/drupal5/files/KnowledgeTreeUserManua\.pdf'

If I invoke swish-e voa the command line:

swish-e -v 3 -i /var/www/drupal5/files/KnowledgeTreeUserManua\.pdf
Indexing Data Source: "File-System"
Indexing "/var/www/drupal5/files/KnowledgeTreeUserManua.pdf"

Checking file "/var/www/drupal5/files/KnowledgeTreeUserManua.pdf"...
KnowledgeTreeUserManua.pdf - Using DEFAULT (HTML2) parser - (261340 words)

Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 80,634 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
80,634 unique words indexed.
4 properties sorted.
1 file indexed. 1,303,650 total bytes. 261,340 total words.
Elapsed time: 00:00:02 CPU time: 00:00:02
Indexing done!

Any ideas?

#1

gilf - February 16, 2008 - 21:13
Status:active» closed

My bad. It worked once libxml2-utils was installed.

#2

gd1008 - May 9, 2008 - 20:55

I have exactly the same problem even with libxml2-utils installed. I still get this error when I try to index from Drupal interface:

err: No unique words indexed!
Writing main index...

Although I can successfully run swish-e vi command line:

swish-e  -v 3 -i /var/www/drupal_multisite.com/docroot/sites/mysite.com/files/faculty.pdf
Indexing Data Source: "File-System"
Indexing "/var/www/drupal_multisite.com/docroot/sites/mysite.com/files/faculty.pdf"

Checking file "/var/www/drupal_multisite.com/docroot/sites/mysite.com/files/faculty.pdf"...
  faculty.pdf - Using DEFAULT (HTML2) parser -  (7359 words)

Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 3,952 words alphabetically
Writing header ...
Writing index entries ...
  Writing word text: Complete
  Writing word hash: Complete
  Writing word data: Complete
3,952 unique words indexed.
4 properties sorted.                                             
1 file indexed.  36,666 total bytes.  7,359 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!

I guess the only difference is that I'm using 5.x-1.1, not 5.x-1.x-dev
Does anyone have any suggestions?

#3

gd1008 - May 9, 2008 - 20:56
Status:closed» active

#4

yas - May 14, 2008 - 19:38

Do you use a symbolic link for the drupal file path? In my case, yes.

I chased the source code. First, if I changed a file path in a configuration file which swish module generates in /tmp directory TO an orignal directory, it worked.

So I put the following code into swish.integration.inc file on line 25:

fwrite($handle, "FollowSymLinks yes\n"); // Follow SymLink

This code adds 'FollowSymLinks yes' option to the configuration file which will be dynamically generated by swish module in /tmp.

Please try and feedback it. HTH

#5

gd1008 - May 22, 2008 - 15:15

Thank you for your response.

I wonder what do you mean by "symbolic link for the drupal file path"?

Do you mean Drupal multisite setup? I do have multisite Drupal installation, but I don't have symbolic link that would point back to Drupal codebase installation. Instead I have Apache config file for that host which points to Drupal codebase.

If you are talking about my Drupal "File System" setup (admin/settings/file-system) then I don't have any symbolic links there.

in my swish file within /temp directory I have this:

IncludeConfigFile /var/www/drupal-codebase.com/docroot/sites/all/modules/swish/conf/common.conf
IndexDir /var/www/drupal-codebase.com/docroot/sites/my-site.com/files

I don't know if that is what it supposed to be, but it looks right.

 
 

Drupal is a registered trademark of Dries Buytaert.