No results !

Marcin Pajdzik - April 4, 2008 - 02:29
Project:Swish-E Indexer
Version:5.x-1.1
Component:Code
Category:support request
Priority:critical
Assigned:Unassigned
Status:active
Description

Well, I have tried to set up this module. I have uploaded a couple of files, indexed them but they do not appear in the search results. I have checked the database - the fulltext fields are filled with text from the files. Does anyone know how to solve this problem?

#1

joeBro - May 13, 2008 - 13:19

Same with me here.
The settings are all set. Swish-e recognizes all plug-ins (or at least Drupal recognizes the plug-ins' paths), the indexing seems to work (corresponding entries in the database), but a search returns no result. It is maybe interesting to note, that the "Beginn Swish-E Indexing"-Checkbox stays unchecked after checking and saving the settings (I get no error-message, but only an empty bullet point and a "Your Swish-E settings has been saved"-statement).

(see also http://drupal.org/node/163518)

#2

berlinonline2 - June 4, 2008 - 14:40

Hi Marcin, Hi joeBro,

I think I have a solution for your problem if you are using swish-e indexer on a windows system. I had the same issue and after an hour I found out that there was a problem with the Slash/Backslash-Handling in the Code. Try to change the following lines in the files mentioned below. That worked fine for me. Good luck.

In File: swish.integration.inc

Line 13 find
$file_path = getcwd() .'/' . file_directory_path();
and replace with:
$file_path = getcwd() .'\\' . file_directory_path();

Line 59 find
$swish_indx_cmd .= " -f $file_path/my_swish_index"; // save the index to the files directory
and replace with:
$swish_indx_cmd .= " -f $file_path\my_swish_index"; // save the index to the files directory

Line 60 find
exec (escapeshellcmd($swish_indx_cmd), $results, $rv);
and replace with:
exec ($swish_indx_cmd, $results, $rv);

In File: swish.module

Line 61 find
$swish_index = getcwd() .'/' . file_directory_path().'/'. 'my_swish_index';
and replace with:
$swish_index = getcwd() .'\\' . file_directory_path().'\\'. 'my_swish_index';

Line 78 find
$swish_command = variable_get("swish_path","/usr/local/bin/swish-e") . escapeshellcmd(" -m 50 -f $swish_index  -w ").$words;     
and replace with:
$swish_command = variable_get("swish_path","/usr/local/bin/swish-e") . " -m 50 -f $swish_index  -w ".$words;     

After applying the changes go to the config menue for swish-e and try again the "Begin Swish-E Indexing". After that run cron.php and then try a search. In the advanced search menue you will find a tab called "files" an there you can search within the files.

#3

nrasmus - July 8, 2008 - 19:47

I'm having the same issue here--and I'm on Debian Etch, so I don't think the slash issue is at play here . . .

#4

nrasmus - July 10, 2008 - 22:57

Just checking in about this--I have a multisite install, and am experiencing the same behavior on several sites. For each site where swish is enabled, it looks like each cron run is creating a new swishstring file in its respective tmp directory. Indexing via command line works, but nothing is getting indexed for the particular site. Any ideas?

#5

Marcin Pajdzik - July 22, 2008 - 16:53

Hi nrasmus

I have given up using swich indexer and installed this module: http://drupal.org/project/search_attachments
It works all right for me.

For my new project, a multisite, I am trying to use Xapian: http://www.trellon.com/blog/xapian-search-drupal
It looks promising. I have installed it and indexed all my nodes. However I have not tried to index any external files.

#6

geme4472 - July 29, 2008 - 15:58

In the common.conf file, set reporting to 4, run the on-the-fly indexing, and see if there are troubles on the indexing side of things.

IndexReport 4
ParserWarnLevel 4

You could also rip open the index file and check that there are words in there. It is semi-human-readable.

If it still looks like everything is fine, the issue is truly on the search side. There's always the chance that apache doesn't have rights to run swish--which may be why you can search from CLI and get results.

Swish-e is insanely fast, and, depending on your needs, might be a great option, so don't give up quite yet!

#7

miiimooo - November 20, 2008 - 13:20

I get this - any ideas why it doesnt use the converters?

html.doc - Using DEFAULT (HTML2) parser - (no words indexed)
Email_Internet.doc - Using DEFAULT (HTML2) parser - (no words indexed)
dreamweaver front.doc - Using DEFAULT (HTML2) parser - (no words indexed)
PGP.doc - Using DEFAULT (HTML2) parser - (no words indexed)
dreamweaver.doc - Using DEFAULT (HTML2) parser - (no words indexed)

Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 65 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: ... Writing word text: Complete
Writing word hash: ... Writing word hash: 10% Writing word hash: 20% Writing word hash: 30% Writing word hash: 40% Writing word hash: 50% Writing word hash: 60% Writing word hash: 70% Writing word hash: 80% Writing word hash: 90% Writing word hash: 100% Writing word hash: Complete
Writing word data: ... Writing word data: Complete
65 unique words indexed.
Sorting property: swishdocpath Sorting property: swishtitle Sorting property: swishdocsize Sorting property: swishlastmodified 4 properties sorted.
8 files indexed. 601,911 total bytes. 70 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!

 
 

Drupal is a registered trademark of Dries Buytaert.