I added support for powerpoint indexing. Although the catdoc package has a catppt tool, I found the ppthtml in xlhtml package to work much better.
There is also a funny problem with the title of the doc when using ppthtml, which I fixed temporarily by using $basename for the title when parsing the search results.
Would be nice to have the context of the word searched, but not sure if swish-e provides this.
Otherwise - great work!
| Comment | File | Size | Author |
|---|---|---|---|
| #3 | ppt.png | 819 bytes | seannyob |
| #2 | swish.patch | 3.56 KB | seannyob |
| swish.module | 11.48 KB | sfarestam |
Comments
Comment #1
sofiya commentedhi sfarestam. thanks for the lead. i'll surely integrate your patch in the next release.
Comment #2
seannyob commentedsfarestam did great work, but i found it odd that his comment was so positive about ppthtml yet actually implemented catppt. ;)
This patch does almost exactly what his does but utilizes ppthtml. Assumes default path to ppthtml is in /usr/local/bin, which is the assumption for most other filters here, however, as I use debian, I don't know where that application would be installed on boxen that use RPMs. Or windows, golly who knows.
In addition, I added ppt verbiage to the code comments, fixed a spelling error, etc.
Also please note that if you apply this patch to an exisiting swish.module you probably want to drop a ppt.png into modules/swish/images.
Compliments of Colley Graphics, LLC.
Comment #3
seannyob commentedOops. That link isn't right.
The a png ppt icon is available here, among other places: http://colleygraphics.com/files/ppt.png.
Trying to attach it to this comment, also. We'll see if that works.
Sean
Comment #4
populist commentedthe 4.7 version of swish-e module uses ppthtml
Comment #5
(not verified) commented