I added support for powerpoint indexing. Although the catdoc package has a catppt tool, I found the ppthtml in xlhtml package to work much better.

There is also a funny problem with the title of the doc when using ppthtml, which I fixed temporarily by using $basename for the title when parsing the search results.

Would be nice to have the context of the word searched, but not sure if swish-e provides this.

Otherwise - great work!

CommentFileSizeAuthor
#3 ppt.png819 bytesseannyob
#2 swish.patch3.56 KBseannyob
swish.module11.48 KBsfarestam

Comments

sofiya’s picture

Status: Active » Closed (fixed)

hi sfarestam. thanks for the lead. i'll surely integrate your patch in the next release.

seannyob’s picture

Status: Closed (fixed) » Needs review
StatusFileSize
new3.56 KB

sfarestam did great work, but i found it odd that his comment was so positive about ppthtml yet actually implemented catppt. ;)

This patch does almost exactly what his does but utilizes ppthtml. Assumes default path to ppthtml is in /usr/local/bin, which is the assumption for most other filters here, however, as I use debian, I don't know where that application would be installed on boxen that use RPMs. Or windows, golly who knows.

In addition, I added ppt verbiage to the code comments, fixed a spelling error, etc.

Also please note that if you apply this patch to an exisiting swish.module you probably want to drop a ppt.png into modules/swish/images.

Compliments of Colley Graphics, LLC.

seannyob’s picture

StatusFileSize
new819 bytes

Oops. That link isn't right.

The a png ppt icon is available here, among other places: http://colleygraphics.com/files/ppt.png.

Trying to attach it to this comment, also. We'll see if that works.

Sean

populist’s picture

Status: Needs review » Fixed

the 4.7 version of swish-e module uses ppthtml

Anonymous’s picture

Status: Fixed » Closed (fixed)