Skip to main content.
home | support | download

Back to List Archive

Filtering MS Word Documents

From: Sebastian Jayaraj <jayaraj(at)not-real.kosan.com>
Date: Fri Oct 14 2005 - 21:56:08 GMT
Hello All,

 I have been using swish-e for a while and it works beautifully while 
indexing PDF and XL files. I was trying to index MS word files and only 
the filenames were being indexed. So I tried a simple swish-filter-test 
and found this....

-------------------------------------------------
[root@tnt filters]# catdoc -V
Catdoc Version 0.93.3
[root@tnt filters]# swish-e -V
SWISH-E 2.4.2
[root@tnt filters]# swish-filter-test test.doc

Document test.doc was not filtered.
   Document:     test.doc  (test.doc)
   Content-Type: application/x-msword
   Parser type:

** /usr/local/bin/swish-filter-test:
  Skipping binary [test.doc]
------------------------------------------------

Catdoc by itself works fine and is in the right path. Any pointers or 
suggestions would be helpful.

thanks!
Sebastian
Received on Fri Oct 14 14:56:20 2005