Re: Filtering MS Word Documents

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Mon Oct 17 2005 - 05:09:30 GMT
On Fri, Oct 14, 2005 at 02:54:50PM -0700, Sebastian Jayaraj wrote:
> Hello All,
> 
>  I have been using swish-e for a while, and it works beautifully when 
> indexing PDF and XL files. When I tried to index MS Word files, only 
> the filenames were indexed. So I ran a simple swish-filter-test 
> and found this:
> 
> -------------------------------------------------
> [root@tnt filters]# catdoc -V
> Catdoc Version 0.93.3
> [root@tnt filters]# swish-e -V
> SWISH-E 2.4.2
> [root@tnt filters]# swish-filter-test test.doc
> 
> Document test.doc was not filtered.
>    Document:     test.doc  (test.doc)
>    Content-Type: application/x-msword
>    Parser type:
> 
> ** /usr/local/bin/swish-filter-test:
>   Skipping binary [test.doc]
> ------------------------------------------------
> 
> Catdoc by itself works fine and is in the right path. Any pointers or 
> suggestions would be helpful.

One suggestion: rerun the swish-filter-test command above with the -v
option. Also try running it as a normal user instead of root.
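
A rough sketch of that check, in case it helps. It only reports which
user is running the test and whether the helper programs are visible on
that user's PATH, since root's PATH can differ from a normal user's. The
check_helper function is a made-up name for illustration, not part of
swish-e, and this is a guess at where to look rather than a diagnosis.

```shell
#!/bin/sh
# check_helper is a hypothetical helper, not something shipped with swish-e.
# It prints whether the named program can be found on the current PATH.
check_helper() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found at $(command -v "$1")"
    else
        echo "$1: not found in PATH"
    fi
}

# Show who is running the test and which PATH that user has.
id -un
echo "PATH=$PATH"

# Both programs from the original report need to be reachable.
check_helper catdoc
check_helper swish-filter-test

# Then repeat the original test with -v, as suggested above:
#   swish-filter-test -v test.doc
```

Run it once as root and once as a normal user and compare the output.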

-- 
Bill Moseley
moseley@hank.org

Received on Sun Oct 16 22:09:54 2005