Skip to main content.
home | support | download

Back to List Archive

Help Getting the PDF Filter to Work on a Windows Machine

From: Nathan Schile <nathan.schile(at)not-real.mchsi.com>
Date: Tue Nov 04 2003 - 02:49:37 GMT
I am trying to filter pdf files with SWISH-E.  My pdf file is located at =
F:/SWISH-E/TeenSnapshot.pdf

I used the example8.conf as my base point:

    IncludeConfigFile "F:/SWISH-E/conf/example4.config"
    IndexDir "F:/SWISH-E/"
    IndexOnly .pdf
    FileFilter .pdf "F:/SWISH-E/lib/swish-e/_pdf2html.pl"

I also made the following change in the _pdf2html.pl file
    =20
     $ENV{PATH} =3D 'F:/SWISH-E/lib/swish-e/'

When I run the index command, I recieve the following output:

F:\SWISH-E>SWISH-E -c "F:\SWISH-E\conf\example8.config"
Indexing Data Source: "File-System"
Indexing "F:/SWISH-E/"

Checking dir "F:/SWISH-E"...
'pdfinfo' is not recognized as an internal or external command,
operable program or batch file.
F:\SWISH-E\lib\swish-e\_pdf2html.pl: Failed close on pipe to pdfinfo for =
'F:\SWI
SH-E\TeenSnapshot.pdf': 256 at F:\SWISH-E\lib\swish-e\_pdf2html.pl line =
54.
Checking dir "F:/SWISH-E/conf"...
Checking dir "F:/SWISH-E/conf/stopwords"...
Checking dir "F:/SWISH-E/example"...
Checking dir "F:/SWISH-E/example/images"...
Checking dir "F:/SWISH-E/example/styles"...
Checking dir "F:/SWISH-E/html"...
Checking dir "F:/SWISH-E/html/images"...
Checking dir "F:/SWISH-E/lib"...
Checking dir "F:/SWISH-E/lib/swish-e"...
Checking dir "F:/SWISH-E/lib/swish-e/charsets"...
Checking dir "F:/SWISH-E/lib/swish-e/perl"...
Checking dir "F:/SWISH-E/lib/swish-e/perl/SWISH"...
Checking dir "F:/SWISH-E/lib/swish-e/perl/SWISH/Filters"...

Removing very common words...
no words removed.
Writing main index...
err: No unique words indexed!

Any help is appriciated, Thanks!


*********************************************************************
Due to deletion of content types excluded from this list by policy,
this multipart message was reduced to a single part, and from there
to a plain text message.
*********************************************************************
Received on Tue Nov 4 03:02:20 2003