Skip to main content.
home | support | download

Back to List Archive

Re: PDF indexing

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Fri Oct 04 2002 - 02:10:32 GMT
At 06:09 PM 10/03/02 -0700, sobrien@ci.bend.or.us wrote:
>It appears that pdf files passed to _pdf2html.pl for indexing that have
>spaces in the filenames are being rejected by pdfinfo and pdf2text.

>If I pass it the filename in quotes it processes it ok, I just can't figure
>out how to get swish-e to pass the filename off correctly.

Try editing _pdf2html.pl

  open F, "pdfinfo $file |" || die "$0: Failed to open $file $!";   

to

  open F, "pdfinfo '$file' |" || die "$0: Failed to open $file $!"; 

and change

  open F, "pdftotext $file - |" or die "$0: failed to run pdftotext: $!";

to

  open F, "pdftotext '$file' - |" or die "$0: failed to run pdftotext: $!";


-- 
Bill Moseley
mailto:moseley@hank.org
Received on Fri Oct 4 02:14:59 2002