Re: Freezing up on PDFs...

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Sat Aug 07 2004 - 03:49:50 GMT

On Fri, Aug 06, 2004 at 03:14:06PM -0700, Anthony Baratta wrote:
> I've been struggling with using swish-e on a Windows 2000 server. I'm
> spidering the target site and when I hit a pdf file with "errors" (Missing
> 'endstream') the spider can lockup.
> 
> I've replaced the pdftotext program with the latest version (v3 1/22/2004)
> and tested it on the problematic pdfs. It throws the same errors but does
> create a "text" file with some garbage characters with all the text. It
> appears that swish-e is either waiting for an exit code that never comes
> from pdftotext or can not handle the output with garbage characters.

I've never seen this, but I'm not using Windows.

Can you create a test case that shows the problem, so others can
try it?
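
Something along these lines might help narrow it down. It is only a
rough sketch, not anything that ships with swish-e: it runs a command
with a time limit and reports whether the command exited on its own or
had to be killed. That would at least show whether pdftotext itself
hangs on the bad PDF or whether the hang is somewhere else. The
function name, the 30-second limit, and the assumption that pdftotext
is on the PATH are all mine.

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_seconds=30):
    """Run cmd and report whether it exited by itself or timed out.

    Hypothetical helper for a test case; not part of swish-e or xpdf.
    """
    try:
        done = subprocess.run(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            timeout=timeout_seconds,
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising this exception.
        return {"status": "timed_out", "returncode": None,
                "stdout_bytes": 0, "stderr": ""}
    return {
        "status": "exited",
        "returncode": done.returncode,
        "stdout_bytes": len(done.stdout),
        # Keep stderr so messages such as "Missing 'endstream'" are visible.
        "stderr": done.stderr.decode("latin-1"),
    }


if __name__ == "__main__" and len(sys.argv) > 1:
    # Assumes pdftotext is on the PATH; "-" sends the text to stdout.
    print(run_with_timeout(["pdftotext", sys.argv[1], "-"]))
```

Saved under some name of your choosing and run with the problem PDF as
its only argument, it prints the result. If that comes back "exited"
with a return code, pdftotext is finishing and the lockup is more
likely in how its output is being read.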

BTW --

> config file
> 
> IndexDir perl.exe
> SwishProgParameters "C:\\Progra~1\\SWISH-E\\lib\\swish-e\\spider.pl"
> default "http://www.site.com"

Can Windows 2K run Perl scripts directly?

-- 
Bill Moseley
moseley@hank.org
Received on Fri Aug 6 20:50:12 2004