Skip to main content.
home | support | download

Back to List Archive

illegal seek?

From: David VanHook <dvanhook(at)not-real.mshanken.com>
Date: Tue Oct 22 2002 - 18:44:00 GMT
I'm using the spider.pl on a weekly basis to spider my entire site, about
20,000 files.  Here's the code from the simple perl script which runs it
(ugly, probably, but I'm no Perl expert).  What it does is pipe the spider
to a gzipped file, then launches two different swish indexings (one with
Fuzzy indexing, one with regular) on that same file:

===========================

($sec,$min,$hour,$mday,$mon,$year,$wday,$yday) = gmtime(time-86400);
$year += 1900;
$mon += 1;
if ($mon < 10) { $mon="0$mon" };
if ($mday< 10) { $mday="0$mday" };

system("/opt/swish-e/spider.pl | /usr/local/bin/gzip >
/opt/swish-e/docs.$mon$mday$year.gz");

system("/usr/local/bin/gzip -dc /opt/swish-e/docs.$mon$mday$year.gz |
nice -19 /usr/local/bin/swish-e -c /opt/swish-e/swish.config -S prog -i
stdin -v 1 >> /opt/swish-e/logs/SwishWeeklyLogs.$mon$mday$year.dat");

system("/usr/local/bin/gzip -dc /opt/swish-e/docs.$mon$mday$year.gz |
nice -19 /usr/local/bin/swish-e -c /opt/swish-e/swish.fuzzy.config -S
prog -i stdin -v 1 >> SwishWeeklyLogs.$mon$mday$year.dat");

===========================


When I look at the logfile this creates, I see this at the top:

============
Indexing Data Source: "External-Program"
Indexing "stdin"

Warning: Failed to properly close external program: Illegal seek
============


It appears to be going through the files properly, but is this "Illegal
seek" something to be concerned about?

Thanks very much.

David VanHook
Received on Tue Oct 22 18:47:55 2002