Re: A problem with swishspider

From: Andrew <editorial(at)not-real.mcmail.com>
Date: Sun Mar 07 1999 - 12:58:18 GMT
Dear Marina and Swish-e users,

We have seen exactly the same problem, though strangely it only appeared after we switched servers.

Our new server, like yours, contains the Parser.pm module, but not Parse.pm.

Our server technician thought this might be a spelling error, but changing the name to HTML::Parser just introduced new errors.

However, I checked the CPAN archive and there is an HTML::Parse module there. Even so, our server people are adamant that the library containing this module has been fully installed.

We therefore think the problem must be a path error in how the spider script looks up the module on the server, but unfortunately we are not sure how to resolve it.
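
One way to test the path theory is to search every directory Perl actually consults for the file it says it cannot find. The following is only a minimal sketch: `find_pm` is a hypothetical helper name, not part of swish-e or Perl, and it assumes a POSIX shell with `perl` on the PATH and no spaces in the @INC directory names.

```shell
# find_pm MODULE DIR...
# Hypothetical helper: print each DIR that holds the .pm file for MODULE.
find_pm() {
    # Turn a module name such as HTML::Parse into HTML/Parse.pm
    rel=$(printf '%s' "$1" | sed 's|::|/|g').pm
    shift
    for dir in "$@"; do
        if [ -f "$dir/$rel" ]; then
            printf '%s\n' "$dir/$rel"
        fi
    done
    return 0
}

# Ask Perl for its search path (skipping any code-reference entries)
# and look for both module files mentioned in this thread.
# Word splitting is intentional here, hence the no-spaces assumption.
find_pm HTML::Parse  $(perl -e 'print join(" ", grep { !ref } @INC)')
find_pm HTML::Parser $(perl -e 'print join(" ", grep { !ref } @INC)')
```

If the first call prints nothing while the second prints a path, the file is genuinely absent from the search path rather than misnamed. If Parse.pm does exist somewhere outside @INC, pointing the PERL5LIB environment variable at that directory before running swish-e is one possible way to make it visible to swishspider. If it exists nowhere, the module may belong to a different distribution than HTML-Parser (on present-day CPAN, HTML::Parse ships with HTML-Tree); whether that holds for the versions discussed here is an assumption to verify.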

Andrew Cadman
NetLondon Ltd.

Marina Markus wrote:

> Dear SWISH-E users,
>
> I hope that one of you with experience will be able
> to help us with a strange installation problem.
>
> We are trying to install SWISH 1.3.1 on our Web Server.
> The installation itself passes OK. However, when we try to
> run it:
>
> swish-e -c user.config
>
> we get an error message that "HTML/Parse.pm", required by
> "swishspider", cannot be found in @INC.
>
> Indeed, in the "HTML" directory under "site_perl" (where it
> looks for include files) there is a file "Parser.pm" and not
> "Parse.pm". Simply changing the module name does not help,
> because the function "parse_html" that "swishspider" calls
> does not exist in that other file.
>
> Our Perl version is "5.004_04 built for irix-n32". All additional relevant
> packages are installed - libwww-perl (v5.36), MIME-Base64 (v 2.11),
> HTML-Parser (v. 2.22), libnet (v. 1.0606), Digest-MD5 (v.2.02).
>
> The full text of error message is:
> ----
> Indexing Data Source: "HTTP-Crawler"
> Can't locate HTML/Parse.pm in @INC (@INC contains: /usr/share/lib/perl5/irix-n32/5.00404 /usr/share/lib/perl5 /usr/share/lib/perl5/site_perl/irix-n32 /usr/share/lib/perl5/site_perl /usr/share/lib/perl5/sgi_perl .) at /usr/local/src/swish-1.3.4/src/swishspider line 7.
> BEGIN failed--compilation aborted at /usr/local/src/swish-1.3.4/src/swishspider line 7.
> retrieving http://www.bgu.ac.il/index.html (0)...
> Can't locate HTML/Parse.pm in @INC (@INC contains: /usr/share/lib/perl5/irix-n32/5.00404 /usr/share/lib/perl5 /usr/share/lib/perl5/site_perl/irix-n32 /usr/share/lib/perl5/site_perl /usr/share/lib/perl5/sgi_perl .) at /usr/local/src/swish-1.3.4/src/swishspider line 7.
> BEGIN failed--compilation aborted at /usr/local/src/swish-1.3.4/src/swishspider line 7.
>
> Removing very common words... no words removed.
> Writing main index... no unique words indexed.
> Writing file index... no files indexed.
> Running time: 1 minute.
> Indexing done!
> ----
>
> I would be very grateful for any ideas on what to try.
>
> Thank you in advance,
>
> Marina Markus
> Ben-Gurion University Computer Center
> Beer-Sheva,Israel
> ======================
> mary@bgumail.bgu.ac.il
Received on Sun Mar 7 04:56:10 1999