Dear Kei,
The best person to ask about the Swish Spider is Ron Samuel Klatchko.
Address your messages to the discussion group at
swish-e@sunsite.berkeley.edu
(I can see that you have cc'ed a copy of your email to this address, so
that should be fine)
As far as I know, the Swish Spider is a Perl program which enables Swish-E
to Spider sites on the Web. It reads through links on Web documents (such
as the <a href="website"></a> links) enabling Swish-E to index those
documents.
To use the Spider, use the "-S http" option on the Swish-E command line. In
the .config file, set the spider's maximum depth using "MaxDepth" and the
starting document using "IndexDir".
You are recommended to use the HTTP file access method, i.e. the spidering
method, only when you cannot use the FILESYSTEM method. This is because the
HTTP method is slower.
Note:
Although the documentation says that you cannot use "NoContents" when using
the HTTP method, this does appear to work, and is a useful way of getting
Swish-E to ignore certain files you do not wish to index.
Chris Humphries
-----Original Message-----
From: 97909585d [SMTP:97909585d@polyu.edu.hk]
Sent: Monday, February 21, 2000 2:39 PM
To: ChrisJMH@vermilion99.freeserve.co.uk
Cc: swish-e@sunsite.berkeley.edu
Subject: some questions of swishspider
Hi Chris
I have some questions about swish-e. What is the function of swishspider?
and
what is the role of the swishspider in SWISH-E? Where is calling this
swishspider
in C. I don't know which one I should turn these questions
to. So if the method for asking these questions is wrong,
Could you forward this message to suitable person that answer these
questions.
If right, please reply me as soon as possible. Thanks.
Kei
Received on Mon Feb 21 10:36:27 2000