Skip to main content.
home | support | download

Back to List Archive

Re: error trying to spider

From: Bill Moseley <moseley(at)not-real.hank.org>
Date: Tue Mar 25 2003 - 18:32:36 GMT
On Tue, 25 Mar 2003, Jody Cleveland wrote:

> $ swish-e -S http -c swish.config
> Indexing Data Source: "HTTP-Crawler"
> Indexing "http://oshkoshpubliclibrary.org"
> retrieving http://oshkoshpubliclibrary.org (0)...
> 
> Removing very common words...
> no words removed.
> Writing main index...
> err: No unique words indexed!
> 
> I just know there's got to be something simple I'm missing here.

Here's just a guess:

HEAD http://oshkoshpubliclibrary.org
500 Can't connect to oshkoshpubliclibrary.org:80 (Bad hostname 'oshkoshpubliclibrary.org')
Client-Date: Tue, 25 Mar 2003 18:16:47 GMT

> HEAD http://www.oshkoshpubliclibrary.org
200 OK
Connection: close
Date: Tue, 25 Mar 2003 18:21:24 GMT
Accept-Ranges: bytes
ETag: "f068857ac6ecc21:9cb"
Server: Microsoft-IIS/5.0
Content-Length: 20623
Content-Location: http://www.oshkoshpubliclibrary.org/welcome.html
Content-Type: text/html
Last-Modified: Mon, 17 Mar 2003 20:47:52 GMT
Client-Date: Tue, 25 Mar 2003 18:18:04 GMT
Client-Response-Num: 1


-- 
Bill Moseley moseley@hank.org
Received on Tue Mar 25 18:36:34 2003