I'm running spider.pl to index a small site and i'm running into a problem
I haven't had with other sites. Swish-e seems to index alright but the index
missing after it's finished. My IndexFile in the config points to the right
place but the
file is missing. I'm not sure if this output helps but this is what I get:
Summary for: http://www.generac-portables.com
Connection: Close: 293 (0.1/sec)
Duplicates: 3,701 (0.9/sec)
Off-site links: 1,285 (0.3/sec)
Total Bytes: 160,789,314 (41175.2/sec)
Total Docs: 294 (0.1/sec)
Unique URLs: 294 (0.1/sec)
http://www.generac-portables.com/data/pdf_files/pw/1421_0enw.pdf - Using
DEFAULT (HTML2) parser - (170 words)
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 8,466 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
8,466 unique words indexed.
6 properties sorted.
294 files indexed. 160,789,314 total bytes. 66,288 total words.
thanks for any help
Received on Fri Oct 8 10:02:21 2004