7-12 download of 2.1
swish-e 2.1 hangs for a very long time
What did I do wrong?
here's the scenario
/usr/local/bin/swish-e \
-i http://members.aol.com/CamelsRFun \
-c swish-e/SPIDER.GENERIC.CONFIG \
-f swish-e/spider.CamelsRFun.index.tmp -v 3 -S http
Parsing config file 'swish-e/SPIDER.GENERIC.CONFIG'
Indexing Data Source: "HTTP-Crawler"
Indexing "http://members.aol.com/CamelsRFun"
retrieving http://members.aol.com/CamelsRFun (0)...
retrieving http://members.aol.com/CamelsRFun/ (0)...
Gets stuck here for maybe 5-10 minutes with 99% CPU usage
but no packets are being sent/received via the network.
It then moves on in what appears to be a normal fashion
Note run time:
Removing very common words...
Getting IgnoreLimit stopwords: Complete
no words removed. Writing main index... Sorting words ... Sorting 3188
words alphabetically Writing header ... Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
3188 unique words indexed.
7 properties sorted. 65
files indexed. 321038 total bytes. 22202 total words. Elapsed time:
00:13:56 CPU time: 00:00:01 Indexing done!
Config file....
IndexDir http://www.insulin-pumpers.org
IndexFile ./swish.index
IndexName "Insulin Pumpers Mail Archive"
IndexDescription "no other index was specified."
IndexPointer "www.insulin-pumpers.org"
IndexAdmin "webmaster@insulin-pumpers.org"
MetaNames author description datamodified
IndexReport 3
UseStemming yes
PropertyNames author description datamodified
IgnoreTotalWordCountWhenRanking yes
MinWordLimit 4
WordCharacters abcdefghijklmnopqrstuvwxyz0123456789.-_'"
IgnoreLimit 80 1000
IndexComments 0
MaxDepth 4
Delay 5
TmpDir ./
Michael@Insulin-Pumpers.org
Received on Sat Jul 13 08:56:27 2002