On Wed, 2002-02-13 at 13:50, Bill Moseley wrote:
> Yes, try this (and it works on ME)
> swish-e -S prog -c swish.conf -i stdin < ouput.txt
> But filters are still broken. Geeze.
I copied the test.txt, SwishSpiderConfig.pl, and command line from the
original message. This is using the SWISH-E 2002-02-09 Win32 build.
Windows 2000 Test (excuse the funky line breaks ;-):
C:\SWISH-E>swish-e -c test.txt -S prog -v 3
Indexing Data Source: "External-Program"
Indexing "c:\perl\bin\perl.exe"
c:\swish-e\spider.pl: Reading parameters from 'SwishSpiderConfig.pl'
-- Starting to spider: http://arena.internet2.edu/sample.htm --
?Testing 'test_url' user supplied function #1
'http://arena.internet2.edu:80/sam
ple.htm'
+Passed all 1 tests for 'test_url' user supplied function
?Testing 'test_response' user supplied function #1
'http://arena.internet2.edu:8
0/sample.htm'
+Passed all 1 tests for 'test_response' user supplied function
>> +Fetched 0 Cnt: 1 http://arena.internet2.edu:80/sample.htm 200 OK
text/html 3
3 parent:
! Found 0 links in http://arena.internet2.edu:80/sample.htm
c:\swish-e\spider.pl: Max indexed files Reached
Summary for: http://arena.internet2.edu/sample.htm
Total Bytes: 33 (33.0/sec)
Total Docs: 1 (1.0/sec)
Unique URLs: 1 (1.0/sec)
http://arena.internet2.edu:80/sample.htm - Using DEFAULT (HTML) parser
- (2 wor
ds)
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 2 words alphabetically
Writing header ...
Writing index entries ...
Writing word text: Complete
Writing word hash: Complete
Writing word data: Complete
2 unique words indexed.
4 properties sorted.
1 file indexed. 33 total bytes. 2 total words.
Elapsed time: 00:00:03 CPU time: 00:00:03
Indexing done!
--
David Norris
Dave's Web - http://www.webaugur.com/dave/
Augury Net - http://augur.homeip.net/
ICQ Universal Internet Number - 412039
E-Mail - dave@webaugur.com
"If you stare into an abyss long enough
it begins to stare back into you." --Nietzsche
Received on Wed Feb 13 23:20:17 2002