Skip to main content.
home | support | download

Back to List Archive

Re: swish-e 2.1 hangs for a very long time

From: Michael <michael(at)not-real.insulin-pumpers.org>
Date: Sat Jul 13 2002 - 17:31:49 GMT
> On Sat, 2002-07-13 at 03:52, Michael wrote:
> > Gets stuck here for maybe 5-10 minutes with 99% CPU usage
> 
> 99% CPU usage?  That's definitely odd.
> 
> Which operating system and compiler?  CPU type?  PERL version?
> 

500mhz Celeron +256meg ram, lots of disk
Linux pandora 2.4.17-pre8 #3 Thu Dec 13 11:42:50 PST 2001 i686 unknown

Reading specs from /usr/lib/gcc-lib/i386-slackware-linux/2.95.3/specs
gcc version 2.95.3 20010315 (release)

Summary of my perl5 (revision 5.0 version 6 subversion 1)
configuration:


> What happens when you run this:
>   perl swishspider ./testing http://members.aol.com/CamelsRFun/
> 

> Does this take 5-10 minutes and use 100% CPU?
> 
> 
immediately after start
  PID USER     PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME COMMAND
22725 diabetes  19   0  4484 4484  1552 R    99.3  1.8   0:28 perl

~5-6 minutes later
  PID USER     PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME COMMAND
22725 diabetes  18   0  4496 4496  1552 R    98.2  1.8   5:09 perl

~ 7-8 minutes
  PID USER     PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME COMMAND
22725 diabetes  17   0  4496 4496  1552 R    98.6  1.8   7:03 perl

there does not appear to be any network traffic at all after the 
first snapshot per netwatch -- which show no packet activity to aol 
addresses. Network activity in the first few seconds is...
       remote                  xmit           rcv
member6.aol.com       45187       1943
this remains unchanged after task terminates normally

output files remain in this state until the very end
      43075 Jul 13 10:18 testing.contents
          0 Jul 13 10:18 testing.links
         14 Jul 13 10:18 testing.response

when they change to....
      43075 Jul 13 10:18 testing.contents
        4290 Jul 13 10:26 testing.links
          14 Jul 13 10:18 testing.response

time marks .. start .. end
Sat Jul 13 10:18:22 PDT 2002
Sat Jul 13 10:26:20 PDT 2002

output files attached

> Your config file and command line options are working fine here
> using the current CVS under Redhat 7.1.  SWISH-E's average CPU load
> is 0.1% with your config on my 1 GHz Athlon.
> 
> 
> SWISH-E's Report:
> 
> Removing very common words...
>   Getting IgnoreLimit stopwords: Complete                           
>   
> no words removed.
> Writing main index...
> Sorting words ...
> Sorting 3188 words alphabetically
> Writing header ...
> Writing index entries ...
>   Writing word text: Complete
>   Writing word hash: Complete
>   Writing word data: Complete
> 3188 unique words indexed.
> 7 properties sorted.                                              65
> files indexed.  321038 total bytes.  22202 total words. Elapsed
> time: 00:05:31 CPU time: 00:00:00 Indexing done!
> 
> -- 
>  David Norris
>   Dave's Web - http://www.webaugur.com/dave/
>   Augury Net - http://augur.homeip.net/
>   ICQ - 412039
> 
Received on Sat Jul 13 17:35:23 2002