Re: [swish-e] all URLs

From: Alexander Dolgarev <a.dolgarev(at)>
Date: Sun Feb 03 2008 - 19:31:16 GMT
I've also noticed that for each subsequent URL the spider writes:

Summary for: http://XXX/msg00298.html
     Connection: Close:     1  (1.0/sec)
Connection: Keep-Alive:     4  (4.0/sec)
            Duplicates:     8  (8.0/sec)
        Off-site links:     2  (2.0/sec)
           Total Bytes: 7,950  (7950.0/sec)
            Total Docs:     5  (5.0/sec)
           Unique URLs:     5  (5.0/sec)
             text/html:     1  (1.0/sec)

Summary for: http://XXX/maillist.html
Duplicates: 1  (1.0/sec)

Summary for: http://XXX/msg00297.html
Duplicates: 1  (1.0/sec)

But these URLs are not duplicates. Where is the problem?

On Feb 3, 2008 8:53 PM, Alexander Dolgarev wrote:
> I have a problem with the spider. When I run
> /usr/local/lib/swish-e/ default <SOME_URL> | swish-e -c
> swish.conf -S prog -i stdin -f test
> I get a lot of messages like the following:
> Warning: document 'XXX' has no content
> When I look at the created index file I see that only the document
> <SOME_URL> was indexed; ALL other URLs (that were in this document) were
> not indexed. Log files on the HTTP server show that the spider retrieves
> the URLs and gets responses, e.g.:
> [03/Feb/2008:18:46:43 +0100] <XXX> GET /XXX HTTP/1.1 "200" 14758
> "swish-e" "-"       18
> That means that 14758 bytes were sent to the spider for URL <XXX>,
> but the output says: Warning: document 'XXX' has no content
Received on Sun Feb 3 14:31:17 2008