Skip to main content.
home | support | download

Back to List Archive

Re: Spider.pl not getting past the first page.

From: Joseph Couture <joseph.couture(at)not-real.gmail.com>
Date: Mon Nov 14 2005 - 16:28:38 GMT
Here are the headers beeing returned:

HTTP/1.1 200 OK
Date: Mon, 14 Nov 2005 16:00:11 GMT
Server: Apache/2.0.46 (Red Hat)
Accept-Ranges: bytes
X-Powered-By: PHP/4.3.2
Connection: close
Content-Type: text/html; charset=ISO-8859-1


On 11/14/05, Bill Moseley <moseley@hank.org> wrote:
> On Mon, Nov 14, 2005 at 10:07:35AM -0500, Joseph Couture wrote:
> > I turned on all the debugging options and went through the output. No errors.
> >
> > This is the last thing before the summary:
> >
> > ! Found 65 links in http://support4.sbcma.com/support4/xchange4.html
> >
> > sleeping 5 seconds
> > Unexpected field value
> > http://support4.sbcma.com/support4/xchange4.html at (eval 13) line
> > 1
>
> Well, google turns up that it's an error in HTTP::Headers.pm.  So it
> looks like there's a problem in an http header.  Maybe the page you
> are spidering is using an invalid header??
>
> --
> Bill Moseley
> moseley@hank.org
>
> Unsubscribe from or help with the swish-e list:
>   http://swish-e.org/Discussion/
>
> Help with Swish-e:
>   http://swish-e.org/current/docs
>   swish-e@sunsite.berkeley.edu
>
>
Received on Mon Nov 14 08:29:13 2005