Skip to main content.
home | support | download

Back to List Archive

Re: Swish-e Spider and Following Image-based Links

From: James <swish.enhanced(at)not-real.gmail.com>
Date: Wed Jan 10 2007 - 15:52:21 GMT
On 1/10/07, Bill Moseley <moseley@hank.org> wrote:
> On Wed, Jan 10, 2007 at 07:22:16AM -0800, James wrote:
> > > Show your work.
> >
> > It's not necessarily "my work" or my site.
>
> Look, you claim that the spider isn't following a link.  Support your
> claim with an example that shows the problem.  You are just wasting my
> time now.

I do not endeavor to waste anyone's time.  I have supported my claim
with the code.

>
>
> > It's not the image links.  It's the image links without alt tags.
>
> Again, on what basis are you making that up?
>
> > <a href="http://mydomain.org/home/" target="_self"><img
> > src="images/entersite.gif" alt="" width="129" height="15" border="0"
> > /></a></td>
> >
> > That's the best I can do.  Swish-e won't follow that.
>
> Right.  You can't come up with a working example like the docs I
> pointed to you suggest?  Don't you have a web server?

I responded with simply the code, since Peter said that I didn't need
to provide a link.  So, you're just being downright rude.  Of course I
have a web-server.

> Oh, you don't
> even need one:
>
> moseley@bumby:~$ cat james.html
> <a href="holy_shit.html" target="_self">
> <img src="images/entersite.gif" alt="" width="129" height="15" border="0"> />
> </a>
>
> moseley@bumby:~$ cat holy_shit.html
> It works!
>
>
> moseley(at)not-real.bumby:~$ b/lib/swish-e/spider.pl default file:///home/moseley/james.html
> b/lib/swish-e/spider.pl: Reading parameters from 'default'
> Path-Name: file:///home/moseley/james.html
> Content-Length: 129
> Last-Mtime: 1168443376
> Document-Type: html*
>
> <a href="holy_shit.html" target="_self">
> <img src="images/entersite.gif" alt="" width="129" height="15" border="0"> />
> </a>

Why don't you test this on a real server.  By the way, you are a
filthy man.  That's terribly unprofessional of you.  Here you have an
open source project on a university site and you are being filthy and
unprofessional.

>
>
>
> Path-Name: file:///home/moseley/holy_shit.html
> Content-Length: 10
> Last-Mtime: 1168443388
> Document-Type: html*
>
> It works!
>
> Summary for: file:///home/moseley/james.html
>      Connection: Close:   1  (1.0/sec)
> Connection: Keep-Alive:   1  (1.0/sec)
>            Total Bytes: 139  (139.0/sec)
>             Total Docs:   2  (2.0/sec)
>            Unique URLs:   2  (2.0/sec)
>              text/html:   2  (2.0/sec)

Try it on a real site on a real server and stop being so filthy.

>
> --
> Bill Moseley

Thanks for your time,
No thanks for your filth,

James
Received on Wed Jan 10 07:52:22 2007