Skip to main content.
home | support | download

Back to List Archive

Swish-E and HTML documents with frames

From: Chris Humphries <ChrisJMH(at)not-real.vermilion99.freeserve.co.uk>
Date: Thu Feb 24 2000 - 18:50:47 GMT
I have been testing a new version of the Swish-E spider that can handle 
HTML documents with frames.

If the spider program detects that the document is a framed HTML, it 
recursively builds content by reading through the <frame src> pointers, and 
builds up a list of all the <a href> links that it finds. It then passes 
*this* back to the C program, which indexes the document as if it were one 
big HTML. The spidering will work as if all the <a href> links found in in 
the frameset HTML files were at level 1.

Would this be of use to anyone?

Chris Humphries
Received on Thu Feb 24 13:54:27 2000