Skip to main content.
home | support | download

Back to List Archive

Re: avoid indexing php code

From: Klaus Hollenbach <hollenbach(at)not-real.scholze.de>
Date: Thu Dec 20 2001 - 08:53:21 GMT
Hi Bill,

I'm running SWISH-E 2.1 dev-24 "interim" Release (Win32) build
(http://www.webaugur.com/wares/files/swish-e-20011108-win32.zip) by
David Norris.

It seems that only parts of the php-script in a html-file are being
indexed.

content of test.php:
01 <html>
02 <head>
03 <title>Titletext</title>
04 </head>
05 <body>
06 Bodytext
07 <?php
08 do{
09         ...something...
10         echo ("<option>");
11 }while( expression );
12 ?>
13 </body>
14 </html>

content of c.conf:
Defaultcontents HTML

D:\>swishe -c c.conf -i test.php -v 0 -T indexed_words
Indexing Data Source: "File-System"
    Adding:[swishdefault:1]   'titletext'   Pos:1  Stuct:0x7 ( HEAD
TITLE FILE )
    Adding:[swishdefault:1]   'bodytext'   Pos:2  Stuct:0x9 ( BODY FILE
)
    Adding:[swishdefault:1]   'while'   Pos:3  Stuct:0x9 ( BODY FILE )
    Adding:[swishdefault:1]   'expression'   Pos:4  Stuct:0x9 ( BODY
FILE )
Indexing done!

Everything that comes after the ">"-Symbol in test.php, line 10, seems
to be incorporated in the index.

Replacing <?php ... ?> with <% ... %> in test.php leads to the same
results.

On running the above with "Defaultcontents HTML2" in c.conf, produces
the following swishe-errormessage:

err: Defaultcontents: Unknown document type "HTML2"

-- 
Klaus Hollenbach
SCHOLZE Ingenieurgesellschaft mbH
E-Mail:  hollenbach@scholze.de
Received on Thu Dec 20 08:53:31 2001