This is not an easy task...
We discussed it and there is IMO also some code for this topic
done by Jose.
We identified the following problems (as I remember...):
- filtered documents have modified (filtered) output ...
- Tags / stopwords may be removed ...
- returning the result position(s) (multiple occurence
of word in a doc) is not clear.
- Postprocessing for documents (e.g. for highlighting)
needed...
Today, IMO the best way to do so is the following:
- implement the following into the search cgi.
- when clicking on the document (e.g. html)
-> process searchwords by replacing
each found searchword in the doc with
the text you want to have...
like:
s#(searchword)#<FONT...>\$1</FONT>#g
cu rainer
> -----Original Message-----
> From: Scott Schultz [mailto:scott@ceweekly.com]
> Sent: Thursday, March 01, 2001 1:20 AM
> To: Multiple recipients of list
> Subject: [SWISH-E] RE: Word Locations
>
>
> Okay, I admit it. Trying to understand the Swish-E
> source code makes my head swim.
>
> Does the location structure used to store the location
> of individual words? Is this where the "position"
> variable in the results list elements comes from?
>
> In other words, is it possible to add some code to
> swish-e that will return the offsets of the words that were
> successfully matched? This could be used by the wrapper
> scripts to do keyword hilighting.
>
----------------------------------------------------------------------
This Mail has been checked for Viruses
Attention: Encrypted Mails can NOT be checked !
* * *
Diese Mail wurde auf Viren ueberprueft
Hinweis: Verschluesselte Mails koennen NICHT geprueft werden !
----------------------------------------------------------------------
Received on Thu Mar 1 12:59:54 2001