On Sat, 15 Sep 2001, Paul Thomas wrote:
> Prev by Date:
> Next by Date:
> Prev by thread:
> Next by thread:
>
> These fields contain the Subject: text of emails which often contain
> the keyword being searched for. The result is that the list of emails
> presented after a keyword search includes ones that contain the keyword
> only in reference links to other pages, not in the text of the email itself.
I hate it when I search using Google.com and come across messages like
that.
You'll have to filter out those lines somehow, of course. Take a look at
the "-S prog" command line option. Basically, instead of indexing the real
.html files, swish-e indexes the output of a program. That program is one
you write yourself (e.g. a perl script): it reads the MHonArc files and
spits them out without the offending lines.
It's a bit more complicated, but I think that's the only solution.
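
To make that more concrete, here is a rough sketch of such a filter, in
Python rather than perl. It is only an illustration and rests on a few
assumptions: that each navigation link sits on a single line of the MHonArc
page, that the archive files end in .html, and that the indexer wants each
document preceded by "Path-Name:" and "Content-Length:" headers followed by a
blank line (that is how I remember the prog input format, so check the docs
for your version). The function names are made up for this example.

```python
import os
import sys

# Labels of the MHonArc navigation links named in the original question.
NAV_LABELS = (
    "prev by date:",
    "next by date:",
    "prev by thread:",
    "next by thread:",
)


def strip_nav_lines(html):
    """Return html with every line that contains a navigation label removed.

    Assumption: each navigation link occupies exactly one line.
    """
    kept = [
        line
        for line in html.splitlines(keepends=True)
        if not any(label in line.lower() for label in NAV_LABELS)
    ]
    return "".join(kept)


def emit(path, html, out):
    """Write one filtered document to the binary stream out.

    Assumption: the indexer expects Path-Name and Content-Length headers,
    a blank line, then exactly Content-Length bytes of content.
    """
    body = strip_nav_lines(html).encode("utf-8")
    header = "Path-Name: %s\nContent-Length: %d\n\n" % (path, len(body))
    out.write(header.encode("utf-8"))
    out.write(body)


def main(root):
    """Walk root and emit every .html file with the navigation lines removed."""
    for dirpath, _dirs, files in os.walk(root):
        for name in sorted(files):
            if name.endswith(".html"):
                path = os.path.join(dirpath, name)
                with open(path, encoding="utf-8", errors="replace") as fh:
                    emit(path, fh.read(), sys.stdout.buffer)
    sys.stdout.buffer.flush()


if __name__ == "__main__" and len(sys.argv) == 2:
    # Hypothetical usage: python strip_nav.py /path/to/archive
    main(sys.argv[1])
```

Run it by hand first and look at the output to make sure the four link lines
are really gone before pointing the indexer at it. If your MHonArc templates
wrap a link over several lines, the line-based match above won't be enough
and you'd need to match on the surrounding markup instead.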
Received on Sun Sep 16 00:02:15 2001