Skip to main content.
home | support | download

Back to List Archive

Re: Ranking Question

From: Nikolaus Rath <Nikolaus(at)not-real.rath.org>
Date: Wed Dec 18 2002 - 00:37:05 GMT
Bill Moseley <moseley@hank.org> wrote:
> On Tue, 17 Dec 2002, Nikolaus Rath wrote:
> 
>> I have some questions regarding the ranking of search results:
>> 
>> - Does the context of the match influence its ranking? (e.g.,
>>  "<h1>match</h1>" ranks higher than "<p>match" when searching for
>>  "match"). And if so, which tags are used?
> 
> Yes.  It's a hole in the documentation, it seems.
[..]

That is what i wanted to know. Thanks a lot.

>> - Does swish-e search the given pattern also in all the meta tags? If
>>   so, does a meta match count more than a "normal" match?
> 
> Yes.

Hmm. I installed and tried:

----snip----
nikratio:~/test$ ls
config  test2.html  test.html
nikratio:~/test$ cat config 
IndexDir .
IndexReport 1
MetaNames key
nikratio:~/test$ cat test.html 
<html>
<head>
<meta name="key" content="pattern">
</head>
<body>
some words blub foo bar
</body>
</html>
nikratio:~/test$ cat test2.html 
<html>
<head>
<meta name="key" content="pattern">
</head>
<body>
some words blub foo bar
pattern
</body>
</html>
nikratio:~/test$ swish-e -c config
Indexing Data Source: "File-System"
Indexing "."
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 14 words alphabetically
Writing header ...
Writing index entries ...
  Writing word text: Complete
  Writing word hash: Complete
  Writing word data: Complete
14 unique words indexed.
4 properties sorted.                                              
5 files indexed.  401435 total bytes.  23 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!
nikratio:~/test$ swish-search -w "pattern"
# SWISH format: 2.2.1
# Search words: pattern
# Number of hits: 1
# Search time: 0.000 seconds
# Run time: 0.032 seconds
1000 ./test2.html "test2.html" 113
.
nikratio:~/test$ swish-search -w "key=pattern"
# SWISH format: 2.2.1
# Search words: key=pattern
# Number of hits: 2
# Search time: 0.000 seconds
# Run time: 0.032 seconds
1000 ./test2.html "test2.html" 113
1000 ./test.html "test.html" 105
.
-----snip------

It seems that swish does not match the pattern in the meta fields
unless a meta match is explicitly requested. 

   --Nikolaus
Received on Wed Dec 18 00:37:25 2002