Skip to main content.
home | support | download

Back to List Archive

Re: [swish-e] Ok - I do not understand what the problem is

From: at <Peter>
Date: Sat, 07 Jul 2012 20:33:46 -0500
Joseph Norris wrote on 6/29/12 6:12 PM:
> I have the following:
> swish-e -w "China Confirms Gold Price Suppression" -s names desc title 
> desc -f /home/cs/sw/swishe/gold/i
> ndex.swish-e
> No hits
> however when I do this:
> grep -l "China Confirms Gold Price Suppression" /home/cs/sw/swishe/gold/*
> I get a hit 

Try reducing your sample size to one doc, with the -T options as David suggests.

Here's a small self-contained example. I suggest you put together a similar
example of your own, which IME will often reveal the problem.

[karpet(at)not-real.pekmac:~/tmp/s]$ swish-e -c conf
Indexing Data Source: "File-System"
Indexing "docs"
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 8 words alphabetically
Writing header ...
Writing index entries ...
  Writing word text: Complete
  Writing word hash: Complete
  Writing word data: Complete
8 unique words indexed.
6 properties sorted.
1 file indexed.  96 total bytes.  8 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!

[karpet(at)not-real.pekmac:~/tmp/s]$ swish-e -w "China Confirms Gold Price Suppression" -s
names desc title
# SWISH format: 2.5.8
# Search words: China Confirms Gold Price Suppression
# Removed stopwords:
# Number of hits: 1
# Search time: 0.002 seconds
# Run time: 0.009 seconds
1000 docs/china.xml "china.xml" 96

[karpet(at)not-real.pekmac:~/tmp/s]$ cat docs/china.xml
 <title>China Confirms Gold Price Suppression</title>
 <names>foo bar baz</names>

[karpet(at)not-real.pekmac:~/tmp/s]$ cat conf
IndexReport 1
IndexOnly .xml
IndexContents XML*  .xml
IndexDir docs
IndexFile index.swish-e
ParserWarnLevel 1
IgnoreTotalWordCountWhenRanking yes
PropertyNames names title

Peter Karman  .  .  peter(at)
Users mailing list
Received on Sun Jul 08 2012 - 01:33:49 GMT