Skip to main content.
home | support | download

Back to List Archive

Re: [swish-e] Looking for the list of indexed words

From: at <Peter>
Date: Sun, 11 May 2014 14:06:07 -0500
On 5/11/14 9:39 AM, Bernard T. Higonnet wrote:
> Hello,
> 
> Perhaps I am confusing swish/e with htdig which I used a long time ago, 
> but it seems to me that the indexing process produces a file containing 
> all the words actually encountered in the corpus which was indexed.
> 
> I have two reason for wanting to look at this file:
> 
> 1) it is very useful for detecting misspellings
> 2) I'm trying to figure out why TranslateCharacters doesn't seem to work 
> for me
> 

there is no stand-alone file. You can dump all the words in the index with:

 % swish-e -T INDEX_WORDS

look at this script for example:

http://svn.swish-e.org/libswish3/trunk/perl/countwords.pl



-- 
Peter Karman  .  http://peknet.com/  .  peter(at)not-real.peknet.com
_______________________________________________
Users mailing list
Users(at)not-real.lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Sun May 11 2014 - 19:06:12 GMT