Skip to main content.
home | support | download

Back to List Archive

Re: [swish-e] Looking for the list of indexed words

From: Bernard T. Higonnet <"Bernard>
Date: Sun, 11 May 2014 23:20:09 +0200
Thanks!

Hit the spot. I have found  some misspellings as expected, and now know 
why TranslateCharacters "doesn't work". It was really quite trivial, but 
I will only tell you what it was if you insist on  knowing what a nudnik 
I am...

Thanks again
Bernard Higonnet

On 05/11/14 21:06, Peter Karman wrote:
> On 5/11/14 9:39 AM, Bernard T. Higonnet wrote:
>> Hello,
>>
>> Perhaps I am confusing swish/e with htdig which I used a long time ago,
>> but it seems to me that the indexing process produces a file containing
>> all the words actually encountered in the corpus which was indexed.
>>
>> I have two reason for wanting to look at this file:
>>
>> 1) it is very useful for detecting misspellings
>> 2) I'm trying to figure out why TranslateCharacters doesn't seem to work
>> for me
>>
>
> there is no stand-alone file. You can dump all the words in the index with:
>
>   % swish-e -T INDEX_WORDS
>
> look at this script for example:
>
> http://svn.swish-e.org/libswish3/trunk/perl/countwords.pl
>
>
>

_______________________________________________
Users mailing list
Users(at)not-real.lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Sun May 11 2014 - 21:20:08 GMT