At 10:56 AM 01/29/02 -0800, Gordon Jessop wrote:
>Pedro Martins wrote:
>>
>> my question is: is
>> there a way i can see what words are beeing indexed...
>
>The following will print out all the words in your index:
>
>swish-e -k '*'
>
>The following will print out all the words in your index that start with the
>letter 'a'
>
>swish-e -k 'a'
Also, for more control:
Try ./swish-e -T help
Available debugging options for swish-e:
INDEX_HEADER => Show the headers from the index
INDEX_WORDS => List words stored in index
INDEX_WORDS_ONLY => List only words, one per line, stored in index
INDEX_WORDS_META => List only words and associated metaID separated
by a tab
INDEX_WORDS_FULL => List words stored in index (more verbose)
INDEX_STOPWORDS => List stopwords stored in index
INDEX_FILES => List file data stored in index
INDEX_METANAMES => List metaname table stored in index
INDEX_ALL => Dump data ALL above data from index file
-- indexing --
INDEXED_WORDS => Display words as they are indexed
PARSED_WORDS => Display words as they are parsed from source
PROPERTIES => Display properties associted with each file as
they are indexed
REGEX => Debug regular expression processing
PARSED_TAGS => Show meta tags as they are found
PARSED_TEXT => Show text as it's parsed
IIRC, for dumping the index you can also use -b and -m to dump a range of
files. But better to work with a few files when debugging.
--
Bill Moseley
mailto:moseley@hank.org
Received on Tue Jan 29 19:49:39 2002