Skip to main content.
home | support | download

Back to List Archive

stemming problem and output

From: Gaye Karagulle <gkaragulle(at)not-real.yahoo.com>
Date: Mon Feb 25 2002 - 20:43:04 GMT
hi.

sorry for the mail traffic today but I have two more
questions. 

1) I use this as config file,

IndexDir .
IgnoreWords File:
c:/swish-e/conf/stopwords/english.txt            
IndexOnly .html
UseStemming yes
IndexReport 1
 
it gives the warnings below, although it does some
stemming as seen below, and the directory it looks for
exists..



C:\SWISH-E>swish-e -c swish.conf
Indexing Data Source: "File-System"
Indexing "."

Warning: Failed to open dir
'.\example\SWISH-Stemmer-0.05\CVS\' :No such fil
 directory

Warning: Failed to open dir
'.\example\SWISH-Stemmer-0.05.tar\SWISH-Stemmer-
\' :No such file or directory
Removing very common words...
no words removed.
Writing main index...
Sorting words ...
Sorting 5 words alphabetically
Writing header ...
Writing index entries ...
  Writing word text: Complete
  Writing word hash: Complete
  Writing word data: Complete
5 unique words indexed.
4 properties sorted.
1 file indexed.  194 total bytes.  12 total words.
Elapsed time: 00:00:00 CPU time: 00:00:00
Indexing done!

C:\SWISH-E>swish-e -T index_words_full

-----> WORD INFO in index index.swish-e <-----

appl
 Meta:1 .\11.html Freq:2 Pos/Struct:6/9,7/9

docum
 Meta:1 .\11.html Freq:1 Pos/Struct:2/7

index
 Meta:1 .\11.html Freq:5
Pos/Struct:8/9,9/9,10/9,11/9,12/9

run
 Meta:1 .\11.html Freq:3 Pos/Struct:3/9,4/9,5/9

untitl
 Meta:1 .\11.html Freq:1 Pos/Struct:1/7


why do you things it gives these warnings?


2). 
 I specially want to learn that if there is an option
to take the output above, which we get using "swish-e
-T index_words_full" command, into a readable file,
for example TXT?

 thanks for your help.



__________________________________________________
Do You Yahoo!?
Yahoo! Sports - Coverage of the 2002 Olympic Games
http://sports.yahoo.com
Received on Mon Feb 25 20:43:42 2002