to solve my problem with the doc2txt.pm, i had been done to add this lines
before the doc2txt sub in your script :
my $file = shift || die "Usage: $0 <filename>\n";
system("catdoc -a $file > /tmp/toto.txt");
system("cat /tmp/toto.txt");
system("unlink /tmp/toto.txt");
sub doc2txt {........etc.
---------------------------------------------------
Zeni Corporation - Integrated Net Maker
Société cotée à la Bourse de Paris - Euroclear: 7543
Tél: 0825 003 111 - Fax : 03 44 57 35 55
www.zeni.fr - michael.remy@zeni.fr
---------------------------------------------------
-----Message d'origine-----
De : Michael REMY [mailto:michael.remy@zeni.fr]
Envoyé : mardi 11 février 2003 09:53
À : Multiple recipients of list
Objet : SWISH-e Problem with doc2txt
WHY DO I HAVE THIS MESSAGE with my index command :
swish-e -cind_138.conf -l -v 3 -T
----------------------------------------------------------------------------
----
Parsing config file 'ind_138.conf'
Indexing Data Source: "File-System"
Indexing "test1.doc"
Checking file "test1.doc"...
test1.doc - Using TXT parser - (no words indexed)
Indexing "test2.doc"
Checking file "test2.doc"...
test2.doc - Using TXT parser - (no words indexed)
Removing very common words...
no words removed.
Writing main index...
err: No unique words indexed!
.
----------------------------------------------------------------------------
---
why ?????????? no words ?? err ???
Whatever the doc2txt.pm file, the result (error message and no words
indexed) is the same
---------------------------------------------------
Zeni Corporation - Integrated Net Maker
Société cotée à la Bourse de Paris - Euroclear: 7543
Tél: 0825 003 111 - Fax : 03 44 57 35 55
www.zeni.fr - michael.remy@zeni.fr
---------------------------------------------------
Received on Tue Feb 11 10:47:23 2003