Skip to main content.
home | support | download

Back to List Archive

RE: SWISH-e Problem with doc2txt

From: Michael REMY <michael.remy(at)not-real.zeni.fr>
Date: Tue Feb 11 2003 - 10:46:30 GMT
to solve my problem with the doc2txt.pm, i had been done to add this lines
before the doc2txt sub in your script :

my $file = shift || die "Usage: $0 <filename>\n";
system("catdoc -a $file > /tmp/toto.txt");
system("cat /tmp/toto.txt");
system("unlink /tmp/toto.txt");

sub doc2txt {........etc.


---------------------------------------------------
Zeni Corporation - Integrated Net Maker
Société cotée à la Bourse de Paris - Euroclear: 7543
Tél: 0825 003 111 - Fax : 03 44 57 35 55
www.zeni.fr - michael.remy@zeni.fr
---------------------------------------------------

-----Message d'origine-----
De : Michael REMY [mailto:michael.remy@zeni.fr]
Envoyé : mardi 11 février 2003 09:53
À : Multiple recipients of list
Objet : SWISH-e Problem with doc2txt


WHY DO I HAVE THIS MESSAGE with my index command :
swish-e -cind_138.conf -l -v 3 -T

----------------------------------------------------------------------------
----
Parsing config file 'ind_138.conf'
Indexing Data Source: "File-System"
Indexing "test1.doc"

Checking file "test1.doc"...
  test1.doc - Using TXT parser -  (no words indexed)
Indexing "test2.doc"

Checking file "test2.doc"...
  test2.doc - Using TXT parser -  (no words indexed)

Removing very common words...
no words removed.
Writing main index...
err: No unique words indexed!
.

----------------------------------------------------------------------------
---

why ?????????? no words ?? err ???
Whatever the doc2txt.pm file, the result (error message and no words
indexed) is the same


---------------------------------------------------
Zeni Corporation - Integrated Net Maker
Société cotée à la Bourse de Paris - Euroclear: 7543
Tél: 0825 003 111 - Fax : 03 44 57 35 55
www.zeni.fr - michael.remy@zeni.fr
---------------------------------------------------
Received on Tue Feb 11 10:47:23 2003