Skip to main content.
home | support | download

Back to List Archive

RE: Indexing pdf and doc

From: David L Norris <dave(at)not-real.webaugur.com>
Date: Tue Dec 21 2004 - 20:58:05 GMT
On Tue, 2004-12-21 at 12:30 -0800, Smith, Sarah wrote:
> WvWare appears to work from the command line and catdoc works up to a
> point, where it inserts a number of question marks.

Question marks may be normal for catdoc.  Wv has a far better document
parser.  Catdoc basically just dumps raw text from a document.

I'm not really sure what's happening, though.  I'll try to give it some
thought.

-- 
 David Norris
  http://www.webaugur.com/dave/
  ICQ - 412039
Received on Tue Dec 21 12:58:16 2004