Skip to main content.
home | support | download

Back to List Archive

Re: swish-e 2.4.3 windows 2003 iis success!

From: Revillini, James <JRevillini(at)not-real.txcc.commnet.edu>
Date: Thu Jun 23 2005 - 14:04:56 GMT
> -----Original Message-----
> From: swish-e@sunsite3.berkeley.edu
[mailto:swish-e@sunsite3.berkeley.edu]
> On Behalf Of Bill Moseley
> Sent: Wednesday, June 22, 2005 5:23 PM
> To: Multiple recipients of list
> Subject: [SWISH-E] Re: swish-e 2.4.3 windows 2003 iis success!
> 
> On Wed, Jun 22, 2005 at 05:05:53PM -0400, Revillini, James wrote:
> > RTF's are killing it now.  As soon as it runs into one, the output
file
> > from dirtree.pl goes like this:
> 
> $ perl -MMIME::Types -le 'print
MIME::Types->new->mimeTypeOf("foo.rtf")'
> text/rtf
> 
> I think catdoc will handle rtr somewhat.  So add text/rtf to the list
> of content types that it will handle.
> 
> I thought I added and easy way to add mime-types from the calling
> program (ie DirTree.pl), but don't see it right now.  Try adding it to
> Filters/Doc2HTML.pm:
> 
>     mimetypes   => [
>         qr!application/(x-)?msword!,
>         qr!text/rtf!,
>     ]

Tried this, but it didn't seem to take.  When I ran Dirtree.pl in debug
mode, the output was

>> Starting to process new document: text/rtf
 ++Checking filter [SWISH::Filters::Doc2txt=HASH(0x1bf1000)] for
text/rtf
 ++Checking filter [SWISH::Filters::Pdf2HTML=HASH(0x1c03c68)] for
text/rtf

Final Content type for //fileservername/folder/path/to/the/ArTeeEff.rtf
is text/rtf
  *No filters were used

I did end up getting it to work, though.  Here's how:

1. download RTF2html.pm from the link posted on swish-e.org:
http://swish-e.org/archive/2005-03/9136.html .  Place this in your
filter directory - for me it's (I'm on windows 2003)
\swish-e\lib\swish-e\perl\SWISH\Filters\

2. download and install unrtf:
http://www.gnu.org/software/unrtf/unrtf.html
   NOTE: windoze users, you want the executable install from
http://gnuwin32.sourceforge.net/packages/unrtf.htm (complete package,
except sources - you can dl the sources using the exe actually)

3. copy unrtf.exe to your swish-e bin directory, or else add the GnuWin
directory to your path.  I did the former (copy \GnuWin\bin\unrtf.exe to
\swish-e\lib\swish-e\

4. rerun dirtree (ON A DIR WITH ONLY A FEW RTFs - they take a long time
to process!) in debug mode and the output from rtf's should be something
like:

>> Starting to process new document: text/rtf
 ++Checking filter [SWISH::Filters::Doc2txt=HASH(0x1bf1000)] for
text/rtf
 ++Checking filter [SWISH::Filters::Pdf2HTML=HASH(0x1c03c68)] for
text/rtf
 ++Checking filter [SWISH::Filters::Rtf2html=HASH(0x1c0e1b8)] for
text/rtf
This is UnRTF, version 0.19.3
By Dave Davey and Marcos Serrou do Amaral
Original Author: Zach T. Smith
Processing //fileservername/folder/path/to/the/ArTeeEff.rtf...
 ++ text/rtf *WAS* filtered by SWISH::Filters::Rtf2html=HASH(0x1c0e1b8)


Final Content type for //fileservername/folder/path/to/the/ArTeeEff.rtf
is text/
html
  >Filter SWISH::Filters::Rtf2html=HASH(0x1c0e1b8) converted from
[text/rtf] to
[text/html]


I also tested the import of the new out.txt into swish-e and it worked
beautifully.  I'm really close to having the right setup.


-Jim

> 
> 
> 
> --
> Bill Moseley
> moseley@hank.org
> 
> Unsubscribe from or help with the swish-e list:
>    http://swish-e.org/Discussion/
> 
> Help with Swish-e:
>    http://swish-e.org/current/docs
>    swish-e@sunsite.berkeley.edu
> 
> 
> 
>
------------------------------------------------------------------------
--
> -----
>  WARNING: The sender of this email is different from the email address
>  shown in the headers. The real sender of this message is: swish-
> e@sunsite3.berkeley.edu
>  If you want to add this sender to your Safe or Blocked Senders List,
you
>  would need to add swish-e@sunsite3.berkeley.edu
>
------------------------------------------------------------------------
--
> -----
Received on Thu Jun 23 07:04:56 2005