Skip to main content.
home | support | download

Back to List Archive

Re: [swish-e] Parsing plain text emails to use the subject line as the title

From: Peter Karman <peter(at)not-real.peknet.com>
Date: Sat Jan 02 2010 - 04:16:13 GMT
Troy Wical wrote on 12/31/09 4:59 AM:
> Hello all,
> 	I am currently working on a project where Swish will index email  
> archives.  This appears to be quite a common practice these days, but  
> I am running into what I believe are some rather simple issues.  What  
> I am working with are plain text email archives that are in the  
> maildir format.
> 	My goal at this point is to use the subject line in the email, as the  
> title in the search results. After a bit of reading, it "appears" that  
> what I need to use is metanames to define those areas.  However, I am  
> having a hard time figuring out what to put in the conf file to have  
> swish-e do that.  Is metanames only usable with html and xml?

yes, metanames/properties only work if the content is xml or html.

You might look at SWISH::Prog::Aggregator::Mail which will crawl a maildir (or 
mbox iirc) tree and generate the xml for you.

http://search.cpan.org/dist/SWISH-Prog/lib/SWISH/Prog/Aggregator/Mail.pm

-- 
Peter Karman  .  http://peknet.com/  .  peter(at)not-real.peknet.com
_______________________________________________
Users mailing list
Users@lists.swish-e.org
http://lists.swish-e.org/listinfo/users
Received on Fri Jan 1 23:16:17 2010