> -----Original Message-----
> From: Bill Moseley [mailto:moseley@hank.org]
> Sent: Friday, June 01, 2001 4:07 PM
> To: Multiple recipients of list
> Subject: [SWISH-E] Parsing Excel (was: last modified date in swish-e
> index file)
>
>
> At 03:38 AM 06/01/01 -0700, Rainer.Scherg@rexroth.de wrote:
> >> If you are indexing .doc (Word) files, then there's the
> >> catdoc program to
> >> extract out the text. I believe I saw a utility to extract
> >> out xls files, but I don't remember anything specific.
>
> I haven't tried either of these, but there are two Excel
> modules on CPAN.
>
> Spreadsheet::ParseExcel claims to extract the data from Excel docs.
>
> "XML::Excel provides functions to easily transform Excel
> documents into XML."
> But XML::Excel uses Spreadsheet::ParseExcel.
>
> I'm curious. What kind of content are you searching for in
> your spreadsheets?
Normally just the content of the XLS cells.
Known problems on XLS filters:
- XLS format versions(!) (Office version).
- Multi sheet XLS files.
- Macros/VB-Script enabled sheets.
BTW: some XLS filters are just CSV tools. 8-/
cu - rainer
-----------------------------------------------------------
This Mail has been checked for Viruses
Attention: Encrypted Mails can NOT be checked !
***
Diese Mail wurde auf Viren ueberprueft
Hinweis: Verschluesselte Mails koennen NICHT geprueft werden!
------------------------------------------------------------
Received on Fri Jun 1 14:52:51 2001