[swish-e] Encoding problems

From: Patricio Mac Adden <pmacadden(at)not-real.cespi.unlp.edu.ar>
Date: Tue Mar 16 2010 - 16:58:11 GMT
Hello, this is my first message to this mailing list. I'm from La Plata,
Argentina, and my problem is this:

I'm trying to index several document types: pdf, doc, xsl, txt, zip,
etc. The documents may be encoded in UTF-8 or ISO-8859-1. I'm also
using the directive TranslateCharacters _áéíóúñ -aeioun so that the word
"papá" is indexed as "papa", and so on.

Suppose that in my indexed directory I have 3 documents: 2 containing the
text "papá" and 1 containing the text "papa". Then:

$ swish-e -w papa

should give me 3 hits instead of 1.
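
To make the expected behaviour concrete, here is a minimal Python sketch,
independent of swish-e. It only illustrates that "papá" is stored as
different bytes in UTF-8 and in ISO-8859-1, and that after decoding and
folding accents all three documents should match "papa". The helper names
(decode_either, fold_accents) and the sample documents are hypothetical,
and the UTF-8-first fallback is an assumption about how mixed input could
be handled, not a description of what swish-e does.

```python
import unicodedata


def decode_either(raw: bytes) -> str:
    """Decode bytes as UTF-8, falling back to ISO-8859-1 (assumed policy)."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return raw.decode("iso-8859-1")


def fold_accents(text: str) -> str:
    """Drop combining marks so that 'papá' becomes 'papa'."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Three hypothetical documents: same word, different encodings.
documents = [
    "papá".encode("utf-8"),       # b'pap\xc3\xa1' (á is two bytes)
    "papá".encode("iso-8859-1"),  # b'pap\xe1'     (á is one byte)
    "papa".encode("utf-8"),       # plain ASCII
]

# Count the documents whose folded text equals the search term.
hits = sum(1 for raw in documents if fold_accents(decode_either(raw)) == "papa")
print(hits)
```

The point of the sketch is only that a byte-for-byte character mapping sees
two different representations of "á", so both have to be normalised before
the translation to "a" can apply to every document.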

I'm using Ubuntu 9.10, swish-e 2.4.5-5, and libxml2 2.7.5.

Thanks in advance.

PS: I hope you understand; my English isn't good.

-- 
Patricio Mac Adden
Desarrollo - CeSPI - UNLP

Received on Tue Mar 16 12:58:15 2010