Skip to main content.
home | support | download

Back to List Archive

Re: non-English charaters in XML files

From: <dasoso(at)not-real.alumni.uv.es>
Date: Tue Nov 09 2004 - 18:20:24 GMT
> On Tue, Nov 09, 2004 at 06:52:24AM -0800, dasoso@alumni.uv.es wrote:
> > Swish-e splits the words in ISO-8859. I like the way that works 
with 
> > the UTF-8. 
> 
> So I guess that means your source xml is encoded in UTF-8.


  Yes, but I noticed that my server has files encoded in UTF-8 and 
others in ISO-8859, so I'll have files with 's indexed as n and 
others whit the words splitted. Anyone has this problem with the xml 
files? How do you resolve it and index your XML files? Don't know 
what to do.



David Soriano.
Received on Tue Nov 9 10:20:38 2004