UTF-8 encoding problem

Tech-Archive recommends: Fix windows errors by optimizing your registry



Hi All,

I am having a GUI which accepts a Unicode string and searches a given
set of xml files for that string.

Now, i have 2 XML files both of them saved in UTF-8 format, having
characters of different language.

Although both of them are having UTF-8 as BoM, but only first file is
having UTF-8 defined in XML declration at the top of the XML file as
well.

Now, when i search for some different langauge character in that
directory using a third party GUI for desktop search, it shows that the
charcter exist in the first file (in which XML declation was also
there), but not in the second file (having only BoM)

Initilally i thought that the problem is mainly because of UTF-8 being
supporting both MultiBye and Unicode, but could not find much on it.

Please help.

Regards,
Shreshth

.



Relevant Pages

  • Re: UTF-8 encoding problem
    ... Now, i have 2 XML files both of them saved in UTF-8 format, having ... Although both of them are having UTF-8 as BoM, but only first file is ... directory using a third party GUI for desktop search, ...
    (comp.lang.perl.misc)
  • Re: to_yaml and international characters
    ... accented characters were encoded in UTF-8 and not html escaped). ... How to get true UTF-8 ... Thanks a lot for your explanation and your ... Does anyone know how to get those characters into the final xml files? ...
    (comp.lang.ruby)
  • Re: How to clean an xml files from non-utf-8 chars?
    ... On Sep 17, 2008, at 4:07 AM, Krzysieq wrote: ... jmeter, that are stored in xml files. ... anything else that relies on the xml files being utf-8. ...
    (comp.lang.ruby)
  • How to clean an xml files from non-utf-8 chars?
    ... I'm trying to parse with ruby some test results from ... anything else that relies on the xml files being utf-8. ... I don't really care much about the data - if it's missing some chars, ...
    (comp.lang.ruby)
  • UTF-8 encoding problem
    ... I am having a GUI which accepts a Unicode string and searches a given ... Now, i have 2 XML files both of them saved in UTF-8 format, having ... Although both of them are having UTF-8 as BoM, ...
    (comp.lang.java.programmer)