Re: Extra characters being placed on XML prior to send



> We are using Ethereal to sniff what is coming and going from port 80 on both
> servers. The encoding is UTF-8.

OK, so that's what I'm not getting. How are you translating 43 30 30 30 to C0 00? They are obviously not the same bytes. I'd expect to maybe see a BOM in there, but the length doesn't match either (if it were a UTF-8 BOM, you'd be seeing the three bytes EF BB BF instead).
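To make the comparison concrete, here is a minimal Python sketch (not taken from the original capture). It assumes the four sniffed bytes are exactly 43 30 30 30 and checks them against the raw bytes C0 00 and against the usual BOM signatures. The helper name `describe_prefix` is hypothetical, made up for this illustration.

```python
import codecs


def describe_prefix(data: bytes) -> str:
    """Report whether a payload starts with a known byte order mark.

    Hypothetical helper for illustration only.
    """
    if data.startswith(codecs.BOM_UTF8):
        return "UTF-8 BOM (EF BB BF)"
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return "UTF-16 BOM"
    return "no BOM"


# Assumption: these are the extra bytes as they appear on the wire.
sniffed = bytes.fromhex("43 30 30 30")
# The two raw bytes they were reported as.
raw = bytes.fromhex("C0 00")

print(sniffed.decode("ascii"))   # the four bytes read as ASCII text
print(sniffed == raw)            # byte-for-byte comparison
print(describe_prefix(sniffed))  # BOM check
```

Read as ASCII, 43 30 30 30 is the four-character text "C000", which is not the same thing as the two raw bytes C0 00, and neither sequence starts with a BOM.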

Can you show us a complete disassembled packet (including the whole HTTP request, if possible)?

--
Tomas Restrepo
tomasr@xxxxxxxx
http://www.winterdom.com/weblog/

