Re: XML Processing instruction for UTF-8
- From: "Alex Blekhman" <tkfx.N05P4M@xxxxxxxxx>
- Date: Wed, 25 Jan 2006 16:27:51 +0200
Jochen Kalmbach [MVP] wrote:
> Hi Alex!
>
>> How can you tell that it is not UTF-8?
>
> Most UTF-8 documents containing a BOM:
> http://www.unicode.org/faq/utf_bom.html
Yes, I know that BOM can be used to determine serialization
encoding. However, MSXML will not save BOM for UTF-8, as you
probably already know. Actually, BOM is not required by XML
specification and UTF-8 is always assumed unless BOM is
present or processing instruction specifies otherwise.
Strictly speaking, BOM is misnomer for UTF-8 stream since
"byte ordering" concept is inapplicable for UTF-8 (unlike
UTF-16/32) and actually BOM is used as magic number. It is
noted in above mentioned FAQ, too.
(http://www.unicode.org/faq/utf_bom.html#3)
So, under Windows using MSXML one will get BOM'less XML
files by default.
.
- References:
- XML Processing instruction for UTF-8
- From: ama
- Re: XML Processing instruction for UTF-8
- From: Alex Blekhman
- Re: XML Processing instruction for UTF-8
- From: Jochen Kalmbach [MVP]
- XML Processing instruction for UTF-8
- Prev by Date: Running external program and capture output Shellexecute?
- Next by Date: Re: Running external program and capture output Shellexecute?
- Previous by thread: Re: XML Processing instruction for UTF-8
- Next by thread: Re: XML Processing instruction for UTF-8
- Index(es):
Relevant Pages
|