Re: encoding

From: Igor Tandetnik (itandetnik_at_mvps.org)
Date: 05/25/04

    Date: Tue, 25 May 2004 16:18:26 -0400
    
    

    UTF-8 is endianness-neutral, so it does not need a BOM.
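
To illustrate the point for the original question, here is a minimal sketch of creating a UTF-8 file from C++. It assumes the text is already valid UTF-8 held in a std::string and that the file is opened in binary mode so the bytes are written unchanged. The names writeUtf8File, readAllBytes and kUtf8Signature are hypothetical helpers made up for this example, not part of any library. The three-byte sequence EF BB BF is the UTF-8 encoding of U+FEFF; writing it is optional and serves only as a signature that some programs use to recognize the encoding.

```cpp
#include <cassert>
#include <cstdio>
#include <string>

// UTF-8 encoding of U+FEFF. In UTF-8 it is an optional signature,
// not a byte-order indicator (hypothetical constant name).
const char kUtf8Signature[] = "\xEF\xBB\xBF";

// Writes utf8Text (assumed to already be valid UTF-8) to path.
// Binary mode ("wb") keeps the runtime from translating any bytes.
// Returns true only if every byte was written and the file closed cleanly.
bool writeUtf8File(const char* path, const std::string& utf8Text,
                   bool withSignature) {
    assert(path != nullptr);
    std::FILE* f = std::fopen(path, "wb");
    if (!f) return false;

    bool ok = true;
    if (withSignature)
        ok = std::fwrite(kUtf8Signature, 1, 3, f) == 3;
    if (ok && !utf8Text.empty())
        ok = std::fwrite(utf8Text.data(), 1, utf8Text.size(), f)
             == utf8Text.size();
    if (std::fclose(f) != 0) ok = false;
    return ok;
}

// Reads a whole file back as raw bytes, to check what was written.
// Returns an empty string if the file cannot be opened.
std::string readAllBytes(const char* path) {
    std::string bytes;
    std::FILE* f = std::fopen(path, "rb");
    if (!f) return bytes;
    char buf[256];
    std::size_t n;
    while ((n = std::fread(buf, 1, sizeof buf, f)) > 0)
        bytes.append(buf, n);
    std::fclose(f);
    return bytes;
}
```

Calling writeUtf8File("out.txt", text, false) produces a plain UTF-8 file; passing true prepends the signature for readers that rely on it to detect the encoding. Either way the payload bytes are identical, since UTF-8 has no byte order to mark.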

    -- 
    With best wishes,
        Igor Tandetnik
    "For every complex problem, there is a solution that is simple, neat,
    and wrong." H.L. Mencken
    "Tank" <anonymous@microsoft.com> wrote in message
    news:eLZ%23Z7xPEHA.4036@TK2MSFTNGP12.phx.gbl
    > You must have a Byte Order Mark (BOM) header written at the beginning
    > of your file. You might want to read about the BOM:
    >
    > http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_42jv.asp
    >
    > Hope this helps.
    >
    > regards,
    > tank
    >
    > "Paul" <anonymous@discussions.microsoft.com> wrote in message
    > news:3E356341-48BB-471D-80CB-E8AFDEE54D10@microsoft.com...
    >> Hi,
    >>
    >> How do I add encoding = UTF-8 to a file that was just created by
    >> _tfopen()?  Or... how do I create a file with encoding other than ANSI
    >> (UTF-8, Unicode, etc.) in C++?
    >>
    >> Thanks,
    >> Paul
    
