Re: IsTextUnicode and Windows 98

From: Tom Widmer (tom_usenet_at_hotmail.com)
Date: 10/05/04


Date: Tue, 05 Oct 2004 17:35:48 +0100

On Tue, 5 Oct 2004 11:40:40 +0200, "George Ionescu"
<geoionescu@hotmail.com> wrote:

>> Check if each char is between 0 and 127.
>
>Why? Isn't, for example, 200 (È) a *valid* ASCII char code?

No. ASCII usually refers to US-ASCII, which is a 7-bit encoding
(codes 0 to 127). Latin-1 (aka ISO 8859-1, where 200 (È) is defined)
is an 8-bit extension of ASCII. The first 128 Unicode code points are
the same as those in US-ASCII, and the first 256 are the same as
Latin-1.
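
A rough sketch of both points, in Python purely for illustration (the
names is_us_ascii and latin1_matches_unicode are made up for this
example, not part of any API). It checks that every byte is in the
7-bit range, and that decoding each byte value as Latin-1 gives the
Unicode code point with the same number:

```python
def is_us_ascii(data: bytes) -> bool:
    """Return True if every byte is in the 7-bit US-ASCII range 0..127."""
    return all(b < 0x80 for b in data)


# Plain English text passes; byte 200 (0xC8) does not, because it is
# outside the 7-bit range.
print(is_us_ascii(b"hello"))       # True
print(is_us_ascii(bytes([200])))   # False

# In Latin-1, byte 200 decodes to U+00C8 (E with grave accent).
print(bytes([200]).decode("latin-1") == "\u00c8")  # True

# Decoding every byte value 0..255 as Latin-1 yields the Unicode code
# point with the same numeric value.
latin1_matches_unicode = all(
    ord(bytes([i]).decode("latin-1")) == i for i in range(256)
)
print(latin1_matches_unicode)      # True
```

Note that this only tells you the data is pure ASCII; it says nothing
about which 8-bit code page was intended when the check fails.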

Tom


