Re: Microsoft Layer for Unicode on Windows 95/98/Me systems



"Thorsten Albers" <albersRE@xxxxxxxxxxxxxxxxxxx> wrote in message
news:01c5541b$f28e7640$978ee684@xxxxxxxxxxx
> mayayana <mayaXXyana1a@xxxxxxxxxxxxxxxx> schrieb im Beitrag
> <i_ufe.9554$7F4.9418@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>...
>> Codepoint? I gather you're going to make
>> me look that up to figure out what you're
>> talking about.
>
> Sorry, it is "code point", not "codepoint"...
>
> For the term please refer to
> http://www.unicode.org/glossary/index.html#code_point
>
> This term is used to differentiate the Unicode character encodings from
> standard single byte character encodings. Unicode code points are used not
> only to encode characters but also to encode administrative data, control
> codes, etc. And some characters are encoded by 2 code points
> ("surrogates";
> 4 byte instead of 2). So the term "code point" seems to be a bit more
> comprehensive than the term "character code".


Code points are dependent on code pages. If a code page is not specified,
the code point is indeterminate.

Are you sure that all wide string versions of SDK functions are dependent on
a code page? I think not, but I am not sure. Unless all wide string versions
of SDK functions are dependent on a code page, it is not reasonable to
expect VB to always use code pages to interpret Unicode characters.


.



Relevant Pages

  • Re: write smiley to file
    ... use Encode; ... There is a difference between UTF-8 and Unicode characters. ... the same Unicode character. ... The Unicode character is higher than 127, so we can ignore the first rule. ...
    (perl.beginners)
  • Re: Unicode in regexp
    ... I have some text which has unicode character \u+2013 for example: ... Save your script in UTF-8 encoding and just use the unicode ... decode it first, perform substitution, and encode it back: ...
    (comp.lang.perl.misc)
  • Re: output ampersand using XML::Twig
    ... XML data, then it receives unescaped utf8 strings from the parser ... first 2 solutions) being to get the unicode character for   ... turn off XML escapes for the element content ... {# use an Encode output filter that encodes (using decimal ...
    (comp.lang.perl.modules)
  • Re: editing perl script through TEXTAREA
    ... not converted to the corresponding HTML entity. ... then browsers do strange things with them. ... As the Perl Encode ... character which cannot be represented in the prevailing character ...
    (comp.lang.perl.misc)
  • Re: unicode
    ... 'ascii' codec can't encode character u'\u9999' in ... it looks like when I try to display the string, ... If you try to print a Unicode string, then Python will attempt to first ... encode it using the default encoding for that file. ...
    (comp.lang.python)