Re: Microsoft Layer for Unicode on Windows 95/98/Me systems



mayayana <mayaXXyana1a@xxxxxxxxxxxxxxxx> schrieb im Beitrag
<i_ufe.9554$7F4.9418@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx>...
> Codepoint? I gather you're going to make
> me look that up to figure out what you're
> talking about.

Sorry, it is "code point", not "codepoint"...

For the term please refer to
http://www.unicode.org/glossary/index.html#code_point

This term is used to differentiate the Unicode character encodings from
standard single byte character encodings. Unicode code points are used not
only to encode characters but also to encode administrative data, control
codes, etc. And some characters are encoded by 2 code points ("surrogates";
4 byte instead of 2). So the term "code point" seems to be a bit more
comprehensive than the term "character code".

--
----------------------------------------------------------------------
THORSTEN ALBERS Universität Freiburg
albers@
uni-freiburg.de
----------------------------------------------------------------------

.



Relevant Pages

  • Re: output ampersand using XML::Twig
    ... XML data, then it receives unescaped utf8 strings from the parser ... first 2 solutions) being to get the unicode character for   ... turn off XML escapes for the element content ... {# use an Encode output filter that encodes (using decimal ...
    (comp.lang.perl.modules)
  • Re: editing perl script through TEXTAREA
    ... not converted to the corresponding HTML entity. ... then browsers do strange things with them. ... As the Perl Encode ... character which cannot be represented in the prevailing character ...
    (comp.lang.perl.misc)
  • Re: unicode
    ... 'ascii' codec can't encode character u'\u9999' in ... it looks like when I try to display the string, ... If you try to print a Unicode string, then Python will attempt to first ... encode it using the default encoding for that file. ...
    (comp.lang.python)
  • Re: POSTing Chinese characters
    ... For the example string I mention, simply encode as ... the client locale could be anywhere... ... > The basic idea of %-encoding is to treat character encoding as a sequence ...
    (microsoft.public.inetserver.iis)
  • Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
    ... > This term is used to differentiate the Unicode character encodings from ... > standard single byte character encodings. ... > only to encode characters but also to encode administrative data, ...
    (microsoft.public.vb.winapi)

Loading