Re: Microsoft Layer for Unicode on Windows 95/98/Me systems

Tech Tip: Click here to run a free scan for Windows Errors and optimize PC performance



"Thorsten Albers" <albersRE@xxxxxxxxxxxxxxxxxxx> wrote in message
news:01c5541b$f28e7640$978ee684@xxxxxxxxxxx
>
> This term is used to differentiate the Unicode character encodings from
> standard single byte character encodings. Unicode code points are used not
> only to encode characters but also to encode administrative data, control
> codes, etc. And some characters are encoded by 2 code points
> ("surrogates";
> 4 byte instead of 2). So the term "code point" seems to be a bit more
> comprehensive than the term "character code".

The term "code point" existed long before Unicode did. In the context of
Unicode, the term is used as if it is unique to Unicode, but it definitely
is not.

Look at:
http://support.sas.com/documentation/periodicals/obs/nls_article.html ("SAS
System Support for International Character Sets"), which is from the SAS web
site. SAS is software that has existed a long time. Under "EBCDIC" it says
"the rest of the code points". It is talking about EBCDIC, which is the IBM
charcter set that extended ASCII long before Unicoder; I think EBCDIC was
created about 1964. I could find similar comments in the IBM web site, but I
hope the SAS page is enough.

In IBM's AIX documentation is the "Printer Code Page Translation Tables"
page; see:
http://publib.boulder.ibm.com/infocenter/pseries/index.jsp?topic=/com.ibm.aix.doc/aixbman/printrgd/prt_code_page.htm
It talks about code points, but not Unicode code points.


.



Relevant Pages

  • Re: VB - Ascii to Unicode and then Unicode to UTF-8 conversion (Very desperate!!)
    ... Latin together) then you have to use a Unicode column type. ... AscW returns the real Unicode character ... for Chinese characters, ... then the next thing to worry about is your CSV file. ...
    (microsoft.public.vb.general.discussion)
  • Re: Unicode Support
    ... if two Unicode strings are the same? ... UTF-16 is basically telling everyone "ok we all got to start ... character, and will likely support *both* endians. ... UTF-8 encodings are also easy to learn to ...
    (alt.lang.asm)
  • Re: Determining if a string is Unicode
    ... there's nothing magic about Unicode. ... where each character occupies 2 bytes, as opposed to a Single-Byte Character ... You could load up a string with rubbish, ... > INF file like so: ...
    (microsoft.public.vb.general.discussion)
  • Re: KANJD212
    ... >>Who decides the factors and what are their criteria, Unicode? ... But once a character is defined/get a codepoint in Unicode it ... standard modifies the codepoint of the kanji to a totally new ... I can use a code like JIS X0208 along with a font ...
    (sci.lang.japan)
  • Re: Enhanced Unicode support for "Go" tools
    ... the point to remember is that UNICODE is a _character ... It's the fonts, the OS and the application which work together ... society for the protection of French from English ...
    (alt.lang.asm)