Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: "Thorsten Albers" <albersRE@xxxxxxxxxxxxxxxxxxx>
- Date: Tue, 10 May 2005 04:58:26 -0700
Sam Hobbs <samuel@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx> schrieb im Beitrag
<ujS2xNQVFHA.3244@xxxxxxxxxxxxxxxxxxxx>...
> In the context of Windows, code pages are very relevant. Most fonts
provided
> by Windows are not Unicode fonts. The one or two fonts that are Unicode
> fonts are very large and until recently disk space was costlier than it
is
> now. Therefore it was unreasonable to provide a variety of Unicode fonts.
> The non-Unicode fonts use code pages.
Sorry, but:
a) Most fonts provided by Windows (from Windows 95 on) >are< indeed
'Unicode fonts'.
b) I can not see any coherence of 'Unicode fonts', disk space, and this
thread on the whole. Like all other fonts a 'Unicode font' may be large or
small in size. Like with all TrueType or PostScript fonts this does not
depend on whether it is a 'Unicode font' or not, but on the glyphs that the
font defines.
c) Last but not least: In fact there isn't anything that could be called a
'Unicode font'. A TrueType font may hold several character encoding tables.
If the glyphs of a font may be accessed by Unicode character encodings
depends on whether the font holds a table for this.
> See: http://support.microsoft.com/default.aspx/kb/816039 (KB article
816039,
> "FIX: Code Point Comparison Semantics for SQL_Latin1_General_Cp850_BIN
> Collation"). At the botton it says "Code point is a numeric value in
Unicode
> encoding, or in a code page that corresponds to a character.".
Yeah, "or"! The Unicode character encoding does not make use of "code
pages" (unless there is someone - and presumably there is since there is
always someone - who likes to call a Unicode codepoint range a code page).
In the context of single byte character encodings "code page" defines the
relation of the font's character set to one or more human scripts. The term
"page" is used because single byte character sets provide a frame of of
only 256 characters into which all human scripts have to be mapped somehow.
Unicode on the contrary already includes all (or almost all) human scripts
hitherto known. The code point space of Unicode is subdivided into code
point ranges and not in "code pages" because the code point ranges are very
different in size. A "code page" is used to specify to which human script
glyph a single character code shall be mapped. The "code page"
specification is necessary because the respective character code may be
mapped to several different human script glyphs, i.e. the same character
code is used in different "code pages". This isn't the case with Unicode:
Each code point by its value already includes the information to which
human script it belongs and which human script glyph it is encoding.
> See: http://www.unicode.org/glossary/index.html, the "Glossary of Unicode
> Terms" in the Unicode.org web site. It has a definition of "Code Page".
> Since the definition is in the Glossary of Unicode Terms, the implication
is
> that the term is relevant to Unicode.
That's really neat: Because a term is within the Unicode glossary it has to
be relevant to Unicode. Ho, ho, ho! The glossary also includes a definition
for the term "HTML". Is "HTML" relevant to Unicode?
Of course, the term "code page" is indeed somehow relevant to Unicode: It
is relevant if texts have to be converted from character encodings where
"code pages" are in use to the Unicode character encoding and vice versa.
> See: http://cpdetector.sourceforge.net ("cpdetector, free java code page
> detection"). It has a description of code pages and their relevance to
> Unicode.
But it does not state anywhere that Unicode >makes use< of "code pages".
The only relevance is if conversion from one character encoding scheme to
the other has to be done.
--
----------------------------------------------------------------------
THORSTEN ALBERS Universität Freiburg
albers@
uni-freiburg.de
----------------------------------------------------------------------
.
- Follow-Ups:
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: Sam Hobbs
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- References:
- Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: Howard Kaikow
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: TC
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: mayayana
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: Thorsten Albers
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: mayayana
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: Thorsten Albers
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: mayayana
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: Thorsten Albers
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: Sam Hobbs
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: Thorsten Albers
- Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- From: Sam Hobbs
- Microsoft Layer for Unicode on Windows 95/98/Me systems
- Prev by Date: Re: Creating Thread under VB
- Next by Date: Re: Creating Thread under VB
- Previous by thread: Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- Next by thread: Re: Microsoft Layer for Unicode on Windows 95/98/Me systems
- Index(es):
Relevant Pages
|
|