Re: National Language - question on alphabet/sort order

Thanks Joe and MrAsm

You confirmed and added to my understanding of the problem.
Since this is something I just came across while doing something else, I
shall pass it on to my boss for consideration in future development. Since
no-one has reported this as a problem (yet) it is my guess that our clients
are ok about using the English ASCII sequence for the moment. Our Chinese
support agent is in the UK next month so we can ask him then. The Russian
and Hungarian offices are in contact via email on another issue so I can
mention this to them.

Thanks
Les

"Joseph M. Newcomer" <newcomer@xxxxxxxxxxxx> wrote in message
news:ik4c5390avrf4gi27bq9ok0tjc3seg8c3v@xxxxxxxxxx
There are some issues here. First, you need to identify what the letters of
the alphabet are. This only works for languages that have alphabets.
Iconographic languages are a completely different can of worms, but I'm
reasonably confident that ++ is not going to generate the next "logical"
character in any way that would make sense to a native of that culture.

In languages with diacritical marks, do you include the accented characters
in the sequence or not? This has serious implications because an engineering
drawing that took you to drawing À-É might be confused with one that took
you to Á-È, which strikes me as a bit risky. The reason I mention this is
that the information that might let you discover what are universally
"alphabetic characters" can therefore lead to these problems.
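
To illustrate the risk, here is a minimal Python sketch (not code from this thread) that strips diacritical marks via Unicode NFD decomposition. The helper name base_letter is hypothetical. It shows that the labels À-É and Á-È reduce to the same unaccented pair A-E, which is one way a program could detect labels that a reader might confuse.

```python
import unicodedata


def base_letter(ch: str) -> str:
    """Return ch with any combining diacritical marks removed.

    Hypothetical helper: decomposes to NFD, then drops combining marks.
    """
    decomposed = unicodedata.normalize("NFD", ch)
    return "".join(c for c in decomposed if not unicodedata.combining(c))


# Both accented pairs reduce to the same unaccented pair.
first_pair = (base_letter("\u00c0"), base_letter("\u00c9"))   # À, É
second_pair = (base_letter("\u00c1"), base_letter("\u00c8"))  # Á, È
print(first_pair == second_pair)
```

Whether accented letters belong in the label sequence at all remains a decision for each locale; the sketch only detects the collision.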

You can download my Locale Explorer and go investigate the Unicode page and
look at the General Category column. Ll and Lu (Letter lowercase and Letter
uppercase) are useful to look at.
joe
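
The same General Category data can also be queried programmatically. The following is a rough Python sketch using the standard unicodedata module, not the Locale Explorer itself; the function name uppercase_letters is hypothetical. It lists the Lu characters in a code point range and shows what a naive ++ produces at the end of the English and Russian uppercase runs.

```python
import unicodedata


def uppercase_letters(first: int, last: int) -> list:
    """List the characters in [first, last] whose General Category is Lu."""
    return [
        chr(cp)
        for cp in range(first, last + 1)
        if unicodedata.category(chr(cp)) == "Lu"
    ]


# Latin-1 accented capitals: U+00C0..U+00DE, where U+00D7 is the
# multiplication sign and is therefore not a letter.
latin1_upper = uppercase_letters(0x00C0, 0x00DE)

# Naive increment past the last uppercase letter of two alphabets.
after_z = chr(ord("Z") + 1)            # '[' (punctuation)
after_ya = chr(ord("\u042f") + 1)      # Cyrillic small 'а' (lowercase)

print(len(latin1_upper), unicodedata.category(after_z),
      unicodedata.category(after_ya))
```

The category check tells you which code points are letters, but not which of them a given language counts as part of its alphabet, nor in what order.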

On Thu, 24 May 2007 16:05:24 +0100, "Les" <l.neilson@xxxxxxxxxxxxxxxxxxx>
wrote:


I'm looking through the National Language Support documentation but don't
think what I need can be done automatically.

Let me explain :

On our engineering drawings there are often several sectional views and
these are labelled A-A, B-B etc.
So in our code which automatically generates these sectional views there is
a starting dialog which has a box for the section label ("A" in this case)
and the code increments to the next character until "Z" and wraps round to
"A" again. (Highly unlikely that there will be more than 26 sections on one
drawing but the start letter _could_ be near the end of the alphabet and a
wrap is needed.)
And yes, this particular increment function assumes the English ASCII code
sequence! :-(
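
For readers following along, the behaviour described above might look roughly like this in Python. This is a sketch of the idea only, not the actual production code, and the function name next_section_label is made up for illustration.

```python
def next_section_label(label: str) -> str:
    """Return the label after `label` in A..Z, wrapping from Z back to A.

    Assumes the English ASCII sequence, exactly as described above.
    """
    if len(label) != 1 or not ("A" <= label <= "Z"):
        raise ValueError(f"expected a single letter A-Z, got {label!r}")
    offset = (ord(label) - ord("A") + 1) % 26
    return chr(ord("A") + offset)


print(next_section_label("A"), next_section_label("Z"))
```

The modulo 26 and the reliance on consecutive code points are the two places where the ASCII assumption is baked in.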

Have you guessed my question yet?

Is there a method of incrementing and wrapping non-English alphabets?
Or : of identifying first and last characters in a set? (Sort sequence?)
European and Cyrillic alphabets might be straightforward, but obviously
Asian sets are a different kettle of fish ("whole new ballgame" in modern
vernacular)!
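
One possible approach, sketched here in Python under the assumption that each supported locale supplies its own ordered list of label letters, is to increment by position in that list rather than by character code. The tables ENGLISH and RUSSIAN and the function next_label are hypothetical names for illustration; the Russian table is the standard 33-letter alphabet.

```python
import unicodedata
from typing import Sequence

# Hypothetical per-locale label tables; a real product would load these
# from configuration agreed with each regional office.
ENGLISH = tuple("ABCDEFGHIJKLMNOPQRSTUVWXYZ")
RUSSIAN = tuple(unicodedata.normalize(
    "NFC", "АБВГДЕЁЖЗИЙКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯ"))


def next_label(label: str, alphabet: Sequence[str]) -> str:
    """Return the entry after `label` in `alphabet`, wrapping at the end."""
    label = unicodedata.normalize("NFC", label)
    try:
        index = alphabet.index(label)
    except ValueError:
        raise ValueError(f"{label!r} is not in the alphabet") from None
    return alphabet[(index + 1) % len(alphabet)]


print(next_label("Z", ENGLISH), next_label("\u042f", RUSSIAN))
```

Because the table holds strings rather than single characters, it could also hold multi-character entries if a locale needs them. Note that Ё sits between Е and Ж in the table even though its code point (U+0401) lies outside the run U+0410..U+042F, which is why incrementing the character code would skip it.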

Les

Joseph M. Newcomer [MVP]
email: newcomer@xxxxxxxxxxxx
Web: http://www.flounder.com
MVP Tips: http://www.flounder.com/mvp_tips.htm

