Re: Unicode Character Issue

Tech-Archive recommends: Fix windows errors by optimizing your registry



Hello -

There is an overloaded constructor that takes an Encoding (e.g.
System.Text.UnicodeEncoding). Visual Basic treats all strings as
Unicode so if you do a ReadLine() to a String it should work.

Joe



On Jun 4, 12:04 pm, "Samuel" <samuel.shul...@xxxxxxxxxxxx> wrote:
Hi

I am trying to read text files that are saved in ANSI format with Unicode
characters such as French e German big S etc, and as I read the file these
characters appear as squares etc.

I know that if the file would be saved as Unicode this wouldn't be a
problem.

The question is whether there is an option that when I create the Stream
Reader the application will recognize it as Unicode characters.

Thank you,
Samuel

.



Relevant Pages

  • Unicode and ANSI Common Lisp
    ... sequences of Unicode code points, is the internal format that best ... better left to a higher layer above ANSI Common Lisp. ... how characters are counted is the only way for LENGTH to return the same ... values across implementations for the same external strings. ...
    (comp.lang.lisp)
  • Re: Optimization of code
    ... that leet alphabet, with excessive accents. ... Latest MSVC releases can handle UNICODE C sources, ... Swedish, German, French, Hungarian, etc. that use accented characters). ... that require ASCII text strings as part of their protocol. ...
    (microsoft.public.vc.mfc)
  • Re: UTF-8 / German, Scandinavian letters - is it really this difficult?? Linux & Windows XP
    ... For string literals, with the "coding" declaration, Python will accept ... "coding" declaration to produce a Unicode object which unambiguously ... represents the sequence of characters - ie. something that can be ... > strings and/or gibberished characters in Tk GUI title? ...
    (comp.lang.python)
  • Re: Generic innerHTML functionality and other minor questions...
    ... > Now i know that all strings in JavaScript 1.0 are in Unicode ... strings are encoded using UTF-16 in accordance with ECMAScript. ... which was the first JScript version to support encodeURIComponent(). ... RFC2986) will be used for characters below code point 0x80 and UTF-8 ...
    (comp.lang.javascript)
  • Re: Generic innerHTML functionality and other minor questions...
    ... > Now i know that all strings in JavaScript 1.0 are in Unicode ... strings are encoded using UTF-16 in accordance with ECMAScript. ... which was the first JScript version to support encodeURIComponent(). ... RFC2986) will be used for characters below code point 0x80 and UTF-8 ...
    (comp.lang.javascript)