Re: My symbols turn to question marks

Tech-Archive recommends: Repair Windows Errors & Optimize Windows Performance

From: Ronx (ronx917_at_hotmail.com)
Date: 10/23/04


Date: Sat, 23 Oct 2004 10:23:06 +0100

The characters with ASCII codes between 32 (space) and 126(~) will display
correctly under UTF-8, and almost every other character set.
I can only assume that the spaces that display as ? are not true spaces.
Word has a non-breaking space (CTRL+SHIFT+Spacebar) which may be causing
this.
What does the HTML code for the character look like?

-- 
Ron Symonds   (Microsoft MVP - FrontPage)
Reply only to group - emails will be deleted unread.
"Bjorn" <Bjorn@discussions.microsoft.com> wrote in message 
news:A4CFA0B0-417B-4036-B800-E5C6621A8A4F@microsoft.com...
>I have a similar problem. When creating a web page with MS word or Excel, 
>any
> space turns to ?. In Internet Explorer, if I change Encoding from UTF-8 to
> Western European, everything is fine. But next time the page is open, the
> browser again insists on UTF-8 encoding. Even when the page is created to 
> "
> force" Western European encoding.
> This is also a new problem with the newest releases only.
>
> "Ronx" wrote:
>
>> Your symbols need to be inserted as entities in the form & #159; 
>> (without
>> the space between & and #)  where the number is the ASCII code for the
>> character.
>>
>> It is possible that your server is delivering the page with UTF-8 
>> character
>> set, over-riding any character set you have defined in the page.  UTF-8 
>> will
>> display symbols inserted into the HTML with their ASCII codes  as ? marks 
>> or
>> boxes, whereas entities will always display correctly.
>>
>> A list of entities can be found at
>> http://www.w3schools.com/html/html_entitiesref.asp
>>
>> -- 
>> Ron
>>
>> Reply only to group - emails will be deleted unread.
>>
>>
>> "news" <news@zipspots.com> wrote in message
>> news:yfe7d.28847$tT2.1818570@news20.bellglobal.com...
>> > My apostrophes, copyright symbols and some Latin accents become 
>> > question
>> > marks or other oddball symbols. I have tried different font's. I look 
>> > at
>> > them in FP2003 Preview...they are OK...but when I (and others) see them 
>> > on
>> > the web, what was SUPPOSED to be an apostrophe or copyright symbol is a
>> > question mark. On one site I use the city Bogota a lot. The "a" has an
>> > accent over it. It should be Bogotá but it end up being Bogot (square)
>> >
>> > Alex
>> >
>> >
>> >
>>
>>
>> 


Relevant Pages

  • [PATCH] console UTF-8 fixes
    ... I send a patch to the UTF-8 part of the vt driver. ... If a certain character is not found in the glyph ... characters) is to simply display the glyph loaded in that position. ...
    (Linux-Kernel)
  • RE: [PHP] Re: Unicode translation
    ... single byte character set you will see the C Circumflex as two separate ... UTF-8 should display fine on web page if you set the charset ... ISO-8859-1 (Which is the most prevalent character set on the internet) is ...
    (php.general)
  • Re: UTF-8 Character Encodings and "NO-BREAK SPACE" (dec: 202, hex: CA) Character
    ... text bodies as UTF-8, rather than their real encoding, Mac-Roman. ... I were fixing that bug I'd make the contents UTF-8 rather than change ... if they are truly following UTF-8 they should display an error glyph ... Don't confuse the Unicode code for a character with the octets you use ...
    (alt.html)
  • Re: Unicode Support
    ... > Not knowing much about UTF-8 (my Unicode knowledge extends as far as ... > literal strings of this form as long as the character code for quote ... > can never appear in a MBCS (multibyte character sequence). ... then XP Notepad directly understands UNICODE and you can ...
    (alt.lang.asm)
  • Re: Attention: European C/C++/C#/Java Programmers-Call for Input
    ... Simply make a straight decision now - you will use UTF-8. ... character format) much like UTF-8 which itself ... I would have little more than UNICODE left. ... generator is assembly language. ...
    (comp.arch.embedded)