Top "Unicode" questions

Unicode is a standard for encoding, representing, and handling text. It is intended to support all the characters required for written text, covering all writing systems, technical symbols, and punctuation.

ASCII vs Unicode + UTF-8

Was reading Joel Spolsky's 'The Absolute Minimum' about character encoding. It is my understanding that ASCII is a Code-point + Encoding …

unicode utf-8 character-encoding ascii

Why is IE failing to show UTF-8 encoded text?

I have some Chinese characters that I'm trying to display on a Kentico-powered website. This text is copy/pasted …

internet-explorer unicode character-encoding ie8-compatibility-mode

removing emojis from a string in Python

I found this code in Python for removing emojis but it is not working. Can you help with other codes …

python string unicode special-characters emoji

What's the ASCII character code for '—'?

I am working on decoding text. I am trying to find the character code for the — character, not to be …

unicode character-encoding ascii special-characters

Converting TMemoryStream to 'String' in Delphi 2009

We had the following code prior to Delphi 2009: function MemoryStreamToString(M : TMemoryStream): String; var NewCapacity: Longint; begin if (M.Size = &…

delphi string unicode delphi-2009 memorystream

<0xEF,0xBB,0xBF> character showing up in files. How to remove them?

I am compressing JavaScript files, and the compressor is complaining that my files have a <0xEF,0xBB,0xBF> character in them. How …

file unicode utf-8 utf

How well is Unicode supported in C++11?

I've read and heard that C++11 supports Unicode. A few questions on that: How well does the C++ standard library …

c++ unicode c++11

Difference between Char.IsDigit() and Char.IsNumber() in C#

What's the difference between Char.IsDigit() and Char.IsNumber() in C#?

c# .net unicode

Output unicode strings in Windows console app

Hi, I was trying to output a Unicode string to a console with iostreams and failed. I found this: Using unicode …

c++ unicode iostream windows-console

Printing UTF-8 strings with printf - wide vs. multibyte string literals

In statements like these, where both are entered into the source code with the same encoding (UTF-8) and the locale …

c unicode utf-8 printf multibyte