Unicode Conversion in c#

manoj picture manoj · Jan 28, 2013 · Viewed 50.6k times · Source

i am trying to assign Unicode on string but it return "Привет" string as "Привет" But i need "Привет", i am converting by following function .

public string Convert(string str)
{
    byte[] utf8Bytes = Encoding.UTF8.GetBytes(str);
    str = Encoding.UTF8.GetString(utf8Bytes);
    return str;
}

what can i do for solve this problem to return "Привет".

Answer

devio picture devio · Jan 28, 2013

П is Unicode character 0x041F, and its UTF-8 encoding is 0xD0 0x9F resulting in П.

Since the function only returns the input parameter, as commenters already discussed, I conclude that your original input string is actually in UTF-8, and you want to convert it into native .Net string.

Where does the original string come from?

Instead of reading the input into a C# string, change your code to read a byte[], and then call Encoding.UTF8.GetString(inputUtf8ByteArray).