Top "Surrogate-pairs" questions

Unicode characters with code points above 0xFFFF are encoded in UTF-16 as pairs of 16-bit code units called **surrogate pairs**.
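As a rough illustration of that encoding, the sketch below splits a code point above 0xFFFF into its high and low surrogates using the standard UTF-16 arithmetic. The function name `to_surrogate_pair` is a hypothetical helper chosen for this example, not part of any library.

```python
from typing import Tuple


def to_surrogate_pair(code_point: int) -> Tuple[int, int]:
    """Split a supplementary-plane code point into UTF-16 surrogates.

    Hypothetical helper for illustration only.
    """
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("code point must be in the range U+10000..U+10FFFF")
    # Subtracting 0x10000 leaves a 20-bit value.
    offset = code_point - 0x10000
    # The top 10 bits go into the high (lead) surrogate, 0xD800..0xDBFF.
    high = 0xD800 + (offset >> 10)
    # The bottom 10 bits go into the low (trail) surrogate, 0xDC00..0xDFFF.
    low = 0xDC00 + (offset & 0x3FF)
    return high, low


high, low = to_surrogate_pair(0x1F600)
print(f"U+1F600 -> {high:04X} {low:04X}")  # prints "U+1F600 -> D83D DE00"
```

The result can be cross-checked against Python's own codec: `chr(0x1F600).encode("utf-16-be")` yields the same four bytes, `D8 3D DE 00`.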

Java charAt used with characters that have two code units

From Core Java, vol. 1, 9th ed., p. 69: The character ℤ requires two code units in the UTF-16 encoding. Calling String sentence = "ℤ …

java unicode utf-16 surrogate-pairs astral-plane

How do I create a string with a surrogate pair inside of it?

I saw this post on Jon Skeet's blog where he talks about string reversing. I wanted to try the example …

c# string utf-16 utf-32 surrogate-pairs

UnicodeEncodeError: 'utf-8' codec can't encode character '\udc43' in position 1: surrogates not allowed

I have a list containing placenames and I want to create another array, initially empty, and then iterate the list …

python character-encoding surrogate-pairs