Unicode characters with code above 0xFFFF, are encoded in UTF-16 by pairs of 16-bit code units called **surrogate pairs**.
I was reading the documentation for StringBuffer, in particular the reverse() method. That documentation mentions something about surrogate pairs. What …
java unicode utf-16 surrogate-pairsI want to use this unicode character in my resource file. But whatever I do, I end with dalvikvm crash (…
android unicode utf-8 surrogate-pairsIn your experience which Unicode characters, codepoints, ranges outside the BMP (Basic Multilingual Plane) are the most common so far? …
unicode cjk codepoint surrogate-pairs astral-planeWe already have a question about getting the first 16-bit char of a string. This includes the question code: MyString.…
c# string unicode utf-16 surrogate-pairsThis is a follow-up to Converting to Emoji. In that question, the OP had a json.dumps()-encoded file with …
python python-3.x unicode surrogate-pairsI'm dealing with code that does various IO operations with files, and I want to make it able to deal …
java file unicode filenames surrogate-pairsBMP being Basic Multilingual Plane According to JavaScript: the Good Parts: JavaScript was built at a time when Unicode was …
javascript unicode utf-16 surrogate-pairs astral-planeI am trying to store some text (e.g. č) in a Postgres database, however when retrieving this value, it appears …
.net postgresql encoding utf-16 surrogate-pairsConsider the following code: byte aBytes[] = { (byte)0xff,0x01,0,0, (byte)0xd9,(byte)0x65, (byte)0x03,(byte)0x04, (byte)0x05, (byte)0…
java unicode surrogate-pairsI have a problem with Unicode surrogates encoding in Python (3.4): >>> b'\xCC'.decode('utf-16_be', 'surrogateescape').encode(…
python unicode utf-16 surrogate-pairs