Top "UTF-8" questions

UTF-8 is a character encoding that describes each Unicode code point using a byte sequence of one to four bytes.
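
The one-to-four-byte range can be checked directly. The following is a minimal sketch, not part of the original tag description: the sample characters and the helper name `utf8_length` are chosen only for illustration.

```python
# Illustrative sketch: how many bytes UTF-8 needs for a few sample code points.
# The samples and the helper name utf8_length are assumptions for this example.

def utf8_length(ch: str) -> int:
    """Return the number of bytes UTF-8 uses to encode a single character."""
    return len(ch.encode("utf-8"))

samples = [
    "A",            # U+0041, ASCII range
    "\u00e9",       # U+00E9, Latin small letter e with acute
    "\u20ac",       # U+20AC, euro sign
    "\U0001F3B6",   # U+1F3B6, musical notes emoji (outside the Basic Multilingual Plane)
]

for ch in samples:
    encoded = ch.encode("utf-8")
    print(f"U+{ord(ch):04X} -> {encoded.hex()} ({utf8_length(ch)} bytes)")
```

Code points outside the Basic Multilingual Plane, such as the emoji in the last sample, are the ones that need four bytes.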

What is the proper way to URL encode Unicode characters?

I know of the non-standard %uxxxx scheme but that doesn't seem like a wise choice since the scheme has been …

unicode utf-8 character-encoding urlencode web-standards
PHP DOMDocument loadHTML not encoding UTF-8 correctly

I'm trying to parse some HTML using DOMDocument, but when I do, I suddenly lose my encoding (at least that …

php utf-8 character-encoding
Incorrect string value: '\xF0\x9F\x8E\xB6\xF0\x9F...' MySQL

I am trying to store a tweet in my MySQL table. The tweet is: quiero que me escuches, no te …

mysql twitter utf-8 emoticons
How to remove non UTF-8 characters from text file

I have a bunch of Arabic, English, and Russian files that are encoded in UTF-8. Trying to process these files using …

linux bash text utf-8 character-encoding
Why does modern Perl avoid UTF-8 by default?

I wonder why most modern solutions built using Perl don't enable UTF-8 by default. I understand there are many legacy …

perl unicode utf-8
UnicodeDecodeError: ('utf-8' codec) while reading a csv file

What I am trying to do is read a CSV to make a dataframe, make changes in a column, and again update/reflect the changed value …

python pandas utf-8 python-unicode
How to use Greek symbols in ggplot2?

My categories need to be named with Greek letters. I am using ggplot2, and it works beautifully with the data. …

r graphics unicode utf-8 ggplot2
Adding a line break to a UITextView

I have a UITextView that takes an NSString with formatting stringWithUTF8String. It is getting its values from a database …

utf-8 nsstring uitextview whitespace
Python decoding Unicode is not supported

I am having a problem with my encoding in Python. I have tried different methods but I can't seem to …

python encoding utf-8 character-encoding
How to print UTF-8 encoded text to the console in Python < 3?

I'm running a recent Linux system where all my locales are UTF-8: LANG=de_DE.UTF-8 LANGUAGE= LC_CTYPE="de_…

python shell encoding utf-8 ascii