Top "Utf-8" questions

UTF-8 is a character encoding that describes each Unicode code point using a byte sequence of one to four bytes.

Language codes for simplified Chinese and traditional Chinese?

We are creating multi-language subsites on our website. I would like to use the 2-letter language codes. Spanish and French …

utf-8 internationalization cjk chinese-locale
Getting ’ instead of an apostrophe(') in PHP

I've tried converting the text to or from utf8, which didn't seem to help. I'm getting: "It’s Getting the …

php utf-8 character-encoding mojibake
Displaying the hex value of a string from a oracle varchar2?

We are having problems with text that is encoded in some different ways but kept in a single column in …

oracle unicode utf-8 turkish ojdbc
How to write UTF-8 characters using bulk insert in SQL Server?

I am doing a BULK INSERT into sqlserver and it is not inserting UTF-8 characters into database properly. The data …

sql-server utf-8 sql-server-2008-r2 bulkinsert
How to convert csv files encoding to utf-8

Is it possible to convert csv data that has iso-8859-13 encoding to UTF-8? My old system does not have …

csv encoding utf-8 import converter
How to convert a string to UTF8 in Ruby

I'm writing a crawler which uses Hpricot. It downloads a list of strings from some webpage, then I try to …

ruby file encoding utf-8 dump
setting a UTF-8 in java and csv file

I am using this code for add Persian words to a csv file via OpenCSV: String[] entries="\u0645 \u062E\…

java unicode csv utf-8 opencsv
Really Good, Bad UTF-8 example test data

So we have the XSS cheat sheet to test our XSS filtering - but other than an example benign page …

unicode utf-8
How many characters can UTF-8 encode?

If UTF-8 is 8 bits, does it not mean that there can be only maximum of 256 different characters? The first 128 code …

utf-8 character-encoding ascii
What is the encoding of Chinese characters on Wikipedia?

I was looking at the encoding of Chinese characters on Wikipedia and I'm having trouble figuring out what they are …

utf-8 character-encoding cjk url-encoding