Convert Unicode/UTF-8 string to lower/upper case using pure & pythonic library

Viet picture Viet · Jan 27, 2010 · Viewed 16.5k times · Source

I use Google App Engine and cannot use any C/C++ extension, just pure & pythonic library to do conversion of Unicode/UTF-8 strings to lower/upper case. str.lower() and string.lowercase() don't.

Answer

Ignacio Vazquez-Abrams picture Ignacio Vazquez-Abrams · Jan 27, 2010

str encoded in UTF-8 and unicode are two different types. Don't use string, use the appropriate method on the unicode object:

>>> print u'ĉ'.upper()
Ĉ

Decode str to unicode before using:

>>> print 'ĉ'.decode('utf-8').upper()
Ĉ