Unicode string normalization in C/C++

Ghassen Hamrouni · Feb 3, 2011

I am wondering how to normalize Unicode strings (encoded as UTF-8 or UTF-16) in C/C++. In .NET there is a function for this, String.Normalize.

I have used UTF8-CPP in the past, but it does not provide such a function. ICU and Qt provide string normalization, but I would prefer a lightweight solution.

Is there any "lightweight" solution for this?

Answer

Avi · Feb 3, 2011

As I wrote in another question, utf8proc is a very nice, lightweight library for basic Unicode functionality, including Unicode string normalization.