Top "Character-properties" questions

character-properties are a set of attributes supplied by the Unicode Standard.

What is the {L} Unicode category?

I came across some regular expressions that contain [^\\p{L}]. I understand that this is using some form of a …

java regex unicode character-properties
Matching Unicode Dashes in Java Regular Expressions?

I'm trying to craft a Java regular expression to split strings of the general format "foo - bar" into "foo" …

java regex unicode character-properties
How to know the preferred display width (in columns) of Unicode characters?

In different encodings of Unicode, for example UTF-16le or UTF-8, a character may occupy 2 or 3 bytes. Many Unicode applications …

unicode text-formatting character-properties mbcs
POSIX character equivalents in Java regular expressions

I would like to use a regular expression like this in Java : [[=a=][=e=][=i=]]. But Java doesn't support the …

java regex posix-ere character-properties
List of Unicode alphabetic characters

I need the list of ranges of Unicode characters with the property Alphabetic as defined in http://www.unicode.org/…

unicode alphabetic character-properties
Python: Split unicode string on word boundaries

I need to take a string, and shorten it to 140 characters. Currently I am doing: if len(tweet) > 140: tweet = …

python unicode internationalization character-properties