unicodedata

This module provides access to the Unicode Character Database which defines character properties for all Unicode characters. The data in this database is based on the UnicodeData.txt file version 5.2.0 which is publically available from ftp://ftp.unicode.org/.

The module uses the same names and symbols as defined by the UnicodeData File Format 5.2.0 (see http://www.unicode.org/reports/tr44/tr44-4.html).

Functions

bidirectional(unichr) Returns the bidirectional class assigned to the Unicode character unichr as string.
category(unichr) Returns the general category assigned to the Unicode character unichr as string.
combining(unichr) Returns the canonical combining class assigned to the Unicode character unichr as integer.
decimal(unichr[, default]) Returns the decimal value assigned to the Unicode character unichr as integer.
decomposition(unichr) Returns the character decomposition mapping assigned to the Unicode character unichr as string.
digit(unichr[, default]) Returns the digit value assigned to the Unicode character unichr as integer.
east_asian_width(unichr) Returns the east asian width assigned to the Unicode character unichr as string.
lookup(name) Look up character by name.
mirrored(unichr) Returns the mirrored property assigned to the Unicode character unichr as integer.
name(unichr[, default]) Returns the name assigned to the Unicode character unichr as a string.
normalize(form, unistr) Return the normal form ‘form’ for the Unicode string unistr.
numeric(unichr[, default]) Returns the numeric value assigned to the Unicode character unichr as float.

Classes

UCD