nltk.text_type

nltk.text_type

alias of unicode

Methods

capitalize(() -> unicode) Return a capitalized version of S, i.e.
center((width[, fillchar]) -> unicode) Return S centered in a Unicode string of length width.
count((sub[, start[, end]]) -> int) Return the number of non-overlapping occurrences of substring sub in Unicode string S[start:end].
decode(...) Decodes S using the codec registered for encoding.
encode(...) Encodes S using the codec registered for encoding.
endswith((suffix[, start[, end]]) -> bool) Return True if S ends with the specified suffix, False otherwise.
expandtabs(([tabsize]) -> unicode) Return a copy of S where all tab characters are expanded using spaces.
find((sub [,start [,end]]) -> int) Return the lowest index in S where substring sub is found, such that sub is contained within S[start:end].
format((*args, **kwargs) -> unicode) Return a formatted version of S, using substitutions from args and kwargs.
index((sub [,start [,end]]) -> int) Like S.find() but raise ValueError when the substring is not found.
isalnum(() -> bool) Return True if all characters in S are alphanumeric and there is at least one character in S, False otherwise.
isalpha(() -> bool) Return True if all characters in S are alphabetic and there is at least one character in S, False otherwise.
isdecimal(() -> bool) Return True if there are only decimal characters in S, False otherwise.
isdigit(() -> bool) Return True if all characters in S are digits and there is at least one character in S, False otherwise.
islower(() -> bool) Return True if all cased characters in S are lowercase and there is at least one cased character in S, False otherwise.
isnumeric(() -> bool) Return True if there are only numeric characters in S, False otherwise.
isspace(() -> bool) Return True if all characters in S are whitespace and there is at least one character in S, False otherwise.
istitle(() -> bool) Return True if S is a titlecased string and there is at least one character in S, i.e.
isupper(() -> bool) Return True if all cased characters in S are uppercase and there is at least one cased character in S, False otherwise.
join((iterable) -> unicode) Return a string which is the concatenation of the strings in the iterable.
ljust((width[, fillchar]) -> int) Return S left-justified in a Unicode string of length width.
lower(() -> unicode) Return a copy of the string S converted to lowercase.
lstrip(([chars]) -> unicode) Return a copy of the string S with leading whitespace removed.
partition(sep) -> (head, sep, tail) Search for the separator sep in S, and return the part before it, the separator itself, and the part after it.
replace((old, new[, count]) -> unicode) Return a copy of S with all occurrences of substring old replaced by new.
rfind((sub [,start [,end]]) -> int) Return the highest index in S where substring sub is found, such that sub is contained within S[start:end].
rindex((sub [,start [,end]]) -> int) Like S.rfind() but raise ValueError when the substring is not found.
rjust((width[, fillchar]) -> unicode) Return S right-justified in a Unicode string of length width.
rpartition(sep) -> (head, sep, tail) Search for the separator sep in S, starting at the end of S, and return the part before it, the separator itself, and the part after it.
rsplit(([sep [,maxsplit]]) -> list of strings) Return a list of the words in S, using sep as the delimiter string, starting at the end of the string and working to the front.
rstrip(([chars]) -> unicode) Return a copy of the string S with trailing whitespace removed.
split(([sep [,maxsplit]]) -> list of strings) Return a list of the words in S, using sep as the delimiter string.
splitlines((keepends=False) -> list of strings) Return a list of the lines in S, breaking at line boundaries.
startswith((prefix[, start[, end]]) -> bool) Return True if S starts with the specified prefix, False otherwise.
strip(([chars]) -> unicode) Return a copy of the string S with leading and trailing whitespace removed.
swapcase(() -> unicode) Return a copy of S with uppercase characters converted to lowercase and vice versa.
title(() -> unicode) Return a titlecased version of S, i.e.
translate((table) -> unicode) Return a copy of the string S, where all characters have been mapped through the given translation table, which must be a mapping of Unicode ordinals to Unicode ordinals, Unicode strings or None.
upper(() -> unicode) Return a copy of S converted to uppercase.
zfill((width) -> unicode) Pad a numeric string S with zeros on the left, to fill a field of the specified width.