nltk.PorterStemmer

class nltk.PorterStemmer

A word stemmer based on the Porter stemming algorithm.

Porter, M. “An algorithm for suffix stripping.” Program 14.3 (1980): 130-137.

A few minor modifications have been made to Porter’s basic algorithm. See the source code of this module for more information.

The Porter stemmer requires that every token be a string.
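A minimal usage sketch, assuming NLTK is installed and that the class can be imported from nltk.stem.porter. The word list is illustrative only, and the tokens are kept lowercase so the result does not depend on how a given release handles case.

```python
from nltk.stem.porter import PorterStemmer

stemmer = PorterStemmer()

# Every token passed to stem() must be a string.
words = ["running", "caresses", "ponies", "cats"]
stems = [stemmer.stem(word) for word in words]

for word, stem in zip(words, stems):
    print(word, "->", stem)
```

Note that a stem need not be a dictionary word: "ponies" is reduced to "poni".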

Methods

__init__()
stem(word)
stem_word(p[, i, j]): Returns the stem of p or, if i and j are given, the stem of p[i:j+1].
unicode_repr()
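The i and j arguments of stem_word are inclusive bounds: the stemmed substring is p[i:j+1], so j is the index of the last character, not one past it. The sketch below illustrates that slice. It calls stem() on the slice rather than stem_word itself, on the assumption that stem_word may not be present in every NLTK release; the string and indices are made up for illustration.

```python
from nltk.stem.porter import PorterStemmer

stemmer = PorterStemmer()

p = "is running fast"
i, j = 3, 9  # index of the first and last character of "running"

# j is inclusive, hence the j + 1 in the slice.
segment = p[i:j + 1]
stem_of_segment = stemmer.stem(segment)

print(segment, "->", stem_of_segment)
```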