`nltk.classify.RTEFeatureExtractor`¶

class nltk.classify.RTEFeatureExtractor(rtepair, stop=True, lemmatize=False)[source]¶: This builds a bag of words for both the text and the hypothesis after throwing away some stopwords, then calculates overlap and difference.

Methods¶

__init__(rtepair[, stop, lemmatize])

param rtepair:	a `RTEPair` from which features should be extracted

hyp_extra(toktype[, debug]) Compute the extraneous material in the hypothesis.

overlap(toktype[, debug]) Compute the overlap between text and hypothesis.