nltk.tag.StanfordTagger

class nltk.tag.StanfordTagger(model_filename, path_to_jar=None, encoding='utf8', verbose=False, java_options='-mx1000m')[source]

An interface to Stanford taggers. Subclasses must define:

  • _cmd property: A property that returns the command that will be executed.
  • _SEPARATOR: Class constant that represents that character that is used to separate the tokens from their tags.
  • _JAR file: Class constant that represents the jar file name.

Methods

__init__(model_filename[, path_to_jar, ...])
evaluate(gold) Score the accuracy of the tagger against the gold standard.
parse_output(text[, sentences])
tag(tokens)
tag_sents(sentences)