nltk.tokenize.sent_tokenize()

nltk.tokenize.sent_tokenize(text, language='english')[source]

Return a sentence-tokenized copy of text, using NLTK’s recommended sentence tokenizer (currently PunktSentenceTokenizer for the specified language).

Parameters:
  • text – text to split into sentences
  • language – the model name in the Punkt corpus