在一段句子中是由各种词汇组成的.有名词,动词,形容词和副词.要理解这些句子,首先就需要将这些词类识别出来.将词汇按它们的词性(parts-of-speech,POS)分类并相应地对它们进行标注.这个过程叫做词性标注. 要进行词性标注,就需要用到词性标注器(part-of-speech tagger).代码如下 text=nltk.word_tokenize("customer found there are abnormal issue") print(nltk.pos_tag(tex
import nltk from nltk.book import * nltk.corpus.gutenberg.fileids() emma = nltk.corpus.gutenberg.words('austen-emma.txt') len(emma) emma = nltk.Text(nltk.corpus.gutenberg.words('austen-emma.txt')) emma.concordance("surprize") from nltk.corpus im