我正在處理telugu文本以分析幾個文本標記。 >>> sent = "నా పేరు కరీం ఉంది. నేను భారత ఆహార ప్రేమ.".decode('utf-8')
>>> text = sent
>>> text = nltk.word_tokenize(text)
>>> result = nltk.pos_tag(text)
>>> for val in re
我開始甩使用含有特定的句子中的文件: with open(labelFile, "wb") as out:
json.dump(result, out,indent=4)
的JSON中這句話是這樣的: "-LSB- 97 -RSB- However , the influx of immigrants from mainland China , approximating NUMB