我想讀取一個文件,並創建一個字典,每個單詞作爲一個關鍵字,並將其作爲一個單詞作爲值。更新字典值與文件中的下一個單詞?
例如,如果我有一個包含文件:
'Cake is cake okay.'
創建應該包含的詞典:
{'cake': ['is', 'okay'], 'is': ['cake'], 'okay': []}
到目前爲止,我已經成功地做我的代碼相反。我已經用文件中的前一個字更新了字典值。我不太清楚如何改變它以使其按預期工作。
def create_dict(file):
word_dict = {}
prev_word = ''
for line in file:
for word in line.lower().split():
clean_word = word.strip(string.punctuation)
if clean_word not in word_dict:
word_dict[clean_word] = []
word_dict[clean_word].append(prev_word)
prev_word = clean_word
謝謝你們提前幫忙!
編輯
與最新進展:
def create_dict(file):
word_dict = {}
next_word = ''
for line in file:
formatted_line = line.lower().split()
for word in formatted_line:
clean_word = word.strip(string.punctuation)
if next_word != '':
if next_word not in word_dict:
word_dict[next_word] = []
if clean_word == '':
clean_word.
next_word = clean_word
return word_dict