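This code is meant to take a file as input, convert all letters to lowercase, and remove any non-letter characters. It should then print the number of occurrences of each word in the file. I get a syntax error when it reads the file.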
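#!/usr/bin/python
import sys

def main(argv):
    try:
        tf = open(argv[1], "r")
    except IOError:
        # tf is unbound when open() fails, so report the path instead
        print("The file %s was not found" % argv[1])
        sys.exit(1)
    data = tf.read()
    tf.close()
    # str methods return new strings, so the results must be kept
    data = data.lower()
    data = data.replace("-", " ")
    validLetters = " abcdefghijklmnopqrstuvwxyz"
    # keep all whitespace so words on adjacent lines are not glued together
    cleanData = ''.join([i for i in data if i in validLetters or i.isspace()])
    frequency = {}
    words = cleanData.split()
    for x in words:
        if x in frequency:
            frequency[x] = frequency[x] + 1
        else:
            # first occurrence of this word
            frequency[x] = 1
    print(sorted(frequency.values()))

if __name__ == "__main__":
    main(sys.argv)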
Here is what I get on the command line:
$ python -m py_compile q1_word_count.py drake.txt
File "drake.txt", line 1
I Was A Teenage Hacker
^
SyntaxError: invalid syntax
"I Was A Teenage Hacker" is the first line of the text file.
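Just run it like this: 'python q1_word_count.py drake.txt'. 'python -m py_compile' byte-compiles every file named on the command line, so it tried to compile drake.txt as Python source, which is where the SyntaxError comes from. Precompiling gives only a very small speed gain at startup, none during execution; it is bytecode compilation, not JIT compilation.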
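To see why drake.txt triggers the error, here is a minimal sketch that asks py_compile to byte-compile a single file and reports whether that worked. The helper name compiles_as_python is made up for illustration; py_compile.compile and its cfile and doraise parameters are from the standard library. It assumes Python 3.

```python
import os
import py_compile
import tempfile


def compiles_as_python(path):
    """Return True if the file at `path` byte-compiles as Python source.

    Hypothetical helper for illustration only.
    """
    # Write the bytecode into a throwaway directory so nothing is left behind.
    with tempfile.TemporaryDirectory() as tmp:
        try:
            # doraise=True raises PyCompileError instead of printing to stderr.
            py_compile.compile(path, cfile=os.path.join(tmp, "out.pyc"), doraise=True)
        except py_compile.PyCompileError:
            return False
    return True


if __name__ == "__main__":
    import sys
    # Check every file named on the command line, much like
    # 'python -m py_compile' treats each argument as source to compile.
    for name in sys.argv[1:]:
        print(name, compiles_as_python(name))
```

Run against q1_word_count.py and drake.txt, this should report that the script compiles and the text file does not, because a line such as "I Was A Teenage Hacker" is not valid Python.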