2013-08-23 61 views
>>> import csv
>>> with open('Book2.csv', 'rb') as f:
    inpreader = csv.reader(f, delimiter=',')
    for row in inpreader:
        print ','.join(row)

IdNo, skillsList 
BAXA0000206_NEENA_TIWARI_0.htm,[u'Training', u'E-Learning', u'PowerPoint', u'Teaching', u'Accounting', u'Team Management', u'Team Building', u'Microsoft Excel', u'Microsoft Office', u'Financial Accounting', u'Microsoft Word', u'Customer Service'] 
BAXA0000227_ABDUR_RAZZAQUE_0.htm,[u'Telecommunications', u'Data Center', u'ISO 27001', u'Management', u'BS25999', u'Technology', u'Information Technology...', u'Certified PMP\xae', u'Certified BS25999 Lead...'] 
BAXA0000261_Priya _ Lobo_0.htm,[u'Market Research', u'Segmentation', u'Marketing Strategy', u'Consumer Behavior', u'Experience Working with...'] 

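I want to find the term frequency of each skill in skillsList. How should I go about it? Essentially, I want to group by the items in the lists.

Thanks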

Where is the frequency field? –

skillsList. I want the term frequency of each skill, for example Training: 45, E-Learning: 50 – user1946217

What does the CSV file look like? (I think you are currently showing the output of your code, not the input.) –

Answers

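counter = {}
# lists is expected to hold one list of skills per CSV row
for skill_list in lists:
    for skill in skill_list:
        # start at 0 for an unseen skill, then add one
        counter[skill] = counter.get(skill, 0) + 1

print counter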

I did this and it did not work. Can you point out where I went wrong?
>>> with open('Book2.csv', 'rb') as f:
    inpreader = csv.reader(f, delimiter=',')
    for row in inpreader:
        lists = ','.join(row)
>>> len(lists)
>>> counter = {}
>>> for skill_list in lists:
    for skill in skill_list:
        counter[skill] = counter.get(skill, 0) + 1
>>> print counter
{',': 1}
– user1946217

counter = {}
with open('Book2.csv', 'rb') as f:
    inpreader = csv.reader(f, delimiter=',')
    for row in inpreader:
        for skill in row[1]:
            counter[skill] = counter.get(skill, 0) + 1
        print ','.join(row)
print counter
[link](http://simonsagi.com/sites/stackoverflow/frequency.txt) –
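
One likely reason the attempts in the comments count the wrong thing: `','.join(row)` and `row[1]` are both strings, so looping over them yields single characters, not skills. Below is a minimal sketch of one way around that. It assumes the second column of the file holds the textual form of a Python list, as the printed output suggests; the real layout of Book2.csv is not shown in this thread. The helper name `count_skills` and the sample rows are made up for illustration, and the sketch is written for Python 3 rather than the Python 2 used above.

```python
import ast
import csv
import io
from collections import Counter


def count_skills(rows):
    """Count how often each skill appears across all rows (hypothetical helper)."""
    counter = Counter()
    for row in rows:
        if len(row) < 2:
            continue  # blank line or a row without a skills column
        # The reader may have split the list at its internal commas,
        # so glue everything after the id column back together.
        raw = ','.join(row[1:]).strip()
        try:
            skills = ast.literal_eval(raw)  # text "[u'a', u'b']" -> real list
        except (ValueError, SyntaxError):
            continue  # header row or a field that is not a list literal
        if isinstance(skills, list):
            counter.update(skills)
    return counter


# Made-up sample standing in for Book2.csv (assumed layout).
sample = io.StringIO(
    "IdNo, skillsList\n"
    "a.htm,[u'Training', u'E-Learning']\n"
    "b.htm,[u'Training', u'Management']\n"
)
counts = count_skills(csv.reader(sample, delimiter=','))
for skill, n in counts.most_common():
    print(skill, n)
```

To run it on the real file, pass `csv.reader(open('Book2.csv', newline=''))` instead of the sample. `ast.literal_eval` is used here rather than `eval` because it only accepts literals, which is safer for text read from a file.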