2015-03-03 55 views
0

我正在努力做一些關於熊貓數據框的數據爭奪。我一直在這個問題上停留了2天。與python熊貓數據爭執

我有一個數據幀,看起來像這樣:

['a','e'] 
['b','f'] 
['c','g'] 
['d','h'] 
['a','i'] 
['b','j'] 
['c','k'] 
['d','l'] 

,我需要把它變成這樣看

['a',['e','i']] 
['b',['f','j']] 
['c',['g','k']] 
['d',['h','l']] 

所以基本上旋轉第一列周圍的原始數據幀和然後從第二列創建一個字符串列表。

感謝

回答

3

你可以使用groupy/agg

import pandas as pd 

data = [['a','e'], ['b','f'], ['c','g'], ['d','h'], ['a','i'], ['b','j'], 
     ['c','k'], ['d','l']] 

df = pd.DataFrame(data, columns=['first', 'second']) 
print(df.groupby(['first']).agg(lambda x: x.tolist())) 

產量

 second 
first   
a  [e, i] 
b  [f, j] 
c  [g, k] 
d  [h, l] 
+0

奏效謝謝!看起來很簡單。 – user1274037 2015-03-03 03:19:04