2016-05-11 51 views
0

假設我們有一個數據幀df蟒蛇熊貓據幀:參數「保持」在drop_duplicates不工作()/複製

df=pd.DataFrame() 
df['c1']=[1,2,3,3,4] 
df['c2']=["a1","a2","a2","a2","a1"] 
df['c3']=[1,2,3,3,5] 

如果我請使用df.drop_duplicates(keep=False)df.duplicated(keep=False),我得到以下錯誤:

File "C:\Users\Kanika\Anaconda\lib\site-packages\pandas\util\decorators.py", line 88, in wrapper 
return func(*args, **kwargs) 

TypeError: duplicated() got an unexpected keyword argument 'keep' 
+0

的可能的複製[蟒:分離出具有在重複數據幀熊貓行(http://stackoverflow.com/questions/37154268/python-separate-out -row-which-have-duplicates-in-panda-dataframe) – ysearka

+0

你使用的是什麼版本的熊貓,因爲這個版本在0.18.0版本中工作正常 – EdChum

+0

@ysearka它不是重複的,OP在調用時遇到錯誤那些方法 – EdChum

回答

1

應當更新版本的熊貓,因爲它是從0.17.0版本增加,從what's new in v. 0.17.0

  • drop_duplicates and duplicated now accept a keep keyword to target first, last, and all duplicates.

在大熊貓0.18.1兩項工程:

In [116]: df 
Out[116]: 
    c1 c2 c3 
0 1 a1 1 
1 2 a2 2 
2 3 a2 3 
3 3 a2 3 
4 4 a1 5 

In [117]: df.drop_duplicates() 
Out[117]: 
    c1 c2 c3 
0 1 a1 1 
1 2 a2 2 
2 3 a2 3 
4 4 a1 5 

In [118]: df.drop_duplicates(keep=False) 
Out[118]: 
    c1 c2 c3 
0 1 a1 1 
1 2 a2 2 
4 4 a1 5