0
我的工作與數據框採用這種結構大熊貓的Python中的重複數據幀,採取信息重複行的每個組合
id,date,id_client,optionin,optionout
1,09/01/2017,123456,11,12
2,09/01/2017,123456,12,14
3,09/02/2017,1111111,85,45
4,09/02/2017,1111111,45,35
5,09/02/2017,1111111,35,58
6,09/01/2017,528585,1,2
7,09/01/2017,548123,37,12
8,09/01/2017,123588,117,512
9,09/01/2017,981358,116,152
我想在同一天擺脫重複的條目在同客戶。 我只想第一optionin的數據,並在同一行中的最後一個optionout,並與optionout
像這樣
id,id_end,date,id_client,optionin,optionout
1,2,09/01/2017,123456,11,14
3,5,09/02/2017,1111111,85,58
6,6,09/01/2017,528585,1,2
7,7,09/01/2017,548123,37,12
8,8,09/01/2017,123588,117,512
9,9,09/01/2017,981358,116,152
我如何能做到這一點的ID的新列?可能嗎?
節省一個步驟:'df.groupby([ '日期', 'id_client'],as_index =假).agg({'optionin':'first','optionout':'last'})':) – Wen