Group by min and fill NAs with value from another column, part 2
I have a DataFrame like this:
import numpy as np
import pandas as pd

mydf = pd.DataFrame(data={
    'uid': [1, 1, 1, 2, 2, 3, 4, 4, 4, 4],
    'pagename': ['home', 'blah', 'blah', 'home', 'blah', 'blah', 'blah', 'home', 'blah', 'blah'],
    'startpage': [np.nan, np.nan, np.nan, 'home', 'home', 'blah', np.nan, np.nan, np.nan, np.nan],
    'date_time': [0, 1, 2, 5, 9, 1, 1, 2, 3, 4],
    'page_event': [0, 0, 0, 0, 0, 0, 10, 0, 0, 10],
})
I want to end up with this DataFrame:
endingdf = pd.DataFrame(data={
    'uid': [1, 1, 1, 2, 2, 3, 4, 4, 4, 4],
    'pagename': ['home', 'blah', 'blah', 'home', 'blah', 'blah', 'blah', 'home', 'blah', 'blah'],
    'startpage': [np.nan, np.nan, np.nan, 'home', 'home', 'blah', np.nan, np.nan, np.nan, np.nan],
    'date_time': [0, 1, 2, 5, 9, 1, 1, 2, 3, 4],
    'page_event': [0, 0, 0, 0, 0, 0, 10, 0, 0, 10],
    'new_start_page': ['home', 'home', 'home', 'home', 'home', 'blah', 'home', 'home', 'home', 'home'],
})
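What I want to do is group by uid and, where startpage is null, use the pagename of the first visit (the row with the minimum date_time), but only counting rows where page_event = 0. So if the first pagename in a group has page_event = 10, that row should be skipped, and so on, until a row with page_event = 0 is reached.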
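One possible approach, offered only as a sketch and not as the accepted answer: filter to the rows with page_event == 0, take the earliest pagename per uid, and use that to fill the missing startpage values. The variable name first_page is made up for illustration, and the sketch assumes every uid has at least one row with page_event == 0 (otherwise that uid would stay NaN).

```python
import numpy as np
import pandas as pd

mydf = pd.DataFrame(data={
    'uid': [1, 1, 1, 2, 2, 3, 4, 4, 4, 4],
    'pagename': ['home', 'blah', 'blah', 'home', 'blah', 'blah', 'blah', 'home', 'blah', 'blah'],
    'startpage': [np.nan, np.nan, np.nan, 'home', 'home', 'blah', np.nan, np.nan, np.nan, np.nan],
    'date_time': [0, 1, 2, 5, 9, 1, 1, 2, 3, 4],
    'page_event': [0, 0, 0, 0, 0, 0, 10, 0, 0, 10],
})

# Earliest pagename per uid, looking only at rows where page_event == 0.
# first_page is a hypothetical helper name: a Series indexed by uid.
first_page = (
    mydf[mydf['page_event'] == 0]
    .sort_values('date_time', kind='stable')
    .groupby('uid')['pagename']
    .first()
)

# Keep an existing startpage; otherwise fall back to the per-uid first page.
mydf['new_start_page'] = mydf['startpage'].fillna(mydf['uid'].map(first_page))

print(mydf)
```

The fillna call aligns on the row index, so rows that already have a startpage (uid 2 and uid 3 here) are left unchanged, and only the null rows pick up the mapped value.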