2016-11-26 57 views
1

我檢索下列數據幀的cummax()值,蟒熊貓 - 檢索在cummax()索引時間戳值

     exit_price trend netgain  high  low MFE_pr 
exit_time                 
2000-02-01 01:00:00  1400.25  -1  1.00 1401.50 1400.25 1400.25 
2000-02-01 01:30:00  1400.75  -1  0.50 1401.00 1399.50 1399.50 
2000-02-01 02:00:00  1400.00  -1  1.25 1401.00 1399.75 1399.50 
2000-02-01 02:30:00  1399.25  -1  2.00 1399.75 1399.25 1399.25 
2000-02-01 03:00:00  1399.50  -1  1.75 1400.00 1399.50 1399.25 
2000-02-01 03:30:00  1398.25  -1  3.00 1399.25 1398.25 1398.25 
2000-02-01 04:00:00  1398.75  -1  2.50 1399.00 1398.25 1398.25 
2000-02-01 04:30:00  1400.00  -1  1.25 1400.25 1399.00 1398.25 
2000-02-01 05:00:00  1400.25  -1  1.00 1400.50 1399.25 1398.25 
2000-02-01 05:30:00  1400.50  -1  0.75 1400.75 1399.50 1398.25 

用下面的公式

trade ['MFE_pr'] = np.nan 
trade ['MFE_pr'] = trade ['MFE_pr'].where(trade ['trend']<0, trade.high.cummax()) 
trade ['MFE_pr'] = trade ['MFE_pr'].where(trade ['trend']>0, trade.low.cummin()) 

是有一種方法檢索從哪一行取得cummax()的行的時間戳?與.idxmax()類似,但對於cummax()?

回答

1

這可能是你在找什麼。

import pandas as pd 
import datetime 
df = pd.DataFrame({'a': [1, 2, 1, 3, 2, 5, 4, 3, 5]}, 
        index=pd.DatetimeIndex(start= 
             datetime.datetime.fromtimestamp(0), 
             periods=9, freq='D')) 
df['cummax'] = df.a.cummax() 
df['timestamp'] = df.index 
df = df.merge(df.groupby('cummax')[['timestamp']].first().reset_index(), on='cummax') 
df.rename(columns={'timestamp_y': 'max_timestamp'}, inplace=True) 
df.index=df.timestamp_x.values 
del df['timestamp_x'] 
print(df) 


        a cummax  max_timestamp 
1970-01-01 03:00:00 1  1 1970-01-01 03:00:00 
1970-01-02 03:00:00 2  2 1970-01-02 03:00:00 
1970-01-03 03:00:00 1  2 1970-01-02 03:00:00 
1970-01-04 03:00:00 3  3 1970-01-04 03:00:00 
1970-01-05 03:00:00 2  3 1970-01-04 03:00:00 
1970-01-06 03:00:00 5  5 1970-01-06 03:00:00 
1970-01-07 03:00:00 4  5 1970-01-06 03:00:00 
1970-01-08 03:00:00 3  5 1970-01-06 03:00:00 
1970-01-09 03:00:00 5  5 1970-01-06 03:00:00 
+0

是的這是非常有用的。 快速的問題,你建議從索引創建一個新的列'時間戳'來執行groupby: 'df ['timestamp'] = df.index' 有沒有辦法直接在索引上執行它? –

+0

我不能肯定地說,但似乎'df.groupby(...).first()'失去了原始數據幀的索引。我嘗試添加'as_index = False'選項,但它不起作用。如果不用索引副本創建temprorary列,現在無法找到這種方法。 –