2015-04-02 49 views

Mahout trainAdaptiveLogistic

I am trying to train a logistic regression model with Mahout. The command line and its output are as follows:

mahout trainAdaptiveLogistic --passes 100 --input /home/cloudera/Desktop/final.csv --features 20 --output /home/cloudera/Desktop/model/adaptivemodel --target Action --categories 2 --predictors Open High Close --types n n n 
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath. 
Running on hadoop, using /usr/lib/hadoop/bin/hadoop and HADOOP_CONF_DIR=/etc/hadoop/conf 
MAHOUT-JOB: /usr/lib/mahout/mahout-examples-0.7-cdh4.7.1-job.jar 
15/04/02 07:34:53 WARN driver.MahoutDriver: No trainAdaptiveLogistic.props found on classpath, will use command-line arguments only 
20 
Action ~ 0.000*Close + 0.000*High + 0.000*Open 
       Close 0.00003 
       High 0.00004 
       Open 0.00003 
    0.000000000  0.000033367  0.000036516  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000000000  0.000034630  0.000000000  0.000000000 
15/04/02 07:38:30 INFO driver.MahoutDriver: Program took 216959 ms (Minutes: 3.6159833333333333) 

The first few lines of the input file are:

Open,High,Low,Close,Volume,Adj Close,Action 
59.30,60.05,58.88,59.41,3373800,59.41,BUY 
59.64,60.26,58.88,59.83,3069100,59.83,BUY 
58.91,59.25,58.21,59.03,3559500,59.03,SELL 
59.57,60.44,58.67,58.68,3302000,58.68,BUY 

Why are the coefficients so small, practically zero? Is it wrong to try to predict a word (BUY/SELL) from numeric predictors?

1 Answer

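This looks like a case of the wrong type being used for the target variable "Action". Here it should be categorical rather than plain text. You could try recoding it in the data as a binary variable (0 for SELL, 1 for BUY) and then using the appropriate feature encoder.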
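
As a rough illustration of the recoding step suggested above, here is a minimal Python sketch that rewrites the `Action` column as 0/1 before the file is handed to Mahout. The function name `encode_action` and the `LABELS` mapping are made up for this example, and whether this alone fixes the near-zero coefficients is not confirmed by the thread.

```python
import csv
import io

# Assumed label mapping, following the suggestion above: 0 = SELL, 1 = BUY.
LABELS = {"SELL": "0", "BUY": "1"}

def encode_action(src, dst, target="Action"):
    """Copy CSV rows from src to dst, replacing the text label in `target` with 0/1."""
    reader = csv.DictReader(src)
    writer = csv.DictWriter(dst, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        # Normalise whitespace and case so "buy " and "BUY" map to the same code.
        row[target] = LABELS[row[target].strip().upper()]
        writer.writerow(row)

# Two rows taken from the sample data in the question.
sample = """Open,High,Low,Close,Volume,Adj Close,Action
59.30,60.05,58.88,59.41,3373800,59.41,BUY
58.91,59.25,58.21,59.03,3559500,59.03,SELL
"""
out = io.StringIO()
encode_action(io.StringIO(sample), out)
print(out.getvalue())
```

With real files, `src` and `dst` would be opened with `open(..., newline="")`, and the recoded file could then be passed to the same `--input` option used in the question.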


Thanks, I will definitely look into that. – Chiffa 2015-09-16 21:20:05