2016-11-24 118 views
0

最近,我正在學習這本書 - 學習火花o-reilly-2015。我試圖運行Spark Streaming示例StreamingLogInput。的代碼如下:爲什麼我的火花流媒體演示不輸出任何東西

val conf = new SparkConf().setMaster(master).setAppName("StreamingLogInput") 
// Create a StreamingContext with a 1 second batch size 
val ssc = new StreamingContext(conf, Seconds(1)) 
// Create a DStream from all the input on port 7777 
val lines = ssc.socketTextStream("localhost", 7777) 
val errorLines = processLines(lines) 
// Print out the lines with errors, which causes this DStream to be evaluated 
errorLines.print() 
// start our streaming context and wait for it to "finish" 
ssc.start() 

def processLines(lines: DStream[String]) = { 
// Filter our DStream for lines with "error" 
lines.filter(_.contains("error")) 
} 

當我使用如下在singlenode機器運行該程序,

$SPARK_HOME/bin/spark-submit \ 
--class com.oreilly.learningsparkexamples.scala.StreamingLogInput \ 
--master spark://singlenode:7077 \ 
/home/hadoop/project/learning-spark/target/scala-2.10/learning-spark-examples_2.10-0.0.1.jar \ 
spark://singlenode:7077 

和在另一個窗口,I型的順序

nc -l 7777 

和鍵入一些假日誌 但沒有輸出錯誤日誌。 和日誌如下:?

16/11/24 04:20:48 INFO BlockManagerInfo: Added input-0-1479932447800 in memory 
on singlenode:37112 (size: 32.0 B, free: 267.2 MB) 
16/11/24 04:20:49 INFO JobScheduler: Added jobs for time 1479932449000 ms 
16/11/24 04:20:50 INFO JobScheduler: Added jobs for time 1479932450000 ms 
16/11/24 04:20:51 INFO JobScheduler: Added jobs for time 1479932451000 ms 
16/11/24 04:20:51 INFO BlockManagerInfo: Added input-0-1479932451000 in memory on singlenode:37112 (size: 33.0 B, free: 267.2 MB) 
16/11/24 04:20:52 INFO JobScheduler: Added jobs for time 1479932452000 ms 
16/11/24 04:20:53 INFO JobScheduler: Added jobs for time 1479932453000 ms 
16/11/24 04:20:54 INFO JobScheduler: Added jobs for time 1479932454000 ms 
16/11/24 04:20:55 INFO JobScheduler: Added jobs for time 1479932455000 ms 
16/11/24 04:20:56 INFO JobScheduler: Added jobs for time 1479932456000 ms 
16/11/24 04:20:57 INFO JobScheduler: Added jobs for time 1479932457000 ms 
16/11/24 04:20:58 INFO JobScheduler: Added jobs for time 1479932458000 ms 

爲什麼會出現這種情況的任何幫助表示讚賞!

+0

什麼是您正在運行的配置,請您分享您的羣集配置!這裏 –

+0

我只在我的虛擬機和一臺機器上運行程序。火花配置非常簡單,主人和工人在同一臺機器上運行。我可以成功運行其他火花程序,但不能像其他人一樣運行流。火花版本是1.3.1。 – Coinnigh

回答

0

我在提交應用程序時通過指定多個執行程序來解決它,例如local [3]。