2013-11-01

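Error when indexing Nutch data into Solr

I am trying to index my crawled data from Nutch into Solr and I get the following error. Any help would be greatly appreciated.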

SOLRIndexWriter 
solr.server.url : URL of the SOLR instance (mandatory) 
solr.commit.size : buffer size when sending to SOLR (default 1000) 
solr.mapping.file : name of the mapping file for fields (default solrindex-mapping.xml) 
solr.auth : use authentication (default false) 
solr.auth.username : use authentication (default false) 
solr.auth : username for authentication 
solr.auth.password : password for authentication 


Exception in thread "main" java.io.IOException: Job failed! 
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) 
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123) 
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:81) 
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:65) 
at org.apache.nutch.crawl.Crawl.run(Crawl.java:155) 
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) 
at org.apache.nutch.crawl.Crawl.main(Crawl.java:55) 

What are the contents of the logs/hadoop.log file? – nimeshjm

Answer

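Have you looked at the Solr logs? They record the actual cause of the error. I once ran into the same problem with Nutch and found the message "unknown field host" in the Solr log. After I edited schema.xml, the problem went away.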
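
To check whether the same cause applies, the Solr log can be searched for "unknown field" messages. The following is only a rough sketch in Python: the function name `find_unknown_fields` is hypothetical, and the pattern assumes the message looks like `unknown field 'host'` (with or without quotes), which may differ between Solr versions.

```python
import re

# Assumption: Solr reports the problem as "unknown field 'name'" (quotes optional).
# The exact wording may vary by Solr version, so adjust the pattern if needed.
UNKNOWN_FIELD = re.compile(r"unknown field\s+['\"]?([\w.\-]+)['\"]?", re.IGNORECASE)


def find_unknown_fields(log_text):
    """Return the distinct field names reported as unknown, in first-seen order."""
    seen = []
    for match in UNKNOWN_FIELD.finditer(log_text):
        name = match.group(1)
        if name not in seen:
            seen.append(name)
    return seen
```

Passing the text of the Solr log to `find_unknown_fields` gives the field names Solr rejected. Each of those would presumably need a matching field definition in schema.xml, which is the file whose edit resolved the problem described above.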