如何從文本文件中讀取數據在HDFS中使用Rhipe，R？

我是新來的R，Rhipe和Hadoop想從文件，其內容是讀取數據，如何從文本文件中讀取數據在HDFS中使用Rhipe，R？

<Author>fallriverma 
<Content>Quality hotel at great price Very clean. 
<Date>Nov 25, 2008 
<Rating>5 5 5 5 5 5 5 5 
<Aspects> 
1 8826(grat):1  
3 3(clean):1 19(price):1 187(quality):1 
0 
0 
0 
3 0(staff):1 12(friendly):1 14(helpful):1 
3 6(breakfast):1 46(free):1 333(selection):1  
0 

<Author>yondaime1845 
<Content>Its the best of the best for a reason One of the more affordable and better hotels in the city of seattle. 
<Date>Jan 2, 2008 
<Rating>5 5 5 5 5 5 5 5 
<Aspects> 
4 41(city):1 374(reason):1 762(seattle):1 1062(affordable):1 
0 
0 
4 1(location):1 66(park):1 143(cheap):1 186(convenient):1 
0 
0 
4 5(time):1 9(service):1 12(friendly):1 608(employee):1 
0

我想從讀起來像8826的作者和方面價值「8826（qrat）：1」，並想顯示他們使用Hadoop，rhipe和R逐列米

的希望您的建議提前

來源

2013-02-27 Syed Raza

我試過這個，但不允許我多一個分隔符「ny <-read.table（pipe（」hadoop dfs -cat'new1'「），sep =」>：「，header = TRUE）」 – 2013-02-27 11:56:12

new1是我的文件，其中包含數據，如後所述 – 2013-02-28 04:47:14

rhls("/user/notroot/input/")

謝謝你，給你HDFS的完整路徑位置 - 它會工作

來源

2013-04-08 06:56:08 krishna

如何從文本文件中讀取數據在HDFS中使用Rhipe，R？

回答

相關問題