2016-08-19 83 views
1

R: How to remove \n and <br /> from text data

I have text data that looks like this:

hold that\nagainst me. i spend most of my days trying to build cool stuff for\nmy company. <br />\n<br />\ni'm an entrepreneur (like everyone else in sf, it seems) and i love\nwhat i do. 

I used the command below, but it only removes the \n; the <br /> is still there.

gsub("\n <br />", " ", h) 

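When I tried the following command instead, both were removed, but the letters "b" and "r" in the actual text were removed as well. So I would like to know the correct way to remove both.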

gsub("[\n <br />]", " ", h) 

Answers

3
text <- "hold that\nagainst me. i spend most of my days trying to build cool stuff for\nmy company. <br />\n<br />\ni'm an entrepreneur (like everyone else in sf, it seems) and i love\nwhat i do." 

Use (pat1|pat2) to choose between alternative patterns ([] only works for alternatives that are single characters).

gsub("(\n|<br />)"," ",text) 
## [1] "hold that against me. i spend most of my days trying to build cool stuff for my company.     i'm an entrepreneur (like everyone else in sf, it seems) and i love what i do."
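To see why the bracket version from the question also eats letters, here is a minimal sketch comparing the two patterns. The sample string `x` and the variable names are made up for illustration and are not from the question.

```r
# Hypothetical sample string (not the asker's data)
x <- "my brother\nsaid <br />hello"

# Character class: every newline, space, '<', 'b', 'r', '/' and '>' is
# treated as a separate single character and replaced on its own
class_result <- gsub("[\n <br />]", " ", x)

# Alternation: only a newline or the literal sequence "<br />" is replaced
alt_result <- gsub("(\n|<br />)", " ", x)

print(class_result)  # "brother" has lost its 'b' and both 'r's
print(alt_result)    # the words are left intact
```

Each match is replaced by its own space, so adjacent matches leave runs of spaces behind; if that matters, a follow-up `gsub("\\s+", " ", alt_result)` collapses them.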
+1

One does not have to escape '\n'. Also, it is better to match optional whitespace between 'br' and '>', and to allow an optional '/': [`gsub("[\n\r]|…", "", text)`](http://ideone.com/V3jwfP) –
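A rough sketch of what that comment describes. The pattern below is an assumption reconstructed from the comment's wording (optional whitespace before an optional '/'), not a verbatim copy of the commenter's code, and the test strings are hypothetical.

```r
# Hypothetical inputs: three common spellings of the tag, plus a Windows line ending
variants <- c("a<br>b", "a<br/>b", "a<br />b", "a\r\nb")

# Assumed pattern: a CR or LF, or "<br" followed by optional whitespace,
# an optional "/", and then ">"
pattern <- "[\r\n]|<br\\s*/?>"

cleaned <- gsub(pattern, " ", variants)

# Collapse runs of whitespace left behind by adjacent matches
cleaned <- gsub("\\s+", " ", cleaned)

print(cleaned)
```

The literal pattern "<br />" would miss "<br>" and "<br/>", and matching only "\n" would leave the "\r" of a Windows line ending in place, which is presumably why the looser pattern is suggested.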

+0

Can you tell me why it is better to allow for these characters? I am very new to text data. – lll