Apply str_split to a column in a data frame
I have the following data frame named i:
structure(list(price = c(11772, 14790, 2990, 1499, 21980, 27999
), fuel = c("diesel", "petrol", "petrol", "diesel", "diesel",
"petrol"), gearbox = c("manual", "manual", "manual", "manual",
"automatic", "manual"), colour = c("white", "purple", "yellow",
"silver", "red", "rising blue metalli"), engine_size = c(1685,
1199, 998, 1753, 2179, 1984), mileage = c(18839, 7649, 45058,
126000, 31891, 100), year = c("2013 hyundai ix35", "2016 citroen citroen ds3 cabrio",
"2007 peugeot 107 hatchback", "2007 ford ford focus hatchback", "2012 jaguar xf saloon",
"2016 volkswagen scirocco coupe"), doors = c(5, 2, 3, 5, 4, 3
)), .Names = c("price", "fuel", "gearbox", "colour", "engine_size",
"mileage", "year", "doors"), row.names = c(NA, 6L), class = "data.frame")
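Some words in the "year" column are duplicated, and I want to remove them. As a first step, I want to split each string in this column into separate words. I can do this for a single string, but when I try to apply it to the whole data frame it gives an error: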
unlist(str_split("2013 hyunday ix35", "[[:blank:]]"))
[1] "2013" "hyunday" "ix35"
for(k in 1:nrow(i))
+ i[k,7]<-unlist(str_split(i[k, 7], "[[:blank:]]"))
Error in `[<-.data.frame`(`*tmp*`, k, 7, value = c("2013", "hyundai", :
  replacement has 3 rows, data has 1
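The message suggests that a single cell such as i[k, 7] has room for one value, while the split returns three or more. A minimal sketch of one way around this is to keep the pieces in a list rather than writing them back into the cell. It assumes a small stand-in vector `year` (values copied from the data frame above, not the full column) and uses base `strsplit` in place of `str_split` so that no package is needed:

```r
# Stand-in for i$year (assumption: a few values copied from the data frame above)
year <- c("2013 hyundai ix35",
          "2016 citroen citroen ds3 cabrio",
          "2007 ford ford focus hatchback")

# strsplit is vectorised: it returns a list with one character vector per input string
words <- strsplit(year, "[[:blank:]]")

lengths(words)  # number of words found in each string
words[[2]]      # the words of the second string
```

Because the rows split into different numbers of words, a list is a more natural container here than a single data frame column of strings.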
It works fine. I tried to use sapply, but I didn't know how to combine the two functions (paste and unique). – Vasile
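One possible way to combine paste and unique is to wrap both in an anonymous function passed to sapply. This is only a sketch: `dedupe_words` is a hypothetical helper name, `year` is a stand-in for i$year, and base `strsplit` is used instead of `str_split`. It also assumes that any repeated word in a string should be kept once, not only words repeated side by side:

```r
# Stand-in for i$year (assumption: a few values copied from the data frame above)
year <- c("2016 citroen citroen ds3 cabrio",
          "2007 ford ford focus hatchback",
          "2012 jaguar xf saloon")

# Hypothetical helper: split on blanks, drop repeated words, glue the rest back together
dedupe_words <- function(x) {
  sapply(strsplit(x, "[[:blank:]]"),
         function(w) paste(unique(w), collapse = " "))
}

dedupe_words(year)
```

If the result looks right, it could be written back with something like i$year <- dedupe_words(i$year), which assigns one string per row and so avoids the length mismatch from the loop.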