我一直試圖製作一個顯示因子水平的計數表的另一個因素。爲此,我查看了幾十頁,問題......試圖在某些軟件包(dplyr,reshape)中使用函數來完成工作,但沒有正確使用它們的任何成功。聚合因子水平計數 - 按因子
這就是我的了:
# my data:
var1 <- c("red","blue","red","blue","red","red","red","red","red","red","red","red","blue","red","blue")
var2 <- c("0","1","0","0","0","0","0","0","0","0","1","0","0","0","0")
var3 <- c("2","2","1","1","1","3","1","2","1","1","3","1","1","2","1")
var4 <- c("0","1","0","0","0","0","1","0","1","1","0","1","0","1","1")
mydata <- data.frame(var1,var2,var3,var4)
head(mydata)
嘗試N + 1:僅顯示通過另一個因素因素的總次數。
t(aggregate(. ~ var1, mydata, sum))
[,1] [,2]
var1 "blue" "red"
var2 " 5" "12"
var3 " 5" "18"
var4 " 6" "16"
嘗試n + 2:這是正確的格式,但我不能讓它在多個因素上工作。
library(dplyr)
data1 <- ddply(mydata, c("var1", "var3"), summarise,
N = length(var1))
library(reshape)
df1 <- cast(data1, var1 ~ var3, sum)
df1 <- t(df1)
df1
blue red
1 3 6
2 1 3
3 0 2
我想的是:
blue red
var2.0 3 10
var2.1 1 1
var3.1 3 6
var3.2 1 3
var3.3 0 2
var4.0 2 6
var4.1 2 5
我怎樣才能得到這種格式?因此,許多在此先感謝,
是的,編輯。謝謝! – Mareviv