我有一個數據框如下(稱爲DAT)獲取頻率
chr chrStart chrEnd Gene RChr RStart REnd Rname distance
chr1 39841 39883 Gene1 chr1 398 3984 Cha1b 0
chr1 39841 39883 Gene1 chr1 398 3985 Ab 0
chr1 39841 39883 Gene1 chr1 398 3986 Tia 0
chr1 39841 39883 Gene1 chr1 398 3987 MEA 0
chr1 39841 39883 Gene1 chr1 398 3988 La 0
chr1 39841 39883 Gene1 chr1 398 3989 M3 0
chr1 14893 15893 Gene2 chr1 398 3984 Cha1b 0
chr1 14893 15893 Gene2 chr1 398 3985 Cha1b 0
chr1 14893 15893 Gene2 chr1 398 3986 Cha1b 0
chr1 14893 15893 Gene2 chr1 398 3987 MEA 0
chr1 14893 15893 Gene2 chr1 398 3988 MEA 0
chr1 39841 39883 Gene1 chr1 398 3989 M3 0
我想要得到的是不同類型的RNAME出現每個基因所以上述結果應該是頻率像
Gene Rname Freq
Gene1 Cha1b 1
Gene1 Ab 1
Gene1 Tia 1
Gene1 MEA 1
Gene1 La 1
Gene1 M3 1
Gene2 Cha1b 3
Gene2 MEA 2
Gene2 M3 1
我試着做了兩個集團與dplyr但我認爲這是沒有意義的,反正它只是給我所有的Rnames的每個基因
頻率210
'base R' option is'subset(as.data.frame(table(dat [c('Gene','Rname')])),Freq!= 0)' – akrun 2015-04-05 12:22:09