2017-08-25 38 views
1

一個簡單的例子:不能理解seqefsub的輸出本功能

t<-data.frame(id=rep(1:3, each=3), time=c(0,2,4,0,4,2,0,2,2), event=rep(c("A", "B", "C"), 3)) tmp<-seqecreate(t) seqefsub(tmp, min.support=2)

我越來越:

 Subsequence Support Count 
     1   (A)  1  3 
     2  (A)-(B)  1  3 
     3  (A)-(C)  1  3 
     4   (B)  1  3 
     5   (C)  1  3 

我不明白爲什麼(A) - (B)和( A) - (C)計數3. 我有三個序列:

(A)-2-(B)-2-(C) 
    (A)-2-(C)-2-(B) 
    (A)-2-(B,C) 

它應該是:

 Subsequence Support Count 
     1   (A)  1  3 
     2   (B)  1  3 
     3   (C)  1  3 
     4  (A)-(B)  0.66 2 
     5  (A)-(C)  0.66 2 

非常感謝您的任何想法。

回答

1

你看到的(A) - (B)計數爲3的原因是因爲seqefsub全文(A) - <anything can go here> - (B)序列的(A) - (B)一個實例,你有三個這樣的實例。

如果你想看到(A) - (B)連續出現,只有連續的子序列,嘗試:

>seqefsub(tmp, min.support=2, constraint = seqeconstraint(window.size = 2)) 
    Subsequence Support Count 
1   (A) 1.0000000  3 
2   (B) 1.0000000  3 
3   (C) 1.0000000  3 
4  (A)-(B) 0.6666667  2 
5  (A)-(C) 0.6666667  2 

Computed on 3 event sequences 
    Constraint Value 
    window.size  2 
count.method COBJ