2017-04-05 129 views
0

我想在具有多個值的屬性中選擇幾個屬性值至print。在這個例子中在具有多個值的xml-tei屬性中選擇屬性值

#in R 
interpRef <- getNodeSet(doc,"//ns:ref[contains(@ana, 'whatAction')]", ns) 
interpRef_ana <- for (i in 1:length(interpRef)) print(paste(xmlGetAttr(interpRef[[i]],"ana"))) 

我有結果:

[[1]] 
<ref ana="whatAction #ktu1-3_ii_l6b_tḫtṣb #verb.competition #contend">Action belongs to verb competition subcategory contend 
            <stage ana="whatResult #result #defeate_ofOpposition"/></ref> 
[[2]] 
<ref ana="whatAction #ktu1-3_ii_l7_tmḫṣ #verb.emotion #humiliation">Action belongs to verb emotion, subcategory humiliation 
            <stage ana="whatResult #result #defeate_ofOpposition"/></ref> 
[[3]] 
<ref ana="whatAction #ktu1-3_ii_l8_tṣmt #verb.emotion #humiliation">Action belongs to verb emotion, subcategory humiliation</ref> 

#print 
[1] "whatAction #ktu1-3_ii_l6b_tḫtṣb #verb.competition #contend" 
[1] "whatAction #ktu1-3_ii_l7_tmḫṣ #verb.emotion #humiliation" 
[1] "whatAction #ktu1-3_ii_l8_tṣmt #verb.emotion #humiliation" 

我需要@ana屬性只是幾個屬性值,值2和3,例如print

[1] "#ktu1-3_ii_l6b_tḫtṣb #contend" 
[1] "#ktu1-3_ii_l7_tmḫṣ #humiliation" 
[1] "#ktu1-3_ii_l8_tṣmt #humiliation" 

我有做了幾次嘗試,其中之一,但它不工作:

interpRef_ana <- for (i in 1:length(interpRef)) print(paste(xmlGetAttr(interpRef[[i]],"ana",[2:3]))) 

==== XML實例====

每個<ref>是內<interp>,並且每個@ana如下從該詞彙來自預定義的分類標準相同的分層結構。

<interp xml:id="ktu1-3_ii_l6b_int" ana="#ktu1-3_ii_l6b" corresp="#ktu1-3_ii_6b"> 
    <desc> 
    <ref ana="whatAction #ktu1-3_ii_l6b_tḫtṣb #verb.competition #contend" 
            >Action belongs to verb competition subcategory contend 
    <stage ana="whatResult #result #defeate_ofOpposition" /> 
</ref> 
<castList> 
    <castItem> 
    <persName type="character" ana="#whatCharacter #Character #ANT #Female"> 
     <state ana="#whatRole #active" />ʾAnatu 
    </persName> 
    </castItem> 
</castList> 
<view> 
    <placeName ana="#whatContext #battle">battle 
    <location ana="#whatSphere #outside" /> 
    </placeName> 
</view> 
<stage ana="#whatBehavior"> 
    <span ana="#toDestroy #five_dD #rage">Voluntary 
             intentionality, to destroy of her free will, with rage 
             (level five).</span> 
    <span ana="#AffectEntity_and_other">The result of action has 
             an impact on ʾAnatu and others</span> 
    </stage> 
</desc> 
</interp> 
<interp xml:id="ktu1-3_ii_l7_int" ana="#ktu1-3_ii_l7" corresp="#ktu1-3_ii_l7"> 
<desc> 
    <ref ana="whatAction #ktu1-3_ii_l7_tmḫṣ #verb.emotion #humiliation" 
            >Action belongs to verb emotion, subcategory humuliation 
    <stage ana="whatResult #result #defeate_ofOpposition" /> 
</ref> 
<castList> 
    <castItem> 
    <persName type="character" ana="#whatCharacter #Character #ANT #Female"> 
     <state ana="#whatRole #active" />ʾAnatu 
    </persName> 
    <persName type="character" cert="low" ana="#Character #UNK #Unknown"> 
     <state ana="#behav #passive" />People from the West 
    </persName> 
    </castItem> 
</castList> 
<view> 
    <placeName ana="#whatContext #battle">battle 
    <location ana="#whatSphere #outside" />outside her household 
    </placeName> 
</view> 
<stage ana="#whatBehavior"> 
    <span ana="#toDestroy #free #five_dD">Voluntary 
             intentionality, to destroy of her free will, with rage 
             (level five)Five.</span> 
    <span ana="#affectEntity_and_other">The result of action has 
             an impact on ʾAnatu and others</span> 
    </stage> 
</desc> 
</interp> 

==== ====更新

我曾嘗試使用圖書館string,theoriticaly它的作品,我可以選擇我需要的屬性值:

x <- for (i in 1:length(interp)) print((cbind((y=(KTU = (xmlGetAttr(interp[[i]],"ana")))), (z=(verb.category = (xmlGetAttr(interpRef[[i]],"ana"))))))) 
x1 <- print (cbind(word(word(y,-1)),(word(z, -3, -2)))) 
x1 

> x <- for (i in 1:length(interp)) print((cbind((y=(KTU = (xmlGetAttr(interp[[i]],"ana")))), (z=(verb.category = (xmlGetAttr(interpRef[[i]],"ana"))))))) 
[,1]    [,2]               
[1,] "#ktu1-3_ii_l5b-6a" "whatAction #ktu1-3_ii_l5b-6a_tmtḫṣ #verb.competition #contend" 
[,1]    [,2]               
[1,] "#ktu1-3_ii_l6b" "whatAction #ktu1-3_ii_l6b_tḫtṣb #verb.competition #contend" 
[,1]   [,2]              
[1,] "#ktu1-3_ii_l7" "whatAction #ktu1-3_ii_l7_tmḫṣ #verb.emotion #humiliation" 
[,1]   [,2]              
[1,] "#ktu1-3_ii_l8" "whatAction #ktu1-3_ii_l8_tṣmt #verb.emotion #humiliation" 
[,1]     [,2]              
[1,] "ktu1-3_ii_l11b_12a" "whatAction #ktu1-3_ii_l11b-12a_ʿtkt #put_together #action" 
[,1]     [,2]              
[1,] "#ktu1-3_ii_l12b_13a" "whatAction #ktu1-3_ii_l12b-13a_šnst #put_together #action" 
[,1]     [,2]             
[1,] "#ktu1-3_ii_l13b_14a" "whatAction #ktu1-3_ii_l13b-14a_tġlt #action #movement" 
[,1]     [,2]              
[1,] "#ktu1-3_ii_l15b_16a" "whatAction #ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation #action" 
> x 
NULL 
> x1 <- print (cbind(word(word(y,-1)),(word(z, -3, -2)))) 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 
> x1 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 

但只是給了我一個事件的屬性值而不是列表。所以我嘗試添加for (i in 1:length(interp))

x1 <- for (i in 1:length(interp)) print (cbind(word(word(y,-1)),(word(z, -3, -2)))) 

> x1 <- for (i in 1:length(interp)) print (cbind(word(word(y,-1)),(word(z, -3, -2)))) 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 
[,1]     [,2]          
[1,] "#ktu1-3_ii_l15b_16a" "#ktu1-3_ii_l5b_6a_tmtḫṣ #confrontation" 
> x1 

我只是重複同樣出現8次(=實際發生數)

在前進,感謝您的幫助。

+0

請您發表小剪斷你的XML呢?我通常會比你在這裏有點不同。我在下面發佈了一個答案 - 它與您的案例不符 - 但您可能會發現它有幫助。如果您發佈了剪切片段,我很樂意調整它 - xml –

+0

我可能誤解了您的問題,但是對它進行了剪裁。我相信我能幫上忙。 –

+0

謝謝伊恩。我剛剛在== XML ===之後添加了一個示例。 – Vanessa

回答

0

我找到了解決辦法,也許這將是有益的:

listInterp <- list() 
for (i in 1:length(interp)) { 
    print ((cbind((y=(KTU = (xmlGetAttr(interp[[i]],"ana")))), (z=(verb.category = (xmlGetAttr(interpRef[[i]],"ana"))))))) 
    listInterp[[i]] <- (paste(cbind(word(word(y,-1)),(word(z, -3, -2))), collapse=": ")) #to select attribute values 
    } 
listInterp<-(lapply(listInterp,gsub,pattern="#",replacement="")) #to replace # by empty space 
listInterp 

#result 
[[1]] 
[1] "ktu1-3_ii_l5b-6a: ktu1-3_ii_l5b-6a_tmtḫṣ verb.competition" 
[[2]] 
[1] "ktu1-3_ii_l6b: ktu1-3_ii_l6b_tḫtṣb verb.competition" 
[[3]] 
[1] "ktu1-3_ii_l7: ktu1-3_ii_l7_tmḫṣ verb.emotion" 
[[4]] 
[1] "ktu1-3_ii_l8: ktu1-3_ii_l8_tṣmt verb.emotion" 
[...]