2015-12-21 80 views
0

我有一個數據框,比如說集羣。我知道我們能得到的彙總統計如下:R獲取R中的彙總統計值

summary(cluster) 

這給類似:

cluster  client  domain     
Min. :4 RRr_IL:40449 P621:409 
1st Qu.:4        
Median :4        
Mean :4        
3rd Qu.:4        
Max. :4        

shift_end  shift_length_avg patients_seen_cnt 
Min. : 0.00 Min. :6.000 Min. : 1.000 
1st Qu.:13.00 1st Qu.:7.000 1st Qu.: 4.000 
Median :15.00 Median :8.000 Median : 6.000 
Mean :13.93 Mean :8.128 Mean : 9.107 
3rd Qu.:17.00 3rd Qu.:9.000 3rd Qu.: 11.000 
Max. :23.00 Max. :9.000 Max. :273.000 

我如何解析這個總結。這似乎是一個字符數組。

例如,如果我想要從總結中獲得'shift_length_avg'的中值,我該如何做?

+0

請參閱'str(summary(cluster))'雖然爲什麼不是'median(cluster $ shift_length_avg)'? – daroczig

+0

這可能工作'摘要(集羣)[3,5]' – DatamineR

回答

0

嘗試這種情況:

Med<-median(cluster$shift_length_avg, na.rm = FALSE) 
1

summary結果是table它是一個字符矩陣。

summary(iris) 
# Sepal.Length Sepal.Width  Petal.Length Petal.Width   Species 
# Min. :4.300 Min. :2.000 Min. :1.000 Min. :0.100 setosa :50 
# 1st Qu.:5.100 1st Qu.:2.800 1st Qu.:1.600 1st Qu.:0.300 versicolor:50 
# Median :5.800 Median :3.000 Median :4.350 Median :1.300 virginica :50 
# Mean :5.843 Mean :3.057 Mean :3.758 Mean :1.199     
# 3rd Qu.:6.400 3rd Qu.:3.300 3rd Qu.:5.100 3rd Qu.:1.800     
# Max. :7.900 Max. :4.400 Max. :6.900 Max. :2.500     

str(summary(iris)) 
# 'table' chr [1:6, 1:5] "Min. :4.300 " "1st Qu.:5.100 " ... 
# - attr(*, "dimnames")=List of 2 
# ..$ : chr [1:6] "" "" "" "" ... 
# ..$ : chr [1:5] " Sepal.Length" " Sepal.Width" " Petal.Length" " Petal.Width" ... 

summary(iris)[, 3] 

# "Min. :1.000 " "1st Qu.:1.600 " "Median :4.350 " "Mean :3.758 " 
# "3rd Qu.:5.100 " "Max. :6.900 " 

colnames(summary(iris)) 
# [1] " Sepal.Length" " Sepal.Width" " Petal.Length" " Petal.Width" "  Species" 

summary(iris)[, " Petal.Length"] 

# "Min. :1.000 " "1st Qu.:1.600 " "Median :4.350 " "Mean :3.758 " 
# "3rd Qu.:5.100 " "Max. :6.900 "