我得到了旅遊交易數據集是這樣的(約56萬人次):數據幀1如何使用R提供每週簡介?
ID START TIME DATE ORIGIN DESTINATION DAY
1005 9.10 2012-01-02 A B Monday
1005 18.15 2012-01-02 B A Monday
1005 9.05 2012-01-08 A B Sunday
1005 17.05 2012-01-08 B A Sunday
1010 8.00 2012-01-09 A C Monday
1010 12.00 2012-01-09 C A Monday
1013 13.15 2012-01-10 D E Tuesday
1013 15.30 2012-01-10 E G Tuesday
1013 9.06 2012-01-12 D E Thursday
... ... 2012-..-.. . . ...
和ID指數像這樣(約1986年的ID):數據幀2
ID
1005
1010
1013
1015
1030
1034
1036
1031
1040
...
我想創建一個基於這兩個數據框的每週旅行概況。我不知道我是否是對的,但我想這些代碼:
weekday = c("Sunday", "Monday","Tuesday", "Wednesday", "Thursday", "Friday", "Saturday")
br = seq(0,23,by=1)
ranges = paste(head(br,-1), br[-1], sep="_")
for (i in dataframe2$ID) {
for (n in weekday){
x= filter(dataframe1,dataframe1$ID %in% i & dataframe1$DAY %in% n)
freq = hist(as.numeric(x), br, include.lowest=TRUE, plot=FALSE)
df = as.data.frame(t(data.frame(frequency = freq$counts)))
df$i = i
df$n = n
colnames(df) = c(as.character(ranges),"ID","Day")
write.table(head(df),file="testdata1.csv", append=TRUE,sep=",",col.names=FALSE,row.names=FALSE)
}
}
我想和包含其每週的行程頻率的CSV表來結束。我也想問問是否有簡單的方法來簡化這項任務。
ID 0_1 1_2 2_3 3_4 4_5 5_6 6_7 7_8 8_9 9_10 10_11 11_12 12_13 13_14 14_15 15_16 16_17 17_18 18_19 19_20 20_21 21_22 22_23 Day
1005 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 Sunday
1005 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 Monday
1005 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 Tuesday
1005 Wednesday
1005 Thursday
1005 Friday
1005 Saturday
1010 Sunday
1010
1010
1010
1010
1010
1010
到底
我想製作一個圖表是這樣的: enter image description here
它的更好,如果你'dput'您的數據爲您圖中的數據總結 –