R ggplot散點圖顏色多列

我正在嘗試使用多個字段創建與ggplot的散點圖。我已閱讀了關於這些散點圖和字段的着色，但想知道如何爲ggplot2movies數據集執行此操作？我想顏色基礎上，流派，但這些流派都分手了：R ggplot散點圖顏色多列

> movies <- ggplot2movies::movies 
> head(movies) 
      title year length budget rating votes r1 r2 r3 r4 r5 r6 r7 r8 r9 r10 mpaa Action Animation Comedy Drama Documentary Romance Short 
        <chr> <int> <dbl> <int> <dbl> <int> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <chr> <int>  <int> <int> <int>  <int> <int> <int> 
1      $ 1971 121  NA 6.4 348 4.5 4.5 4.5 4.5 14.5 24.5 24.5 14.5 4.5 4.5   0   0  1  1   0  0  0 
2  $1000 a Touchdown 1939  71  NA 6.0 20 0.0 14.5 4.5 24.5 14.5 14.5 14.5 4.5 4.5 14.5   0   0  1  0   0  0  0 
3 $21 a Day Once a Month 1941  7  NA 8.2  5 0.0 0.0 0.0 0.0 0.0 24.5 0.0 44.5 24.5 24.5   0   1  0  0   0  0  1 
4     $40,000 1996  70  NA 8.2  6 14.5 0.0 0.0 0.0 0.0 0.0 0.0 0.0 34.5 45.5   0   0  1  0   0  0  0 
5 $50,000 Climax Show, The 1975  71  NA 3.4 17 24.5 4.5 0.0 14.5 14.5 4.5 0.0 0.0 0.0 24.5   0   0  0  0   0  0  0 
6     $pent 2000  91  NA 4.3 45 4.5 4.5 4.5 14.5 14.5 14.5 4.5 4.5 14.5 14.5   0   0  0  1   0  0  0

什麼是解決這個（基於流派的顏色）的最好方法？所有的幫助真的很感謝！

來源

2016-12-04 dnsko

我猜你將不得不收拾數據（寬長格式）。也許用'tidyr :: gather（）'。 – hrbrmstr

正如@ hrbrmstr所述，您需要將數據從寬轉換爲長。您可以使用tidyr::gather()與dplyr::filter()一起來實現此目的。這條產業鏈：

彙集了來自行動短的名稱和值入列genre和flag。這將多列（寬）移動到鍵值對（長）。
使用過濾器刪除genre（那些標誌== 0）的多餘值。
商店在plot_data

其餘代碼所產生的數據幀是length VS rating簡單ggplot2散點圖。

library(dplyr) 
library(tidyr) 
library(ggplot2) 
library(ggplot2movies) 

plot_data <- movies %>% 
    gather(genre, flag, Action:Short) %>% 
    filter(flag != 0) 

ggplot(plot_data, aes(x = rating, y = length)) + 
    geom_point(aes(color = genre), alpha = 0.4)

來源

2016-12-04 15:03:50

非常有幫助，而我正在尋找！謝謝 – dnsko

R ggplot散點圖顏色多列

回答

相關問題