df <- read.csv ('https://raw.githubusercontent.com/ulklc/covid1-timeseries/master/countryReport/raw/rawReport.csv',
stringsAsFactors = FALSE)
利用这个数据集,我怎样才能找出5个国家中病例最多的患者人数?
然后,如何将患者人数画成线图?
要找到病人最多的国家
library(tidyverse)
df %>%
group_by(countryName) %>%
summarise(total = max(confirmed)) %>%
arrange(desc(total)) %>%
top_n(5) -> countries5
countries5
## A tibble: 5 x 2
# countryName total
# <chr> <int>
#1 United States 1768116
#2 Brazil 438812
#3 Russia 379051
#4 Spain 284986
#5 United Kingdom 269127
绘制其 confirmed
案演变。
countries5 %>%
left_join(df, by = 'countryName') %>%
select(countryCode, confirmed, day) %>%
mutate(day = as.Date(day)) %>%
filter(confirmed > 0) %>%
ggplot(aes(day, confirmed, color = countryCode)) +
geom_line()