我在使用以下数据时遇到问题(df)
1 TeamA 1
2 TeamB 2
3 TeamC 3
4 TeamA 4
5 TeamB 5
6 TeamC 6
7 TeamA 7
8 TeamB 8
9 TeamD 9
10 TeamD 10
我想添加一个粘贴团队结果的列,所以它看起来像这样。所以新专栏看起来像这样。由于我的数据不小,for循环不会这样做。
1 TeamA 1 1-4-7
2 TeamB 2 2-5-8
3 TeamC 3 3-6
4 TeamA 4 1-4-7
5 TeamB 5 2-5-8
6 TeamC 6 3-6
7 TeamA 7 1-4-7
8 TeamB 8 2-5-8
9 TeamD 9 9-10
10 TeamD 10 9-10
在原始数据中,没有我可以使用的团队模式。我认为它必须与dplyr的group_by
一起工作,但我无法做到。
像这样使用ave
:
transform(DF, new = ave(No, Team, FUN = function(x) paste(x, collapse = "-")))
赠送:
Team No new
1 TeamA 1 1-4-7
2 TeamB 2 2-5-8
3 TeamC 3 3-6
4 TeamA 4 1-4-7
5 TeamB 5 2-5-8
6 TeamC 6 3-6
7 TeamA 7 1-4-7
8 TeamB 8 2-5-8
9 TeamD 9 9-10
10 TeamD 10 9-10
或使用dplyr:
library(dplyr)
DF %>%
group_by(Team) %>%
mutate(new = paste(No, collapse = "-")) %>%
ungroup
可重复形式的输入DF
是:
Lines <- "
TeamA 1
TeamB 2
TeamC 3
TeamA 4
TeamB 5
TeamC 6
TeamA 7
TeamB 8
TeamD 9
TeamD 10"
DF <- read.table(text = Lines, as.is = TRUE, col.names = c("Team", "No"))
我们可以aggregate
,然后merge
到原来的data.frame
并排序:
df <- read.table(text="1 TeamA 1
2 TeamB 2
3 TeamC 3
4 TeamA 4
5 TeamB 5
6 TeamC 6
7 TeamA 7
8 TeamB 8
9 TeamD 9
10 TeamD 10",h=F,strin=F)
aggregated_scores <- aggregate(V3 ~ V2,df,paste,collapse='-')
new_df <- merge(df[-3],aggregated_scores)
new_df <- new_df[order(new_df$V1),]
# V2 V1 V3
# 1 TeamA 1 1-4-7
# 4 TeamB 2 2-5-8
# 8 TeamC 3 3-6
# 3 TeamA 4 1-4-7
# 5 TeamB 5 2-5-8
# 7 TeamC 6 3-6
# 2 TeamA 7 1-4-7
# 6 TeamB 8 2-5-8
# 9 TeamD 9 9-10
# 10 TeamD 10 9-10