如何添加为每个现有行添加新行的列

问题描述 投票:3回答:3

我有两个数据框,df1和df2,并希望将它们合并为df3,如下所示。我确信有一种简单的方法可以做到这一点,但我一直无法找到一个直接的解决方案。

df1 = data.frame(id = c(1,2), Name = c('Bob', 'Sue'), stringsAsFactors = F)
id | Name 
==========
1 |   Bob 
2 |   Sue 

df2 = data.frame(id = c(1,2,3,4), year = c(2001, 2002, 2003, 2004))
id | year
==========
1 |   2001 
2 |   2002 
3 |   2003 
4 |   2004

df3 =
id | Name | year
=================
1 |   Bob | 2001
2 |   Bob | 2002
3 |   Bob | 2003
4 |   Bob | 2004
5 |   Sue | 2001
6 |   Sue | 2002
7 |   Sue | 2003
8 |   Sue | 2004
r
3个回答
6
投票

使用merge(df1, df2, by=NULL)作笛卡尔产品请参见:https://www.rdocumentation.org/packages/base/versions/3.5.3/topics/merge


4
投票

我们可以使用crossing

library(dplyr)
library(tidyr)
crossing(df1, df2) %>%
   transmute(id = row_number(), Name, year)
#  id Name year
#1  1  Bob 2001
#2  2  Bob 2002
#3  3  Bob 2003
#4  4  Bob 2004
#5  5  Sue 2001
#6  6  Sue 2002
#7  7  Sue 2003
#8  8  Sue 2004

输出中的“id”列似乎与数据集中的初始“id”列无关。在这种情况下,执行没有'id'列的crossing,然后创建'id'作为row_number()

crossing(df1[-1], df2[-1]) %>% 
        mutate(id = row_number())

data

df1 <- structure(list(id = 1:2, Name = c("Bob", "Sue")), 
  class = "data.frame", row.names = c(NA, -2L))

df2 <- structure(list(id = 1:4, year = 2001:2004), class = "data.frame",
 row.names = c(NA, -4L))

2
投票

也许你可以使用:expand.grid(Name = df1$Name, year = df2$year)

这给了:

  Name year
1  Bob 2001
2  Sue 2001
3  Bob 2002
4  Sue 2002
5  Bob 2003
6  Sue 2003
7  Bob 2004
8  Sue 2004
© www.soinside.com 2019 - 2024. All rights reserved.