基于多列计算频率

问题描述 投票:0回答:1

我有多个带有 VC 资金的专栏。

df <- data.frame(Company = c("X", "Y","Z"),
                 Valuation = c("10","20","30"),
                 VC.1 = c("Zeev Ventures","Bedrock Capital","Activant Capital"),
                 VC.2 = c("Bedrock Capital","Activant Capital","Zeev Ventures"))

我需要计算每个风险投资基金的不同统计数据。例如

VC 总估值 总投资
泽夫风险投资 40 2
基岩资本 40 2
活力资本 50 2
r dplyr
1个回答
0
投票

您可以旋转

VC.1
VC.2
列,并使用常用的
dplyr
函数获取结果:

library(tidyr)
library(dplyr)
pivot_longer(df, cols = c("VC.1", "VC.2"),
             values_to = "VC") %>%
  group_by(VC) %>%
  summarise(TV = sum(as.numeric(Valuation)), TI = n())

# A tibble: 3 × 3
  VC                  TV    TI
  <chr>            <dbl> <int>
1 Activant Capital    50     2
2 Bedrock Capital     30     2
3 Zeev Ventures       40     2
© www.soinside.com 2019 - 2024. All rights reserved.