我想对cut
R
函数生成的关卡进行一些操作。我希望在我的MWE中有exp(Labs)
。
set.seed(12345)
Y <- rnorm(n = 50, mean = 500, sd = 1)
Y1 <- cut(log(Y), 5)
Labs <- levels(Y1)
Labs
[1] "(6.21,6.212]" "(6.212,6.213]" "(6.213,6.215]" "(6.215,6.217]" "(6.217,6.219]"
exp(cbind(lower = as.numeric( sub("\\((.+),.*", "\\1", Labs) ),
upper = as.numeric( sub("[^,]*,([^]]*)\\]", "\\1", Labs) )))
lower upper
[1,] 497.7013 498.6976
[2,] 498.6976 499.1966
[3,] 499.1966 500.1960
[4,] 500.1960 501.1974
[5,] 501.1974 502.2008
题
我怎么能在这里得到exp(Labs)
?
期望的输出
"(497.7013, 498.6976]" "(498.6976, 499.1966]" "(499.1966, 500.1960]" "(500.1960, 501.1974]" "(501.1974, 502.2008]"
编辑
根据@akrun回答:
Labs1 <- c("(-2.32,0.99]", "(0.99,4.28]", "(4.28,7.58]", "(7.58,10.9]", "(10.9,14.2]")
Labs1
[1] "(-2.32,0.99]" "(0.99,4.28]" "(4.28,7.58]" "(7.58,10.9]" "(10.9,14.2]"
exp(cbind(lower = as.numeric( sub("\\((.+),.*", "\\1", Labs1) ),
upper = as.numeric( sub("[^,]*,([^]]*)\\]", "\\1", Labs1) )))
lower upper
[1,] 9.827359e-02 2.691234e+00
[2,] 2.691234e+00 7.224044e+01
[3,] 7.224044e+01 1.958629e+03
[4,] 1.958629e+03 5.417636e+04
[5,] 5.417636e+04 1.468864e+06
gsubfn('([0-9.]+)', ~round(exp(as.numeric(x)),4), Labs1)
[1] "(-10.1757,2.6912]" "(2.6912,72.2404]" "(72.2404,1958.629]"
[4] "(1958.629,54176.3638]" "(54176.3638,1468864.1897]"
res1 <- exp(as.data.frame(t(sapply(strsplit(Labs1, '[^0-9.]+'),
function(x) as.numeric(x[-1])))))
sprintf('(%s]', do.call(paste, c(round(res1,4), sep=", ")))
[1] "(10.1757, 2.6912]" "(2.6912, 72.2404]" "(72.2404, 1958.629]"
[4] "(1958.629, 54176.3638]" "(54176.3638, 1468864.1897]"
紧凑的选择是使用gsubfn
。我们在([0-9.]+)
参数中将数字元素与点(pattern
)匹配,并且我们将匹配的元素替换为首先将其转换为'numeric',取exp
和round
。
library(gsubfn)
gsubfn('([-0-9.]+)', ~round(exp(as.numeric(x)),4), Labs)
#[1] "(497.7013,498.6976]" "(498.6976,499.1966]" "(499.1966,500.196]"
#[4] "(500.196,501.1974]" "(501.1974,502.2008]"
注意:这取决于我们使用的模式。
或者避免两次调用sub
的另一种选择是strsplit
。我们split
非数字元素。输出将是一个list
所以我们可以使用lapply/sapply
循环列表元素,转换为numeric
类,并创建一个包含两列的'data.frame'。
res <- exp(as.data.frame(t(sapply(strsplit(Labs, '[^-0-9.]+'),
function(x) as.numeric(x[-1])))))
关于根据OP的代码获得预期输出。我在原始代码中将cbind
更改为data.frame
,以便可以使用do.call
。
res <- exp(data.frame(lower = as.numeric( sub("\\((.+),.*", "\\1", Labs) ),
upper = as.numeric( sub("[^,]*,([^]]*)\\]", "\\1", Labs) )))
我们paste
'res'(do.call(paste0
)的行元素,然后添加额外的括号与sprintf
或另一个paste
sprintf('(%s]', do.call(paste, c(round(res,4), sep=", ")))
#[1] "(497.7013, 498.6976]" "(498.6976, 499.1966]" "(499.1966, 500.196]"
#[4] "(500.196, 501.1974]" "(501.1974, 502.2008]"
使用'Labs1'检查输出
gsubfn('([-0-9.]+)', ~round(exp(as.numeric(x)),4), Labs1)
#[1] "(0.0983,2.6912]" "(2.6912,72.2404]"
#[3] "(72.2404,1958.629]" "(1958.629,54176.3638]"
#[5] "(54176.3638,1468864.1897]"
exp(as.data.frame(t(sapply(strsplit(Labs1, '[^-0-9.]+'),
function(x) as.numeric(x[-1])))))
# V1 V2
#1 9.827359e-02 2.691234e+00
#2 2.691234e+00 7.224044e+01
#3 7.224044e+01 1.958629e+03
#4 1.958629e+03 5.417636e+04
#5 5.417636e+04 1.468864e+06