在Rcpp函数中替换Rcpp :: List的元素是否是内存安全的?

问题描述 投票:3回答:1

我需要将作为参数传递的Rcpp::List对象的元素覆盖到Rcpp函数。我担心的是记忆安全。是否通过重新分配列表的非空元素,我实际上是重新布线指向原始内容的指针,但是从不解除分配存储原始内容的内存?如果是,如何解决这个问题?

我知道我可以很容易地修改一个由Rcpp::NumericVector构成的Rcpp对象(例如.Rcpp::List),因为Rcpp::NumericVector做了一个浅拷贝。然而,这并不满足我的要求,即完全用其他东西替换元素。

下面,我提供了一个C ++代码片段,其中显示了我所指的场景。

#include <Rcpp.h>
using namespace Rcpp;

// [[Rcpp::export]]
void replaceListElement(List l)
{
    std::vector<int> v;
    v.push_back(4);
    v.push_back(5);
    v.push_back(6);
    l["a"] = v;
}

/*** R
l <- list()
l$a <- c(1,2,3)
replaceListElement(l)
print(l)
*/

当通过RStudio中的Rcpp获取时,print(l)命令输出以下内容

$a
[1] 4 5 6

这是期望的结果,所以我的问题仅涉及记忆安全。

c++ r rcpp
1个回答
5
投票

Rcpp::ListVector<VECSXP>,即指向其他向量的指针的向量。如果将新向量分配给此列表中的某个元素,则实际上只需更改指针而不释放指针用于指向的内存。但是,R仍然知道这个内存并通过它的垃圾收集器释放它。我们可以通过一个简单的实验看到这一点,在这个实验中,我使用你的C ++代码,R代码略有变化:

#include <Rcpp.h>
using namespace Rcpp;

// [[Rcpp::export]]
void replaceListElement(List l)
{
  std::vector<int> v;
  v.push_back(4);
  v.push_back(5);
  v.push_back(6);
  l["a"] = v;
}

/*** R
l <- list()
l$a <- runif(1e7)
replaceListElement(l)
print(l)
gc() # optional
*/

这里使用更大的矢量来使效果更加突出。如果我现在使用R -d valgrind -e 'Rcpp::sourceCpp("<filename>")',我会通过gc()调用得到以下结果

==13827==
==13827== HEAP SUMMARY:
==13827==     in use at exit: 48,125,775 bytes in 9,425 blocks
==13827==   total heap usage: 34,139 allocs, 24,714 frees, 173,261,724 bytes allocated
==13827==
==13827== LEAK SUMMARY:
==13827==    definitely lost: 0 bytes in 0 blocks
==13827==    indirectly lost: 0 bytes in 0 blocks
==13827==      possibly lost: 0 bytes in 0 blocks
==13827==    still reachable: 48,125,775 bytes in 9,425 blocks
==13827==                       of which reachable via heuristic:
==13827==                         newarray           : 4,264 bytes in 1 blocks
==13827==         suppressed: 0 bytes in 0 blocks
==13827== Rerun with --leak-check=full to see details of leaked memory
==13827==
==13827== For counts of detected and suppressed errors, rerun with: -v
==13827== ERROR SUMMARY: 0 errors from 0 contexts (suppressed: 0 from 0)

没有gc()电话:

==13761==
==13761== HEAP SUMMARY:
==13761==     in use at exit: 132,713,314 bytes in 10,009 blocks
==13761==   total heap usage: 34,086 allocs, 24,077 frees, 173,212,886 bytes allocated
==13761==
==13761== LEAK SUMMARY:
==13761==    definitely lost: 0 bytes in 0 blocks
==13761==    indirectly lost: 0 bytes in 0 blocks
==13761==      possibly lost: 0 bytes in 0 blocks
==13761==    still reachable: 132,713,314 bytes in 10,009 blocks
==13761==                       of which reachable via heuristic:
==13761==                         newarray           : 4,264 bytes in 1 blocks
==13761==         suppressed: 0 bytes in 0 blocks
==13761== Rerun with --leak-check=full to see details of leaked memory
==13761==
==13761== For counts of detected and suppressed errors, rerun with: -v
==13761== ERROR SUMMARY: 0 errors from 0 contexts (suppressed: 0 from 0)

所以在这两种情况下,valgrind都没有检测到任何内存泄漏。可达到的存储器的数量相差大约8×10 ^ 7字节,即l$a中原始矢量的大小。这表明R确实知道原始向量并在被告知这样做时释放它,但是当R自己决定运行垃圾收集器时也会发生这种情况。

© www.soinside.com 2019 - 2024. All rights reserved.