使用numpy.einsum计算数据的协方差矩阵

问题描述 投票:0回答:2

我的目标是使用

numpy.einsum
计算一组数据的协方差矩阵。举个例子吧

example_data = np.array([0.2, 0.3], [0.1, 0.2]])

以下是我尝试过的代码:

import numpy as np

d = example_data[0].shape[1]
mu = np.mean(example_data, axis=0)
data = np.reshape(example_data,(len(example_data),d,1))
mu = np.tile(mu,len(example_data))
mu = np.reshape(mu,(len(example_data),d,1))
d_to_mean = data-mu 

covariance_matrix = np.einsum('ijk,kji->ij', d_to_mean, np.transpose(d_to_mean)) 
#I don't know how to set the subscripts correctly

任何如何使这种方法可行的建议都将受到赞赏!

python numpy covariance numpy-einsum
2个回答
2
投票

基于协方差矩阵的定义,可以很容易地解决该任务

tmp = np.random.rand(5,3) # 5 corresponds to 5 observations, 3 corresponds to 3 variables
tmp_mean = np.mean(tmp,axis=0)[:,None]
tmp_centered = tmp.T - tmp_mean
cov = (tmp_centered @ tmp_centered.T) / (5-1)

如果你需要

einsum
无论如何

cov_ein = np.einsum('ij,jk->ik',tmp_centered,tmp_centered.T) / (5-1)

1
投票

您可以通过以下方式避免其他答案中的矩阵转置:

N, D = (5, 3)
tmp = np.random.rand(N, D)
tmp_centered = tmp - np.mean(tmp, axis=0)
cov = np.einsum('ji,jk->ik', tmp_centered , tmp_centered) / (N-1)
© www.soinside.com 2019 - 2024. All rights reserved.