如何使用Python和Numpy计算r平方？

Question

我正在使用Python和Numpy计算任意度数的最佳拟合多项式。我传递了x值，y值以及要拟合的多项式的阶数（线性，二次等）的列表。

这很有效，但我还想计算r（相关系数）和r-平方（确定系数）。我正在将我的结果与Excel的最佳拟合趋势线功能及其计算的r平方值进行比较。使用这个，我知道我正在为线性最佳拟合（度等于1）正确计算r平方。但是，我的函数不适用于次数大于1的多项式。

Excel可以做到这一点。如何使用Numpy计算高阶多项式的r平方？

这是我的职能：

import numpy

# Polynomial Regression
def polyfit(x, y, degree):
    results = {}

    coeffs = numpy.polyfit(x, y, degree)
     # Polynomial Coefficients
    results['polynomial'] = coeffs.tolist()

    correlation = numpy.corrcoef(x, y)[0,1]

     # r
    results['correlation'] = correlation
     # r-squared
    results['determination'] = correlation**2

    return results

Answer 1

import numpy as np from scipy import stats import statsmodels.api as sm import math n=1000 x = np.random.rand(1000)*10 x.sort() y = 10 * x + (5+np.random.randn(1000)*10-5) x_list = list(x) y_list = list(y) def get_r2_numpy(x, y): slope, intercept = np.polyfit(x, y, 1) r_squared = 1 - (sum((y - (slope * x + intercept))**2) / ((len(y) - 1) * np.var(y, ddof=1))) return r_squared def get_r2_scipy(x, y): _, _, r_value, _, _ = stats.linregress(x, y) return r_value**2 def get_r2_statsmodels(x, y): return sm.OLS(y, sm.add_constant(x)).fit().rsquared def get_r2_python(x_list, y_list): n = len(x) x_bar = sum(x_list)/n y_bar = sum(y_list)/n x_std = math.sqrt(sum([(xi-x_bar)**2 for xi in x_list])/(n-1)) y_std = math.sqrt(sum([(yi-y_bar)**2 for yi in y_list])/(n-1)) zx = [(xi-x_bar)/x_std for xi in x_list] zy = [(yi-y_bar)/y_std for yi in y_list] r = sum(zxi*zyi for zxi, zyi in zip(zx, zy))/(n-1) return r**2 def get_r2_numpy_manual(x, y): zx = (x-np.mean(x))/np.std(x, ddof=1) zy = (y-np.mean(y))/np.std(y, ddof=1) r = np.sum(zx*zy)/(len(x)-1) return r**2 def get_r2_numpy_corrcoef(x, y): return np.corrcoef(x, y)[0, 1]**2 print('Python') %timeit get_r2_python(x_list, y_list) print('Numpy polyfit') %timeit get_r2_numpy(x, y) print('Numpy Manual') %timeit get_r2_numpy_manual(x, y) print('Numpy corrcoef') %timeit get_r2_numpy_corrcoef(x, y) print('Scipy') %timeit get_r2_scipy(x, y) print('Statsmodels') %timeit get_r2_statsmodels(x, y)上的维基百科文章建议将其用于一般模型拟合，而不仅仅是线性回归。

Answer 2

这里是使用Python和Numpy计算weighted

Answer 3

R平方是仅适用于线性回归的统计量。

本质上，它衡量的是线性回归可以解释数据的多少变化。

Answer 4

来自scipy.stats.linregress源。他们使用平均平方和方法。

Here

如何使用Python和Numpy计算r平方？

问题描述投票：83回答：9

9个回答

最新问题

如何使用Python和Numpy计算r平方？

问题描述 投票：83回答：9

9个回答

最新问题

问题描述投票：83回答：9