我写了下面的代码,但是我想从statsmodel进行总结,有人可以帮我吗?
谢谢。
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
X = df[['age_over_65', 'female_perc', 'foreign_born_perc','bachelors_perc', 'household_income']]
y = df['winner']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33, random_state=42)
logmodel = LogisticRegression(solver='lbfgs')
logmodel.fit(X_train,y_train)
model = logmodel.predict(X_test)
Sci-Kit学习专注于机器学习性能,而不是统计推断。
如果要查看logit模型的摘要结果,最好使用statsmodels
下面的示例代码。
import statsmodels.formula.api as smf
model = smf.logit(formula=f"{target} ~ {' + '.join(XVARS)}", data=xtrain_logit)
logmodel = model.fit()
logmodel.summary2()
#to save in a text file.
with open('logit_results.txt'), 'w') as text_file:
print(logmodel.summary2(), file=text_file)