从sklearn.svm导入LinearSVC
从sklearn.feature_extraction.text导入CountVectorizer
从sklearn.feature_extraction.text导入TfidfTransformer
从sklearn.metrics导入precision_score
X = data ['Review']] >>
y =数据['类别']] >>
tfidf = TfidfVectorizer(ngram_range =(1,1))
分类器= LinearSVC()
[X_train,X_test,y_train,y_test = train_test_split(X,y,test_size = 0.3)
clf =管道([('tfidf',tfidf),(“ CLF”,分类器)])
clf.fit(X_train,y_train)
y_pred = clf.predict(X_test)
print(classification_report(y_test,y_pred))
accuracy_score(y_test,y_pred)
我应该在哪里更改为cross_val_score?
从sklearn.svm导入sklearn.feature_extraction.text的LinearSVC从sklearn.feature_extraction.text导入CountVectorizer从sklearn.metrics导入TfidfTransformer导入precision_score ...
来自sklearn documentation
使用交叉验证的最简单方法是在估计器和数据集上调用cross_val_score帮助函数。