我有一个多项式 NB() 的模型:
text_clf_NB = Pipeline([('vect', CountVectorizer()),
('tfidf', TfidfTransformer()),
('clf', MultinomialNB()),
])
text_clf_NB.fit(Train_X_NB, Train_Y_NB)
我把它保存到.pickle
pickle.dump(text_clf_NB, open("NB_classification.pickle", "wb"))
在另一种情况下我加载这个模型:
clf = pickle.load(open("NB_classification.pickle", "rb"))
你能帮我吗,我怎样才能得到火车数据的稀疏矩阵?我的意思是我想在 clf 的 TfidfTransformer 之后获得 Train_X_NB 的值?
像这样尝试:
import pickle
# Load the model
clf = pickle.load(open("NB_classification.pickle", "rb"))
# Access the CountVectorizer and TfidfTransformer
count_vectorizer = clf.named_steps['vect']
tfidf_transformer = clf.named_steps['tfidf']
# Transform the training data
Train_X_counts = count_vectorizer.transform(Train_X_NB)
Train_X_tfidf = tfidf_transformer.transform(Train_X_counts)