从 .pickle 获取数据

问题描述 投票:0回答:1

我有一个多项式 NB() 的模型:

text_clf_NB = Pipeline([('vect', CountVectorizer()),
                      ('tfidf', TfidfTransformer()),
                      ('clf', MultinomialNB()),
                      ])
   
text_clf_NB.fit(Train_X_NB, Train_Y_NB)

我把它保存到.pickle

pickle.dump(text_clf_NB, open("NB_classification.pickle", "wb"))

在另一种情况下我加载这个模型:

clf = pickle.load(open("NB_classification.pickle", "rb"))

你能帮我吗,我怎样才能得到火车数据的稀疏矩阵?我的意思是我想在 clf 的 TfidfTransformer 之后获得 Train_X_NB 的值?

python model pickle text-classification tf-idf
1个回答
0
投票

像这样尝试:

import pickle

# Load the model
clf = pickle.load(open("NB_classification.pickle", "rb"))

# Access the CountVectorizer and TfidfTransformer
count_vectorizer = clf.named_steps['vect']
tfidf_transformer = clf.named_steps['tfidf']

# Transform the training data
Train_X_counts = count_vectorizer.transform(Train_X_NB)
Train_X_tfidf = tfidf_transformer.transform(Train_X_counts)
© www.soinside.com 2019 - 2024. All rights reserved.