如何向Word2vec gensim手动添加单词和向量?

问题描述 投票:0回答:1

比方说,word2vec.model是我训练有素的word2vec模型。当出现单词外单词(oov_word)时,我使用compute_vec(oov_word)方法计算向量vec。现在,我想将oov_word及其对应的向量vec添加/追加到我已经训练的模型word2vec.model

我已经检查了以下链接。但是他们没有回答我的问题。

Combining/adding vectors from different word2vec models

https://datascience.stackexchange.com/questions/49431/how-to-train-an-existing-word2vec-gensim-model-on-new-words

https://radimrehurek.com/gensim/models/keyedvectors.html#gensim.models.keyedvectors.BaseKeyedVectors.add

gensim word2vec
1个回答
0
投票
from gensim.models.keyedvectors import WordEmbeddingsKeyedVectors
vector_length = 100
kv = WordEmbeddingsKeyedVectors(vector_length)

# wordList - list of words
# vectorList - list of the vector corresponding to the words

kv.add(wordList, vectorList)

kv.most_similar(word1) # gives the list of words similar to word1
© www.soinside.com 2019 - 2024. All rights reserved.