Gensim summaries for each row of a dataframe


I am using Gensim to generate a summary for each row of my data. This is what the original dataframe looks like:

df.head()

                                   Example Content
0   Not happy they have just reduced rates for Und...
1   One of the worst banks. I had a very bad exper...
2   Some one in lloyds has signed a form in My nam...
3   Card blocked due to ordering a takeaway from m...
4   There are plenty of better banks than Lloyds.\...

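I am able to apply gensim's summarization to each row. The problem is that I want each row's summary to appear next to its original text, and that is not happening. This is my code: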

from gensim.summarization import summarize

a = []

for i in df['Example Content']:

    i= i + str(". This is second sentence. This is third")             # this is to add two more sentences so that gensim summarizes it. These sentence add no value to summary.
    a = summarize(i, ratio=0.4, split = True)

df['Summary'] = a

This is the output of the code above:

                                     Example Content                                 Summary
0   Not happy they have just reduced rates for Und...       Today I got a new phone and switched my sim an...
1   One of the worst banks. I had a very bad exper...       Today I got a new phone and switched my sim an...
2   Some one in lloyds has signed a form in My nam...       Today I got a new phone and switched my sim an...
3   Card blocked due to ordering a takeaway from m...       Today I got a new phone and switched my sim an...
4   There are plenty of better banks than Lloyds.\...       Today I got a new phone and switched my sim an...

Shown below are the individual summaries that gensim generates for each row:

The 2nd address was a shopping centre and they didnt even give me the name of the business.
I wasn't to know as I through Gallarias Novas was the shop name but that was just the place.
They said that they had issued a new card that I hadn't received and even though they new I was abroad using my card they stopped it anyway.
When my new card did arrive after getting home I now know the reason was that they were making me have a con tactless card whcih I did nto request.

Today I got a new phone and switched my sim and set up my banking apps inc Halifax and Lloyd's.
Halifax worked fine, usual 4 digit code and confirmation call came through and all set up in mins.

How can I get the individual summary that corresponds to each original text and put it into the dataframe?

python pandas for-loop gensim summarization
1 Answer

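You keep overwriting your list: on every iteration `a` is reassigned to the current row's summary, so after the loop it holds only the last one. Replace

a = summarize(i, ratio=0.4, split = True)

with

a.append(summarize(i, ratio=0.4, split = True))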
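To make the difference between reassigning and appending concrete, here is a minimal sketch of the corrected loop. It is an illustration rather than the asker's exact setup: `fake_summarize` is a hypothetical stand-in that returns the first sentence, so the example runs without gensim installed, and `summarize_rows` is a helper name introduced only for this sketch.

```python
from typing import Callable, Iterable, List


def fake_summarize(text: str) -> List[str]:
    """Hypothetical stand-in for summarize(text, ratio=0.4, split=True).

    It returns a one-element list holding the first sentence, mimicking
    the list-of-sentences shape that split=True produces.
    """
    first = text.split(". ")[0].strip()
    return [first]


def summarize_rows(
    texts: Iterable[str],
    summarizer: Callable[[str], List[str]],
) -> List[str]:
    """Return one summary string per input text, in the same order."""
    summaries = []
    for text in texts:
        sentences = summarizer(text)            # sentences for this row only
        summaries.append(" ".join(sentences))   # append instead of reassigning
    return summaries


rows = [
    "Not happy with the new rates. I will move my savings.",
    "Card blocked abroad. Nobody warned me.",
]
result = summarize_rows(rows, fake_summarize)
print(result)
```

With the real dataframe, something like `df['Summary'] = summarize_rows(df['Example Content'], lambda t: summarize(t, ratio=0.4, split=True))` should line each summary up with its row, assuming a gensim 3.x install (the `gensim.summarization` module was removed in gensim 4.0). Joining the sentences is a design choice made here so that each cell holds a single string; appending the list itself, as in the answer, also works.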