How to do sentiment analysis of headlines with TextBlob and Python

Question

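I want to compute polarity and subjectivity for some headlines I have. My code runs fine and raises no errors, but for some rows it returns 0.00000 for both polarity and subjectivity. Do you know why?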

You can download the data sheet here:

https://www.sendspace.com/file/e8w4tw

Am I doing something wrong? Here is the code:

import pandas as pd
from textblob import TextBlob

pd.set_option('display.max_rows', 500)
pd.set_option('display.max_columns', 500)
pd.set_option('display.width', 1000)

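# Excel files carry their own encoding, so read_excel needs no encoding
# argument (recent pandas versions reject it).
df = pd.read_excel('coca cola news.xlsx')

# Drop rows with missing values, then exact duplicate rows.
df = df.dropna().reset_index(drop=True)
df = df.drop_duplicates().reset_index(drop=True)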
print(df)

# TextBlob polarity lies in [-1, 1] and subjectivity in [0, 1].
df['Headline Sentiment'] = df['Headline'].apply(lambda text: TextBlob(text).sentiment.polarity).round(4)
df['Headline Subjectivity'] = df['Headline'].apply(lambda text: TextBlob(text).sentiment.subjectivity).round(4)

df['Paragraph Sentiment'] = df['Paragraph'].apply(lambda text: TextBlob(text).sentiment.polarity).round(4)
df['Paragraph Subjectivity'] = df['Paragraph'].apply(lambda text: TextBlob(text).sentiment.subjectivity).round(4)

print(df)

print(df[df.columns[-4:]])

I mean, I know a score can be 0, but I get 0.0000 in 40%-50% of the rows, which is a lot, and there is not even a 0.00001, which seems strange to me.

Can you help me?

Answer 1

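This can happen. Try the polarity method from polyglot and compare the results: https://polyglot.readthedocs.io/en/latest/Installation.html

First, though, you should do some preprocessing, for example:

Remove stop words
Remove numbers, HTML links, special characters, etc.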
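
The preprocessing steps listed above could be sketched roughly as follows. This is only a minimal illustration using the standard library: the function name clean_headline, the regular expressions, and the tiny stop-word list are all assumptions made for the example, not part of TextBlob or of the original code. Note that TextBlob's default analyzer scores text from a word lexicon, so a headline containing no words it recognizes comes out as exactly 0.0 for both values, and cleaning alone may not change that.

```python
import re

# Hypothetical, deliberately tiny stop-word list for illustration only;
# a real project would likely use a fuller list from an NLP library.
STOP_WORDS = {"a", "an", "the", "of", "in", "on", "to", "for", "and", "is", "are"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")   # web links
TAG_RE = re.compile(r"<[^>]+>")                 # HTML tags
NON_LETTER_RE = re.compile(r"[^A-Za-z\s]")      # digits and special characters


def clean_headline(text):
    """Strip links, HTML tags, digits, punctuation and stop words from text."""
    text = URL_RE.sub(" ", text)
    text = TAG_RE.sub(" ", text)
    text = NON_LETTER_RE.sub(" ", text)
    # Lowercase, split on whitespace, and drop the stop words.
    words = [w for w in text.lower().split() if w not in STOP_WORDS]
    return " ".join(words)


print(clean_headline("Coca-Cola shares rise 3% in 2020 <b>report</b> https://example.com/x"))
# prints: coca cola shares rise report
```

The cleaned string can then be passed to TextBlob in place of the raw text, for instance by applying clean_headline to the Headline column before computing the sentiment columns. Negations such as "not" are left out of the stop-word list here on purpose, since dropping them may change the meaning of a headline.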
