在tweepy上使用用户对象时的属性错误

问题描述 投票:1回答:1

我正在尝试编写一个程序,使用他们的Stream API和Tweepy从Twitter传输推文。这是我的代码的相关部分:

def on_data(self, data):
    if data.user.id == "25073877" or data.in_reply_to_user_id == "25073877":
        self.filename = trump.csv

    elif data.user.id == "30354991" or data.in_reply_to_user_id == "30354991":
        self.filename = harris.csv

    if not 'RT @' in data.text:
        csvFile = open(self.filename, 'a')
        csvWriter = csv.write(csvFile)

        print(data.text)
        try:
            csvWriter.writerow([data.text, data.created_at, data.user.id, data.user.screen_name,  data.in_reply_to_status_id])

        except:
            pass

def on_error(self, status_code):
    if status_code == 420:
        return False

代码应该做的是流式传输推文并编写推文的文本,创建日期,高音扬声器的用户ID,他们的屏幕名称,以及如果推文是一个他们回复的状态的回复ID答复。但是,我收到以下错误:

File "test.py", line 13, in on_data

 if data.user.id == "25073877" or data.in_reply_to_user_id == "25073877":

AttributeError: 'unicode' object has no attribute 'user'

有人可以帮帮我吗?谢谢!

编辑:正在读入“数据”的样本

{"created_at":"Fri Feb 15 20:50:46 +0000 2019","id":1096512164347760651,"id_str":"1096512164347760651","text":"@realDonaldTrump \nhttps:\/\/t.co\/NPwSuJ6V2M","source":"\u003ca href=\"http:\/\/twitter.com\" rel=\"nofollow\"\u003eTwitter Web Client\u003c\/a\u003e","truncated":false,"in_reply_to_status_id":null,"in_reply_to_status_id_str":null,"in_reply_to_user_id":25073877,"in_reply_to_user_id_str":"25073877","in_reply_to_screen_name":"realDonaldTrump","user":{"id":1050189031743598592,"id_str":"1050189031743598592","name":"Lauren","screen_name":"switcherooskido","location":"United States","url":null,"description":"Concerned citizen of the USA who would like to see Integrity restored in the US Government. Anti-marxist!\nSigma, INTP\/J\nREJECT PC and Identity Politics #WWG1WGA","translator_type":"none","protected":false,"verified":false,"followers_count":1459,"friends_count":1906,"listed_count":0,"favourites_count":5311,"statuses_count":8946,"created_at":"Thu Oct 11 00:59:11 +0000 2018","utc_offset":null,"time_zone":null,"geo_enabled":false,"lang":"en","contributors_enabled":false,"is_translator":false,"profile_background_color":"000000","profile_background_image_url":"http:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_image_url_https":"https:\/\/abs.twimg.com\/images\/themes\/theme1\/bg.png","profile_background_tile":false,"profile_link_color":"FF691F","profile_sidebar_border_color":"000000","profile_sidebar_fill_color":"000000","profile_text_color":"000000","profile_use_background_image":false,"profile_image_url":"http:\/\/pbs.twimg.com\/profile_images\/1068591478329495558\/ng_tNAXx_normal.jpg","profile_image_url_https":"https:\/\/pbs.twimg.com\/profile_images\/1068591478329495558\/ng_tNAXx_normal.jpg","profile_banner_url":"https:\/\/pbs.twimg.com\/profile_banners\/1050189031743598592\/1541441602","default_profile":false,"default_profile_image":false,"following":null,"follow_request_sent":null,"notifications":null},"geo":null,"coordinates":null,"place":null,"contributors":null,"is_quote_status":false,"quote_count":0,"reply_count":0,"retweet_count":0,"favorite_count":0,"entities":{"hashtags":[],"urls":[{"url":"https:\/\/t.co\/NPwSuJ6V2M","expanded_url":"https:\/\/www.conservativereview.com\/news\/5-insane-provisions-amnesty-omnibus-bill\/","display_url":"conservativereview.com\/news\/5-insane-\u2026","indices":[18,41]}],"user_mentions":[{"screen_name":"realDonaldTrump","name":"Donald J. Trump","id":25073877,"id_str":"25073877","indices":[0,16]}],"symbols":[]},"favorited":false,"retweeted":false,"possibly_sensitive":false,"filter_level":"low","lang":"und","timestamp_ms":"1550263846848"}

所以我认为修改后的问题是如何告诉程序只将这个JSON输出的部分写入CSV文件?我一直在使用Twitter的流API提供“数据”属性的引用。

python twitter tweepy
1个回答
0
投票

如您的评论中所述,推文数据采用“JSON格式”。我相信你的意思是它是一个JSON格式的字符串(unicode),而不是一个解析过的JSON对象。要在代码中访问您想要的字段,您需要使用json解析数据字符串。

EG

import json

json_data_object = json.loads(data)

然后,您可以像访问字典一样访问字段,例如

json_data_object['some_key']['some_other_key']
© www.soinside.com 2019 - 2024. All rights reserved.