Why does TypeError: expected string or bytes-like object occur?

import re
import tensorflow as tf

# `tweets` and `bert_tokenizer` are defined earlier (not shown)
token_ids = []
for tweet in tweets:
    # Remove unwanted characters and symbols
    tweet = re.sub(r'[^\w\s]', '', tweet)
    # Tokenize the tweet
    tokens = bert_tokenizer.tokenize([tweet])
    # Convert tokens to token IDs
    ids = tf.squeeze(bert_tokenizer.convert_tokens_to_ids(tokens))
    token_ids.append(ids) 
    input_ids = tf.ragged.constant(token_ids)

I am trying to preprocess and tokenize tweets, but it raises:

TypeError: expected string or bytes-like object
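
This message is what `re.sub` raises when its input is not a string, so a likely cause is that some entries in `tweets` are not `str` (for example `None`, or a float `NaN` from missing rows). The question does not say where `tweets` comes from, so that is an assumption. A minimal sketch of a guard, where `clean_tweet` is a hypothetical helper and the sample list is made up for illustration:

```python
import re

def clean_tweet(tweet):
    """Return the tweet with punctuation removed, or None if it is not text."""
    # re.sub with a str pattern needs a str input; None, a float NaN,
    # or a list raises "TypeError: expected string or bytes-like object".
    if not isinstance(tweet, str):
        return None
    return re.sub(r'[^\w\s]', '', tweet)

# Hypothetical sample data: the NaN and None entries stand in for missing rows.
tweets = ["Hello, world!", float("nan"), None, "BERT is #great"]

# Keep only the entries that were real strings.
cleaned = [c for c in (clean_tweet(t) for t in tweets) if c is not None]
print(cleaned)  # ['Hello world', 'BERT is great']
```

Printing `type(tweet)` inside the original loop just before the `re.sub` call should show which entry triggers the error.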
python string byte tokenize tweets