Uploading a file to OpenAI for fine-tuning: UnicodeEncodeError: 'ascii' codec can't encode character '\u201c' in position 7: ordinal not in range(128)


Python 3.9.12: this very simple code, intended to upload a jsonl file to OpenAI for fine-tuning gpt-3.5-turbo-1106, results in the error below. Any help is much appreciated.

Contents of the jsonl file being uploaded:

{"messages": [{"role": "system", "content": "Friendly Home Assistant"}, {"role": "user", "content": "Lights are too bring and hurting my eyes."}, {"role": "assistant", "content": "I'll reduce the luminosity by 25% now."}]}
{"messages": [{"role": "system", "content": "Friendly Home Assistant"}, {"role": "user", "content": "Music is not loud enough."}, {"role": "assistant", "content": "Ill increase the volume by 25% now."}]}

Code in upload.py:

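from openai import OpenAI

# OpenAI() reads the API key from the OPENAI_API_KEY environment variable.
client = OpenAI()
training_data_path = "fine_tuneGPT_test.jsonl"

# Upload the training file; pass the path variable, not the literal string.
response = client.files.create(
    file=open(training_data_path, "rb"),
    purpose="fine-tune"
)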

print(response)

And the error:

(openai-env) (base) PS C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2> python upload.py                                                                                 
Traceback (most recent call last):
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\upload.py", line 15, in <module>
    response = client.files.create(
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\openai-env\lib\site-packages\openai\resources\files.py", line 95, in create
    return self._post(
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\openai-env\lib\site-packages\openai\_base_client.py", line 1086, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\openai-env\lib\site-packages\openai\_base_client.py", line 846, in request
    return self._request(
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\openai-env\lib\site-packages\openai\_base_client.py", line 866, in _request
    request = self._build_request(options)
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\openai-env\lib\site-packages\openai\_base_client.py", line 446, in _build_request
    headers = self._build_headers(options)
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\openai-env\lib\site-packages\openai\_base_client.py", line 407, in _build_headers
    headers = httpx.Headers(headers_dict)
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\openai-env\lib\site-packages\httpx\_models.py", line 70, in __init__
    self._list = [
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\openai-env\lib\site-packages\httpx\_models.py", line 74, in <listcomp>
    normalize_header_value(v, encoding),
  File "C:\Users\mendw\Dropbox\Projects\finetune_gpt\test_2\openai-env\lib\site-packages\httpx\_utils.py", line 53, in normalize_header_value
    return value.encode(encoding or "ascii")
UnicodeEncodeError: 'ascii' codec can't encode character '\u201c' in position 7: ordinal not in range(128)

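I have searched for an answer to this problem but could not find a solution. I tried manually setting the encoding to utf-8, since the character is outside the ASCII range.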
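One way to test whether the jsonl file itself contains the offending character is to scan it for anything outside the ASCII range. The following is a minimal sketch, not part of the original post; the helper name `scan_jsonl` is hypothetical, and it assumes the file is UTF-8 text with one JSON object per line.

```python
import json


def scan_jsonl(text):
    """Return (line_no, column, char) for every non-ASCII character.

    Hypothetical helper. Raises json.JSONDecodeError (a ValueError)
    if a non-empty line is not valid JSON.
    """
    hits = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # skip blank lines
        json.loads(line)  # check that the line parses as JSON
        for col, ch in enumerate(line):
            if ord(ch) > 127:
                hits.append((line_no, col, ch))
    return hits


# A line in the same shape as the training data above, with straight quotes only.
sample = '{"messages": [{"role": "user", "content": "Music is not loud enough."}]}\n'
print(scan_jsonl(sample))  # []
```

To run it against the real file, pass in the text read with `open("fine_tuneGPT_test.jsonl", encoding="utf-8").read()`. An empty result would suggest that the character comes from somewhere other than the file.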

ascii openai-api fine-tuning
1 Answer

I found the solution and just wanted to share it:

OpenAI Community
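For readers without access to the linked thread, the traceback itself narrows things down. It ends in httpx's normalize_header_value, called from _build_headers, so the character that fails to encode sits in a request header rather than in the jsonl file. U+201C is a left curly double quote, and index 7 is the first character after the seven-character prefix "Bearer ". One plausible cause, which this post does not confirm, is an API key that was saved with curly quotes around it. The sketch below shows how such a value could be inspected and cleaned; the helper names and the key `sk-example` are hypothetical.

```python
def non_ascii_positions(value):
    """Return (index, character, code point) for each non-ASCII character."""
    return [
        (i, ch, f"U+{ord(ch):04X}")
        for i, ch in enumerate(value)
        if ord(ch) > 127
    ]


def strip_quotes(key):
    """Remove surrounding whitespace and straight or curly quotes from a key."""
    return key.strip().strip("\"'\u201c\u201d\u2018\u2019")


# Hypothetical header value: a key pasted together with curly quotes.
header = "Bearer \u201csk-example\u201d"
first_hit = non_ascii_positions(header)[0]
print(first_hit[0], first_hit[2])  # 7 U+201C

# After cleaning, the header value encodes as ASCII without raising.
cleaned = strip_quotes("\u201csk-example\u201d")
encoded = f"Bearer {cleaned}".encode("ascii")
print(encoded)  # b'Bearer sk-example'
```

If this is indeed the cause, setting the OPENAI_API_KEY environment variable again with straight quotes, or with no quotes at all, should let the upload proceed.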
