Python Pandas - 由于引用字符,csv 文件未正确拆分为列?

问题描述 投票:0回答:1

我已经为以下问题苦苦挣扎了一段时间,似乎无法在任何地方找到正确的解决方案。

问题如下:我有一个包含 13 列的 csv 文件,当我将 CSV 文件读入 Python Pandas 时,所有内容都放置在单个列中,并且我似乎无法正确地将数据集拆分为 13 列。

我尝试了很多东西,但这是我使用的最新代码以及由此产生的输出:


将 pandas 导入为 pd

file_path =“C:\Users\User23\Downloads\Detail_Table_Test(2).csv”

df = pd.read_csv(file_path, sep=',', header = 0, engine = 'python', quotechar = '"')

打印(df.head())

会话 ID,"活动日期","活动时间","序列键","类别名称","详细信息 1","详细信息 2","详细信息 3","详细信息","之前的两个类别","从类别名称","到类别名称","之后有两个类别" 0 1111111,"2023-11-11","02:05:06","1","着陆...
1 1111111,"2023-11-11","02:05:10","2","数据收集...
2 1111111,"2023-11-11","02:05:19","3","问题...
3 1111111,"2023-11-11","02:05:29","4","问题...
4 1111111,"2023-11-11","04:35:34","5","数据收集...


这是文件中的一行数据:

111111,"2023-11-12","18:09:34","14","问卷填写","公司ESG评估(新)","多元化、公平与包容性","评估" ,“公司ESG评估(新)-多元化、公平性和包容性:评估”、“数据收集”、“问卷填写”、“问卷填写”、“问卷填写”


我尝试指定 quotechars = '"'

我尝试了以下代码行: sep=', (?=(?:"[^"]?(?: [^"])*))|, (?=[^",]+ (?:,|$))',


提前1000次感谢:)

pandas csv split double-quotes
1个回答
0
投票

试试这个:

check=pd.read_csv(r"C:\Users\743622\Desktop\trial.txt",quoting=csv.QUOTE_NONE,quotechar = '"')
© www.soinside.com 2019 - 2024. All rights reserved.