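I have been struggling with the following problem for a while and cannot find a working solution anywhere.
The problem: I have a CSV file with 13 columns, but when I read it into Python with pandas, everything ends up in a single column, and I cannot get the dataset to split into 13 columns correctly.
I have tried many things; this is the most recent code I used, followed by the output it produces: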
import pandas as pd
# Raw string so the backslashes in the Windows path are not treated as escapes
file_path = r"C:\Users\User23\Downloads\Detail_Table_Test(2).csv"
df = pd.read_csv(file_path, sep=',', header=0, engine='python', quotechar='"')
print(df.head())
Session ID,"Activity Date","Activity Time","Sequence Key","Category Name","Detail 1","Detail 2","Detail 3","Detail","Two Categories Before","From Category Name","To Category Name","Two Categories After"
0 1111111,"2023-11-11","02:05:06","1","Landing...
1 1111111,"2023-11-11","02:05:10","2","Data Collection...
2 1111111,"2023-11-11","02:05:19","3","Question...
3 1111111,"2023-11-11","02:05:29","4","Question...
4 1111111,"2023-11-11","04:35:34","5","Data Collection...
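This is one row of data from the file:
111111,"2023-11-12","18:09:34","14","Questionnaire Completion","Company ESG Assessment (New)","Diversity, Equity and Inclusion","Assessment" ,"Company ESG Assessment (New) - Diversity, Equity and Inclusion: Assessment","Data Collection","Questionnaire Completion","Questionnaire Completion","Questionnaire Completion"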
I tried specifying quotechar = '"'
I also tried the following regular expression as the separator: sep=', (?=(?:"[^"]?(?: [^"])*))|, (?=[^",]+ (?:,|$))',
Thanks a thousand times in advance :)
Try this:
import csv
check = pd.read_csv(r"C:\Users\743622\Desktop\trial.txt", quoting=csv.QUOTE_NONE, quotechar='"')
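To show what quoting=csv.QUOTE_NONE actually does, here is a minimal sketch on an in-memory sample instead of the real file. The three-column `sample` string is made up for illustration (the real file has 13 columns), and stripping the leftover quotes afterwards is my own suggested follow-up step, not something the one-liner above does for you.

```python
import csv
import io

import pandas as pd

# Hypothetical three-column sample standing in for the real 13-column file.
sample = (
    'Session ID,"Activity Date","Category Name"\n'
    '1111111,"2023-11-11","Landing"\n'
    '1111111,"2023-11-11","Data Collection"\n'
)

# QUOTE_NONE makes the parser treat '"' as an ordinary character, so the
# line is split on every comma and the quotes stay inside the values.
# dtype=str keeps every column as text so .str methods work on all of them.
df = pd.read_csv(io.StringIO(sample), sep=",", quoting=csv.QUOTE_NONE, dtype=str)

# Remove the leftover quote characters from the header and from each cell.
df.columns = df.columns.str.strip('"')
df = df.apply(lambda column: column.str.strip('"'))

print(df.shape)
print(df.columns.tolist())
```

One caveat to check against your data: because QUOTE_NONE splits on every comma, a quoted value that itself contains a comma will be broken into two fields, so the column count can come out wrong for such rows.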