I have a .csv file with some empty cells. How can I count the empty cells in the .csv file? Is if row['PredictionString'] == " " the right check?
import os
import pandas as pd

submission = pd.read_csv(os.path.join(ROOT_DIR, 'submission.csv'))
counter1 = 0  # the counter must exist before the loop increments it
for index, row in submission.iterrows():
    if row['PredictionString'] == " ":
        counter1 = counter1 + 1
print('output:', counter1)
It does not work; the output is:
output: 0
First column name: patientId
Second column name: PredictionString
If submission is a pandas DataFrame (as it appears to be), you can count them like this:
counter1 = len(submission[submission.PredictionString == ' '])
No for loop is needed.
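To see why the one-liner above works without a loop, here is a minimal sketch. The DataFrame contents are made up for illustration and are not the asker's data. Comparing a column with a value yields a boolean Series, and that mask can either filter the rows or be summed directly, since True counts as 1.

```python
import pandas as pd

# Hypothetical stand-in for the submission DataFrame
submission = pd.DataFrame({
    "patientId": ["a", "b", "c"],
    "PredictionString": ["0.9 10 20 30 40", " ", " "],
})

mask = submission["PredictionString"] == " "  # boolean Series, one entry per row
count_via_len = len(submission[mask])         # number of rows kept by the filter
count_via_sum = int(mask.sum())               # True counts as 1, False as 0
print(count_via_len, count_via_sum)
```

Both variants give the same number; summing the mask simply avoids building the filtered DataFrame.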
Edit: to also account for a single space ' ', an empty string '' and NaN:
counter1 = len(submission[(submission.PredictionString == ' ') | (submission.PredictionString == '') | (submission.PredictionString.isnull())])
Example:
>>> import numpy as np
>>> import pandas as pd
>>> mydict = {'patientId': {0: '1', 1: '1', 2: '1'},
...           'PredictionString': {0: '', 1: ' ', 2: np.nan}}
>>> submission = pd.DataFrame(mydict)
>>> counter1 = len(submission[(submission.PredictionString == ' ') | (submission.PredictionString == '') | (submission.PredictionString.isnull())])
>>> print(counter1)
3
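A likely reason the original loop printed 0 is that pd.read_csv, with its default settings, parses an empty field as NaN rather than as a string, so the comparison with " " never matches. The sketch below illustrates this with a small in-memory CSV. The file contents and the variable names n_nan, n_space and n_blank are assumptions made for the example, not the asker's actual data.

```python
import io
import pandas as pd

# Hypothetical CSV content standing in for submission.csv
csv_text = (
    "patientId,PredictionString\n"
    "a,0.9 10 20 30 40\n"
    "b,\n"     # truly empty cell
    "c, \n"    # cell holding a single space
    "d,0.5 1 2 3 4\n"
)
submission = pd.read_csv(io.StringIO(csv_text))

col = submission["PredictionString"]
n_nan = int(col.isna().sum())       # empty cells, parsed as NaN by default
n_space = int((col == " ").sum())   # cells that literally equal " "
# Blank means missing, or nothing left after stripping whitespace
n_blank = int((col.isna() | (col.str.strip() == "")).sum())
print(n_nan, n_space, n_blank)
```

If the real file only has truly empty cells, col.isna().sum() alone should be enough; the combined condition is there in case some cells contain only whitespace.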