Pandas数据框的分组和计数以及在Python中的验证

问题描述 投票:2回答:1

我目前正在分析以执行以下操作:

1。我需要计算每年是否存在4个“否”条目。代表2018年和2019年的人物。同一日期应排除在外(确实不管哪一个)

它看起来应如下所示:

Year    Gender  No. People 
18      Men         11
        Woman        8
        Not Applied  3
19      Men         14
        Woman        5
        Not Applied  0

“人数”列显示人数的计数。

2。按性别检查,在10天内的最近10个月中是否存在“人数”中的6个条目以上。

结果可能看起来像:

Period                   Gender      Entries
01/23/2019 - 01/15/2019  Men         6
N/A                      Woman       N/A
N/A                      Not Applied N/A

3。检查过去3个月中是否有11项针对人数的衡量标准]]

Period                   Gender      Entries
12/20/2018 - 01/23/2019  Men         26
12/20/2018 - 01/23/2019  Woman       13
12/20/2018 - 12/26/2018  Not Applied N/A

以某种方式看起来很复杂,这就是为什么我在代码中苦苦挣扎。

我开始使用以下代码:

import pandas as pd
path = 'path'
filename = 'excel.xls'
final_path = path + '/' + filename
ws_name = 'Sheet1'

df.groupby(df['Date'].dt.year)['No. People'].agg(['count']) 

但是我总是为结果或错误而苦苦挣扎。

数据类似于Excel中的以下内容:

Date    Gender  No. People
12/20/18    Men 4
12/21/18    Men 9
12/22/18    Men 3
12/23/18    Men 9
12/24/18    Men 6
12/25/18    Men 1
12/26/18    Men 3
12/27/18    Men 8
12/28/18    Men 3
12/29/18    Men 5
12/30/18    Men 8
12/31/18    Men 
01/01/19    Men 
01/02/19    Men 
01/03/19    Men 
01/04/19    Men 9
01/05/19    Men 7
01/06/19    Men 5
01/07/19    Men 1
01/08/19    Men 8
01/09/19    Men 5
01/10/19    Men 6
01/11/19    Men 9
01/12/19    Men 7
01/13/19    Men 
01/14/19    Men 
01/15/19    Men 
01/16/19    Men 
01/17/19    Men 
01/18/19    Men 
01/19/19    Men 6
01/20/19    Men 5
01/21/19    Men 2
01/22/19    Men 5
01/23/19    Men 1
12/20/18    Women   6
12/21/18    Women   6
12/22/18    Women   2
12/23/18    Women   2
12/24/18    Women   2
12/25/18    Women   
12/26/18    Women   
12/27/18    Women   
12/28/18    Women   1
12/29/18    Women   1
12/30/18    Women   4
12/31/18    Women   
01/01/19    Women   
01/02/19    Women   
01/03/19    Women   
01/04/19    Women   
01/05/19    Women   
01/06/19    Women   
01/07/19    Women   
01/08/19    Women   
01/09/19    Women   
01/10/19    Women   
01/11/19    Women   
01/12/19    Women   
01/13/19    Women   
01/14/19    Women   
01/15/19    Women   
01/16/19    Women   
01/17/19    Women   
01/18/19    Women   
01/19/19    Women   4
01/20/19    Women   6
01/21/19    Women   8
01/22/19    Women   9
01/23/19    Women   4
12/20/18    Not Applied 6
12/21/18    Not Applied 2
12/22/18    Not Applied 3
12/23/18    Not Applied 
12/24/18    Not Applied 
12/25/18    Not Applied 
12/26/18    Not Applied 

我目前正在分析以执行以下操作:1.我需要计算每年是否存在4个“否”条目。代表2018年和2019年的人物。应该排除同一日期(是...

python pandas dataframe group-by count
1个回答
1
投票

首先,最好也按性别添加分组

© www.soinside.com 2019 - 2024. All rights reserved.