如何在数据框中创建新列并将其全部分配为0?

问题描述 投票:1回答:3

我使用这种语法来预分配列,并为所有列分配0:

data['Base'] = 0
data['Base_Chg'] = 0
data['Base_5D_Chg'] = 0
data['Year_Low'] = 0
data['Year_High'] = 0
data['Market_Cap'] = 0
data['PE_Ratio'] = 0
data['SMA_50'] = 0
data['SMA_100'] = 0
data['SMA_200'] = 0
data['RSI'] = 0
data['ADX'] = 0
data['ATR'] = 0
data['STDEV'] = 0

有没有办法用更少的代码行来做同样的事情?

在python中使用熊猫。

Thx!

pandas dataframe rows assign
3个回答
1
投票

假设您的列名在列表中,我们创建一个字典,键为列名,值为0。然后,我们在df1上做一个caretersian联接。

cols = ['Base', 
'Base_Chg', 
'Base_5D_Chg', 
'Year_Low', 
'Year_High', 
'Market_Cap', 
'PE_Ratio', 
'SMA_50', 
'SMA_100', 
'SMA_200', 
'RSI', 
'ADX', 
'ATR', 
'STDEV']

df1 = pd.DataFrame({'A' : [0,1,2,3]}) # your original dataframe.

df2 = pd.DataFrame(dict(zip(cols,[0] * len(cols))),index=[0]) 
#new dataframe from list of cols.

df3 = pd.merge(df1.assign(key='key'),df2.assign(key='key'),how='outer').drop('key',axis=1)
#merge of your old dataframe and new.

print(df1)
   A
0  0
1  1
2  2
3  3

print(df2)
   Base  Base_Chg  Base_5D_Chg  Year_Low  Year_High  Market_Cap  PE_Ratio  \
0     0         0            0         0          0           0         0   

   SMA_50  SMA_100  SMA_200  RSI  ADX  ATR  STDEV  
0       0        0        0    0    0    0      0  

print(df3)



   A  Base  Base_Chg  Base_5D_Chg  Year_Low  Year_High  Market_Cap  PE_Ratio  \
0  0     0         0            0         0          0           0         0   
1  1     0         0            0         0          0           0         0   
2  2     0         0            0         0          0           0         0   
3  3     0         0            0         0          0           0         0   

   SMA_50  SMA_100  SMA_200  RSI  ADX  ATR  STDEV  
0       0        0        0    0    0    0      0  
1       0        0        0    0    0    0      0  
2       0        0        0    0    0    0      0  
3       0        0        0    0    0    0      0  

1
投票

您可以将关键字参数与OrderedDict一起使用。

import collections as co

od = co.OrderedDict({'Base':0,'Base_Chg':0,'Base_5D_Chg':0})

data.assign(**od)

结果:

In [18]: data.assign(**od)
Out[18]: 
   a  Base  Base_Chg  Base_5D_Chg
0  1     0         0            0
1  2     0         0            0
2  3     0         0            0

0
投票

至少,您仍然必须写出所有新列的名称。

您可以使用循环:

columns=['Base', 'Base_Chg', 'Base_5D_Chg', 'Year_Low', 'Year_High', 'Market_Cap', 'PE_Ratio', 'SMA_50', 'SMA_100', 'SMA_200', 'RSI', 'ADX', 'ATR', 'STDEV']
for col in columns:
    df[col] = 0

pd.concat

columns=['Base', 'Base_Chg', 'Base_5D_Chg', 'Year_Low', 'Year_High', 'Market_Cap', 'PE_Ratio', 'SMA_50', 'SMA_100', 'SMA_200', 'RSI', 'ADX', 'ATR', 'STDEV']
new_df = pd.DataFrame(0, columns=columns, index=df.index)
df = pd.concat([df, new_df], axis=1)

测试以了解哪种情况适合您的用例。

© www.soinside.com 2019 - 2024. All rights reserved.