How do I iteratively merge DataFrames in pandas?

Question

Given a list of DataFrames, I want to merge them iteratively and return a single DataFrame. The inputs are frames (a list of pandas DataFrames) and on_columns (a string or list of strings naming the columns to merge on). How can I do this with df.merge?

def merge(frames, on_columns):
    """Given a list of DataFrames, iteratively merge them and return a single DataFrame.

    HINT: Use slice on frames when iterating and merging.

    Arguments:
        frames {list} -- a list of pandas DataFrames
        on_columns {string or list} -- a string or list of strings
            containing the column names on which to join

    Returns:
        df -- a pandas.DataFrame containing a merged version of the
        provided dataframes. If frames is None or an empty list return None
    """
    # implementation here
    df = None  # merged df
    return df

Edit: I think maybe I could use df.concat, but I'm not sure how?

Answer 1

Something like this should work:

def merge(frames, on_columns):
    # Nothing to merge: return None for None or an empty list
    if not frames:
        return None
    if len(frames) == 1:
        return frames[0]
    out = frames[0]
    for df in frames[1:]:
        out = out.merge(df, on=on_columns)
    return out
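
A quick way to check the loop above is to run it on a few small frames. The following is only a sketch: the sample data and the column names id, a, b and c are made up for illustration. Note that DataFrame.merge defaults to an inner join, so only keys present in every frame survive.

```python
import pandas as pd

def merge(frames, on_columns):
    # Return None for None or an empty list, as the question asks
    if not frames:
        return None
    out = frames[0]
    # Slice off the first frame and merge the rest in one at a time
    for df in frames[1:]:
        out = out.merge(df, on=on_columns)
    return out

# Hypothetical sample data (not from the question)
frames = [
    pd.DataFrame({"id": [1, 2, 3], "a": [10, 20, 30]}),
    pd.DataFrame({"id": [2, 3, 4], "b": ["x", "y", "z"]}),
    pd.DataFrame({"id": [3, 2, 5], "c": [0.3, 0.2, 0.5]}),
]

merged = merge(frames, "id")
print(merged)  # only ids 2 and 3 appear in all three frames
```

If rows with unmatched keys should be kept, passing how="outer" (or "left") to out.merge is one option; whether that is wanted depends on the data.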

Answer 2

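import pandas as pd

# Incremental version: take the first frame, then append the rest one by one.
# DataFrame.append was removed in pandas 2.0, so pd.concat does the appending here.
df = next(dfs)
for records in dfs:
    df = pd.concat([df, records])

# The loop above is equivalent to one call on a fresh (unconsumed) iterator:
# df = pd.concat(dfs)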

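Notes:

dfs is an iterator of pandas.DataFrame objects
Each DataFrame in dfs has the same columns
You may want to reset the index after all the appends are done
I tried functools.reduce (https://docs.python.org/3/library/functools.html#functools.reduce), but it is not obvious what the function(df1, df2) should be, since pd.concat expects an iterable rather than two arguments; in any case, pd.concat already performs the reduction
http://pandas.pydata.org/pandas-docs/stable/reference/general_functions.html lists pd.concat and other utilities; see also http://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.merge.html#pandas.merge and http://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.concat.html#pandas.concat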
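
For the functools.reduce route mentioned in the notes, one possible two-argument function is a small lambda wrapping DataFrame.merge. For plain stacking, pd.concat with ignore_index=True also takes care of the re-indexing note. This is a sketch under assumptions: merge_reduce and stack are hypothetical helper names, and the sample frames are made up.

```python
from functools import reduce

import pandas as pd

def merge_reduce(frames, on_columns):
    """Key-based join of all frames; None for None or an empty list."""
    if not frames:
        return None
    # reduce folds the list pairwise: ((f0 merge f1) merge f2) ...
    return reduce(lambda left, right: left.merge(right, on=on_columns), frames)

def stack(dfs):
    """Row-wise concatenation with a fresh 0..n-1 index."""
    return pd.concat(dfs, ignore_index=True)

# Hypothetical data: an iterator of frames sharing the same columns
dfs = iter([
    pd.DataFrame({"id": [1, 2], "a": [10, 20]}),
    pd.DataFrame({"id": [3], "a": [30]}),
])
stacked = stack(dfs)

# Hypothetical data: frames sharing only the key column
joined = merge_reduce(
    [
        pd.DataFrame({"id": [1, 2], "a": [10, 20]}),
        pd.DataFrame({"id": [2, 3], "b": [5, 6]}),
    ],
    "id",
)
```

The two helpers do different things: stack appends rows of frames with identical columns, while merge_reduce joins frames on shared key columns, which is what the question asks for.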

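P.S. Don't re-create functionality the library already provides; get comfortable reading and re-reading the docs, especially since the pandas documentation is voluminous.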
