使用groupby根据列中的值拆分Pandas Dataframe

alen0pnh 于 2023-03-16 发布在其他

关注(0)|答案(4)|浏览(213)

我想根据ZZ列拆分以下 Dataframe

df = 
        N0_YLDF  ZZ        MAT
    0  6.286333   2  11.669069
    1  6.317000   6  11.669069
    2  6.324889   6  11.516454
    3  6.320667   5  11.516454
    4  6.325556   5  11.516454
    5  6.359000   6  11.516454
    6  6.359000   6  11.516454
    7  6.361111   7  11.516454
    8  6.360778   7  11.516454
    9  6.361111   6  11.516454

作为输出，我需要一个新的DataFrame，其中N0_YLDF列拆分为4，ZZ的每个唯一值对应一个新列。我该怎么做呢？我可以执行groupby，但不知道如何处理分组对象。

pandas

来源：https://stackoverflow.com/questions/23691133/split-pandas-dataframe-based-on-values-in-a-column-using-groupby

4条答案

按热度按时间

daupos2t1#

gb = df.groupby('ZZ')    
[gb.get_group(x) for x in gb.groups]

赞(0）回复(0）举报 2023-03-16

7uzetpgm2#

还有另一种选择，因为groupby返回一个生成器，我们可以简单地使用列表解析来检索第二个值（帧）。

dfs = [x for _, x in df.groupby('ZZ')]

赞(0）回复(0）举报 2023-03-16

pdtvr36n3#

在R中有一个名为split的 Dataframe 方法，适用于所有R用户：

def split(df, group):
     gb = df.groupby(group)
     return [gb.get_group(x) for x in gb.groups]

赞(0）回复(0）举报 2023-03-16

ru9i0ody4#

将它们存储在dict中，这允许您基于组键访问组DataFrame。

d = dict(tuple(df.groupby('ZZ')))
d[6]

#    N0_YLDF  ZZ        MAT
#1  6.317000   6  11.669069
#2  6.324889   6  11.516454
#5  6.359000   6  11.516454
#6  6.359000   6  11.516454
#9  6.361111   6  11.516454

如果您只需要DataFrame的一个子集，在本例中只需要'NO_YLDF' Series，则可以修改dict解析。

d = dict((idx, gp['N0_YLDF']) for idx, gp in df.groupby('ZZ'))
d[6]
#1    6.317000
#2    6.324889
#5    6.359000
#6    6.359000
#9    6.361111
#Name: N0_YLDF, dtype: float64

赞(0）回复(0）举报 2023-03-16

我来回答

使用groupby根据列中的值拆分Pandas Dataframe

4条答案

相关问题

热门标签

最新问答