
Passing a custom function to pandas .agg()

Python

阿晨1998 2023-12-08 17:05:16

I have the following aggregation in pandas:

    summary_df = df.groupby(['provider', 'id']).agg(
        title=('title', 'first'),
        file_size=*custom*
    ).reset_index()

For file_size I want to use the following calculation:

    sum([item['file_size'] for item in df if item['is_main_video'] is True])

How would I do the above inside .agg()?


2 Answers

慕桂英546537

Contributed 1848 experience points, received 10+ likes

In your case, agg takes a single column as the source for each output, so you can create a helper column before the groupby:

    import numpy as np

    # Keep file_size only on main-video rows; all other rows contribute 0 to the sum.
    df['New'] = np.where(df['is_main_video'], df['file_size'], 0)

    summary_df = df.groupby(['provider', 'id']).agg(
        title=('title', 'first'),
        file_size=('New', 'sum')
    ).reset_index()

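Update: the same thing as a single chain, using assign so that df itself is not modified:

    # Assumes numpy is imported as np.
    summary_df = (
        df.assign(New=np.where(df['is_main_video'], df['file_size'], 0))
          .groupby(['provider', 'id'])
          .agg(
              title=('title', 'first'),
              file_size=('New', 'sum')
          )
          .reset_index()
    )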
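For completeness, a custom function can also be passed straight to .agg(). The following is only a sketch: the sample data and the helper name main_video_size are made up for illustration, and it assumes df has a unique index. Each group's file_size values arrive as a Series that keeps its original row labels, so the function can look up is_main_video for those same rows in df.

```python
import pandas as pd

# Hypothetical sample data; the question does not show the real frame.
df = pd.DataFrame({
    "provider": ["A", "A", "A", "B", "B"],
    "id": [1, 1, 2, 3, 3],
    "title": ["hello", "world", "pandas", "example", "here"],
    "is_main_video": [True, False, True, True, False],
    "file_size": [10, 12, 20, 19, 10],
})


def main_video_size(sizes: pd.Series):
    """Sum the file sizes of one group, counting only main-video rows."""
    # sizes.index holds the original row labels, so df can be queried with it.
    mask = df.loc[sizes.index, "is_main_video"]
    return sizes[mask].sum()


summary_df = (
    df.groupby(["provider", "id"])
      .agg(
          title=("title", "first"),
          file_size=("file_size", main_video_size),
      )
      .reset_index()
)
print(summary_df)
```

The helper-column approach above is usually faster, because 'sum' runs as a built-in aggregation while a Python function is called once per group.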

Answered 2023-12-08

猛跑小豬

Contributed 1858 experience points, received 8+ likes

You can use Series.where to temporarily "ignore" the file_size values where is_main_video is False, then run the groupby to sum whatever remains:

    import pandas as pd

    df = pd.DataFrame({
        "provider": ["A", "A", "A", "B", "B"],
        "title": ["hello", "world", "pandas", "example", "here"],
        "is_main_video": [True, False, True, True, False],
        "file_size": [10, 12, 20, 19, 10]
    })

    print(df)
       provider    title  is_main_video  file_size
    0         A    hello           True         10
    1         A    world          False         12
    2         A   pandas           True         20
    3         B  example           True         19
    4         B     here          False         10

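    # where() replaces file_size with NaN on rows that are not the main video;
    # 'sum' skips NaN, so only main-video sizes are added up.
    aggregated_df = (
        df.assign(file_size=df["file_size"].where(df["is_main_video"]))
          .groupby("provider", as_index=False)
          .agg(
              title=("title", "first"),
              file_size=("file_size", "sum")
          )
    )

    print(aggregated_df)
       provider    title  file_size
    0         A    hello       30.0
    1         B  example       19.0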
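The float results (30.0 and 19.0) appear because Series.where fills the masked rows with NaN, which forces the column to a float dtype. As a small variation, and only a sketch that reuses the same sample frame, passing 0 as the second argument to where fills those rows with zero instead, which should leave the column as integers.

```python
import pandas as pd

df = pd.DataFrame({
    "provider": ["A", "A", "A", "B", "B"],
    "title": ["hello", "world", "pandas", "example", "here"],
    "is_main_video": [True, False, True, True, False],
    "file_size": [10, 12, 20, 19, 10],
})

aggregated_df = (
    # Rows that are not the main video get a file_size of 0 rather than NaN.
    df.assign(file_size=df["file_size"].where(df["is_main_video"], 0))
      .groupby("provider", as_index=False)
      .agg(
          title=("title", "first"),
          file_size=("file_size", "sum"),
      )
)
print(aggregated_df)
```

The sums are the same either way; the difference is only in how masked rows are represented before aggregation.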

Answered 2023-12-08
