Suppose this is my sample input, df:
       date h_league
0  19901126       AA
1  19911127       NA
2  20030130       AA
3  20041217      NaN
4  20080716       AA
5  20011215       NA
6  19970603      NaN
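From this I want to extract the unique leagues in h_league and create two new columns: max_date, holding the latest date for each league, and min_date, holding the earliest date.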
# Desired Output:
  h_league  Max_date  Min_date
0       AA  20080716  19901126
1       NA  20011215  19911127
I wrote a function for this task. It prints something similar to what I want, but not the exact desired output.
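def league_info(league):
    # Keep only the rows that belong to this league
    league_games = df[df["h_league"] == league]
    # Take min/max from the filtered rows, not from the whole df
    earliest = league_games["date"].min()
    latest = league_games["date"].max()
    print("{} went from {} to {}".format(league, earliest, latest))

# dropna() skips the missing (NaN) league entries
for league in df["h_league"].dropna().unique():
    league_info(league)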
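
One way to get a table rather than printed lines might be a groupby with named aggregation. The following is a minimal sketch, not a definitive solution. It assumes the date column holds integers in YYYYMMDD form (so numeric min/max match chronological order), that the string "NA" is a real league code, and that NaN means the league is missing. The name `result` is only illustrative.

```python
import numpy as np
import pandas as pd

# Sample data copied from the question. Assumption: "NA" is a real
# league code, while np.nan marks a missing league.
df = pd.DataFrame({
    "date": [19901126, 19911127, 20030130, 20041217,
             20080716, 20011215, 19970603],
    "h_league": ["AA", "NA", "AA", np.nan, "AA", "NA", np.nan],
})

# groupby drops NaN keys by default, so only AA and NA remain.
# Named aggregation produces one output column per keyword.
result = (
    df.groupby("h_league")["date"]
      .agg(Max_date="max", Min_date="min")
      .reset_index()
)

print(result)
```

If the data comes from pd.read_csv, note that the string "NA" is parsed as missing by default. Passing keep_default_na=False together with an explicit na_values list is one way to keep it as a league code.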