代码之家 › 专栏 › 技术社区 › daiyue

pandas根据df中的另一列指定列值

dataframe pandas python-3.x

daiyue · 技术社区 · 7 年前

我有以下的 df ,

id    a_id    b_id
1     25      50
1     25      50
2     26      51
2     26      51
3     25      52
3     28      52
3     28      52

我有以下代码要分配 a_id 和 b_id 到 -1 ,基于每个行的行数 id 价值在 东风 如果每个 阿伊德 或 ByID 值与的特定值具有完全相同的行/子df 身份证件 有,那排 阿伊德 和 ByID 获得- 1;

cluster_ids = df.loc[df['id'] > -1]['id'].unique()

types = ['a_id', 'b_id']

for cluster_id in cluster_ids:
    rows = df.loc[df['id'] == cluster_id]

    for type in types:
        ids = rows[type].values

        match_rows = df.loc[df[type] == ids[0]]

        if match_rows.equals(rows):
           df.loc[match_rows.index, type] = -1

所以结果df看起来像,

id    a_id    b_id
1     25      -1
1     25      -1
2     -1      -1
2     -1      -1
3     25      -1
3     28      -1
3     28      -1

我想知道是否有更有效的方法来做这件事。

1 回复 | 直到 7 年前

phi 7 年前

one_value_for_each_id = df.groupby('id').transform(lambda x: len(set(x)) == 1)

 a_id  b_id
0   True  True
1   True  True
2   True  True
3   True  True
4  False  True
5  False  True
6  False  True

one_id_for_each_value = pd.DataFrame({
    col: df.groupby(col).id.transform(lambda x: len(set(x)) == 1)
    for col in ['a_id', 'b_id']
})

   a_id  b_id
0  False  True
1  False  True
2   True  True
3   True  True
4  False  True
5   True  True
6   True  True

one_to_one_relationship = one_id_for_each_value & one_value_for_each_id

# Set all values that satisfy the one-to-one relationship to `-1`
df.loc[one_to_one_relationship.a_id, 'a_id'] = -1
df.loc[one_to_one_relationship.b_id, 'b_id'] = -1

a_id  b_id
0    25    -1
1    25    -1
2    -1    -1
3    -1    -1
4    25    -1
5    28    -1
6    28    -1

推荐文章

TheCodeNovice · R中符号格式的尾随零和其他问题[重复]

1 年前

Daniel Estévez · 扩展数据帧以包含不存在的值

1 年前

T Richard · 根据条件交换分组数据中的字符串或值

1 年前

Homer Jay Simpson · R中flextable的标题字体和垂直合并

1 年前

RKIDEV · Panda迭代行并将第n行值乘以下一(n+1)行值

1 年前

Ssong · 如何有条件地运用资本化?

1 年前

Marcio Lino · 在Pandas中转换多个值列

1 年前

Ray · 在Python pandas包中使用groupby函数时,输出结果存在差异的原因是什么?

1 年前

RobertF · 如果列没有表头,如何在R数据帧中引用变量名?

1 年前

Homer Jay Simpson · ggplot2`geom_label()中的警告消息`

1 年前