
Does Pandas str.replace with a regex double the replacement? [duplicate]

regex pandas python

sdbbs · 1 year ago

Suppose I have this pandas Series:

$ python3 -c 'import pandas as pd; print(pd.Series(["1","2","3","4"]))'
0    1
1    2
2    3
3    4
dtype: object

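I want to "wrap" the strings "1", "2", "3", "4" so that each is prefixed with "a" and suffixed with "b", i.e. I want to get "a1b", "a2b", "a3b", "a4b". So I tried Series.str.replace (https://pandas.pydata.org/docs/reference/api/pandas.Series.str.replace.html):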

$ python3 -c 'import pandas as pd; print(pd.Series(["1","2","3","4"]).str.replace("(.*)", r"a\1b", regex=True))'
0    a1bab
1    a2bab
2    a3bab
3    a4bab
dtype: object

So I did get "1" wrapped into "a1b", but then "ab" is repeated once more?

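(Trying this regex on regex101.com, I noticed the same thing happens when the g flag is enabled; maybe pandas .str.replace enables it somehow? But according to the docs, the default for pandas .str.replace is flags=0?!)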

How can I "wrap" the entire contents of a column cell in the characters I want?

2 answers | latest 1 year ago

Andrej Kesely · 1 year ago

Change (.*) to (.+):

andrej@Andrej-PC:~/app$ python3 -c 'import pandas as pd; print(pd.Series(["1","2","3","4"]).str.replace("(.+)", r"a\1b", regex=True))'
0    a1b
1    a2b
2    a3b
3    a4b
dtype: object
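
To see why (.+) fixes it, the sketch below reproduces the behaviour with Python's own re module on a single string. It assumes Python 3.7 or later and that pandas hands the pattern to re.sub for object-dtype strings, which is my understanding rather than something stated in the thread. The variable names are only for illustration.

```python
import re

pattern_star = r"(.*)"
pattern_plus = r"(.+)"

# findall lists every match the engine finds in the string "1".
# (.*) matches the whole string and then an empty string at the end;
# (.+) needs at least one character, so it cannot match the empty tail.
star_matches = re.findall(pattern_star, "1")
plus_matches = re.findall(pattern_plus, "1")

# re.sub replaces every match by default (count=0), which is what the
# "g" flag does on regex101. The flags argument is unrelated to this.
wrapped_star = re.sub(pattern_star, r"a\1b", "1")
wrapped_plus = re.sub(pattern_plus, r"a\1b", "1")

print(star_matches, wrapped_star)  # ['1', ''] a1bab
print(plus_matches, wrapped_plus)  # ['1'] a1b
```

The extra "ab" is the replacement applied to that empty match, with an empty group 1. One difference to keep in mind: with (.+) an empty cell has no match at all and is left unwrapped.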

PaulS · 1 year ago

A possible solution:

import pandas as pd

s = pd.Series(range(1, 5))
# Cast the integers to strings, then concatenate the prefix and suffix
'a' + s.astype(str) + 'b'

Output:

0    a1b
1    a2b
2    a3b
3    a4b
dtype: object
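
For a Series that already holds strings, as in the question, a few variants are sketched below for comparison. This is a minimal sketch, not a recommendation from either answer: dtype=object is set explicitly to match the "dtype: object" output in the question, and the variable names are made up for the example.

```python
import pandas as pd

# dtype=object is an assumption made here to mirror the question's output.
s = pd.Series(["1", "2", "3", "4"], dtype=object)

# Anchoring the pattern leaves no room for a second, empty match.
anchored = s.str.replace(r"^(.*)$", r"a\1b", regex=True)

# n limits the number of replacements per string; the default n=-1 means all.
first_only = s.str.replace(r"(.*)", r"a\1b", n=1, regex=True)

# Plain concatenation needs no regex and no astype(str) on a string Series.
concatenated = "a" + s + "b"

print(anchored.tolist())
print(first_only.tolist())
print(concatenated.tolist())
```

All three should print ['a1b', 'a2b', 'a3b', 'a4b'] for this input. Unlike (.+), the anchored pattern and the concatenation also wrap an empty string, giving "ab".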
