代码之家 › 专栏 › 技术社区 › Tape

convert for loop在python中应用函数以减少运行时间

for-loop dataframe function python

Tape · 技术社区 · 6 年前

for i in range(0,len(data_sim.index)):
    for j in range(1,len(data_sim.columns)):
        user = data_sim.index[i]
        activity = data_sim.columns[j]

        if dt_full.loc[i][j] != 0: 
            data_sim.loc[i][j] = 0
        else:
            activity_top_names = data_neighbours.loc[activity][1:dt_length]
            activity_top_sims = data_corr.loc[activity].sort_values(ascending=False)[1:dt_length]
            user_purchases = data_activity.loc[user,activity_top_names]

            data_sim.loc[i][j] = getScore(user_purchases,activity_top_sims)

在for-loop中,data\u-sim如下所示:

CustomerId     A      B      C     D      E
   1          NAs   NAs    NAs   NAs    NAs
   2           ..

我尝试在apply函数中重现相同的过程,如下所示:

def test(cell):

    user = cell.index
    activity = cell

    activity_top_names = data_neighbours.loc[activity][1:dt_length]
    activity_top_sims = data_corr.loc[activity].sort_values(ascending=False)[1:dt_length]
    user_purchase = data_activity_index.loc[user, activity_top_names]

    if dt_full.loc[user][activity] != 0:
        return cell.replace(cell, 0)

    else:
        re = getScore(user_purchase, activity_top_sims) 
        return cell.replace(cell, re)

在函数中,data_sim2如下所示,我将“CustomerId”列设置为index column,并将活动名称复制到每个活动列。

CustomerId(Index)     A      B      C     D      E
   1                  A      B      C     D      E
   2                  A      B      C     D      E

在函数“def test(cell)”内部,如果单元格位于数据\u sim2[1][0]中,

cell.index = 1  # userId
cell            # activity name

这个for循环的整体思想是根据每个单元格的位置将评分数据放入“data_sim”表中。我在创建函数时使用了相同的思想,在每个单元格中使用相同的计算,然后将其应用到数据表“data_sim”,

data_test = data_sim2.apply(lambda x: test(x))

它给了我一个错误说

"sort_values() missing 1 required positional argument: 'by'"

这很奇怪,因为这个问题不是在for循环中发生的。听起来像是“数据”_校正位置[activity]'仍然是一个序列的一个数据帧。

0 回复 | 直到 6 年前

推荐文章

Dave · 如何在for循环中修改列表值

4 月前

Haru Hoshizora · 为什么一个整数的位置没有改变,但值却不同

5 月前

BlurKid · R中for循环时结果的奇怪差异

5 月前

Rudraksh_pd · 取炭。通过char。c中创建字符串的输入

5 月前

Mtullis · 在我的表单值中循环遍历数组[重复]

5 月前

puboot · 我的for循环没有运行,我不知道为什么。它甚至不会在控制台上打印任何内容

9 月前

Justin Hawkins · 在一个数组中返回两个数组,其中包含带特定字母的名称和不带指定字母的名称

9 月前

leiseg · 使用扩展变量的for循环中PowerShell CLI中缺少终止符

9 月前

xhamsterIT · 循环VBA Microsoft Excel

10 月前

AlexC · 如何循环dplyr group_by并总结变量列表的语句

10 月前