代码之家 › 专栏 › 技术社区 › Tanner Clark

调试:数据帧列引用和索引[重复]

indexing for-loop dataframe python

Tanner Clark · 技术社区 · 6 年前

这个问题已经有了答案:

Deleting multiple columns based on column names in Pandas 8答

我想不出这个错误。我认为这是我对数据帧和索引的误解。另外,可能是对for循环的误解。(我习惯了matlab 对于循环…直观地说,迭代更容易:d)

错误如下:

KeyError: "['United States' 'Canada' 'Mexico'] not found in axis"

这发生在一行: as_df=as_df.drop(as_df[column])

但这毫无意义…我调用的是一个单独的列,而不是整个虚拟变量集。

可以复制并运行以下代码。我确定了。

我的代码:

import pandas as pd
import numpy as np
df=pd.DataFrame({"country": ['United States','Canada','Mexico'], "price": [23,32,21], "points": [3,4,4.5]})
df=df[['country','price','points']]
df2=df[['country']]
features=df2.columns
print(features)
target='points'

#------_-__-___---____________________
as_df=pd.concat([df[features],df[target]],axis=1)
#Now for Column Check
for column in as_df[features]:
    col=as_df[[column]]
    #Categorical Data Conversion
#This will split the countries into their own column with 1 being when it 
#is true and 0 being when it is false
    col.select_dtypes(include='object')
    dummies=pd.get_dummies(col)
    #ML Check:
    dumcols=dummies.drop(dummies.columns[1],axis=1)
    if dumcols.shape[1] > 1:
        print(column)
        as_df=as_df.drop(as_df[column])
    else:
        dummydf=col
as_df=pd.concat([as_df,dummydf],axis=1)
as_df.head()

2 回复 | 直到 6 年前

Will Lyles 6 年前

我会评论而不是回答,但我没有足够的声誉来这样做。(我需要澄清以帮助您,而Stack Exchange并不能“适当”地为我提供这样做的方法。)

我不完全确定你的最终目标是什么。你能解释一下你的最终结果是什么样的吗?包括在for循环结束后,以及整个代码完成运行之后?

Tanner Clark 6 年前

发现了我的错误。

as_df=as_df.drop(as_df[column])

应该是

as_df=as_df.drop(column,axis=1)

推荐文章

Dave · 如何在for循环中修改列表值

5 月前

Haru Hoshizora · 为什么一个整数的位置没有改变,但值却不同

6 月前

BlurKid · R中for循环时结果的奇怪差异

6 月前

Rudraksh_pd · 取炭。通过char。c中创建字符串的输入

6 月前

Mtullis · 在我的表单值中循环遍历数组[重复]

6 月前

puboot · 我的for循环没有运行,我不知道为什么。它甚至不会在控制台上打印任何内容

10 月前

Justin Hawkins · 在一个数组中返回两个数组,其中包含带特定字母的名称和不带指定字母的名称

10 月前

leiseg · 使用扩展变量的for循环中PowerShell CLI中缺少终止符

10 月前

xhamsterIT · 循环VBA Microsoft Excel

11 月前

AlexC · 如何循环dplyr group_by并总结变量列表的语句

11 月前