代码之家 › 专栏 › 技术社区 › stone rock

值错误:使用sklearn roc_auc_score函数不支持多类多输出格式

logistic-regression scikit-learn pandas python

stone rock · 技术社区 · 7 年前

我正在使用 logistic regression 用于预测。我的预测是 0's 和 1's . 在对我的模型进行给定数据的培训之后,以及在对重要特性(即 X_important_train 请参见屏幕截图。我的得分在70%左右,但当我使用 roc_auc_score(X,y) 或 roc_auc_score(X_important_train, y_train) 我正在获取值错误: ValueError: multiclass-multioutput format is not supported

代码:

# Load libraries
from sklearn.linear_model import LogisticRegression
from sklearn import datasets
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import roc_auc_score

# Standarize features
scaler = StandardScaler()
X_std = scaler.fit_transform(X)

# Train the model using the training sets and check score
model.fit(X, y)
model.score(X, y)

model.fit(X_important_train, y_train)
model.score(X_important_train, y_train)

roc_auc_score(X_important_train, y_train)

截图:

1 回复 | 直到 7 年前

seralouk 7 年前

首先, roc_auc_score 函数需要具有相同形状的输入参数。

sklearn.metrics.roc_auc_score(y_true, y_score, average=âmacroâ, sample_weight=None)

Note: this implementation is restricted to the binary classification task or multilabel classification task in label indicator format.

y_true : array, shape = [n_samples] or [n_samples, n_classes]
True binary labels in binary label indicators.

y_score : array, shape = [n_samples] or [n_samples, n_classes]
Target scores, can either be probability estimates of the positive class, confidence values, or non-thresholded measure of decisions (as returned by âdecision_functionâ on some classifiers).

现在,输入的是真实的和预测的分数,而不是培训和标签数据,正如您在发布的示例中使用的那样。 更详细地说,

model.fit(X_important_train, y_train)
model.score(X_important_train, y_train)
# this is wrong here
roc_auc_score(X_important_train, y_train)

你应该这样做:

y_pred = model.predict(X_test_data)
roc_auc_score(y_true, y_pred)

推荐文章

Bushra Jabeen · 计算列中的互信息

3 年前

rkraaijveld · sklearn的Coef。线性回归为无

3 年前

Sherwin R · 随机森林预测错误的输出形状

3 年前

Trinh Hieu · 我想在100%中随机训练60%,剩下的40%在混乱矩阵中测试

3 年前

Gijo george · 如何识别段落中每个句子的情绪

3 年前

Test · 安装Scikit Learn Big Sur M1

3 年前

kukelia · 在自定义转换器内创建新数据帧时,SKlearn管道无法工作

3 年前

Arnoldas Maslovskis · 当需要1d数组时,传递了列向量y。请将y的形状更改为(n_samples),例如使用ravel()

3 年前

Rich · 我可以简化零系数的Lasso Lars运行时吗?

3 年前

Medo · 是否可以将3D图像转换为一个矢量?

7 年前