量化预测质量之分类报告 sklearn.metrics.classification_report

classification_report的调用为：classification_report(y_true, y_pred, labels=None, target_names=None, sample_weight=None, digits=2, output_dict=False)

y_true : 真实值
y_pred : 预测值

from sklearn.metrics import classification_report

truey = np.array([0,0,1,1,0,0])
prey = np.array([1,0,1,0,0,0])
print(classification_report(truey,prey,target_names=['zhen','jia']))

1）fraction of true positives/false positive/false negative/true negative

True Positive （真正, TP）被模型预测为正的正样本；

True Negative（真负 , TN）被模型预测为负的负样本；

False Positive （假正, FP）被模型预测为正的负样本；

False Negative（假负 , FN）被模型预测为负的正样本；

2）precision/recall，准确率和召回率

系统检索到的相关文档（Ａ）

系统检索到的不相关文档（Ｂ）

相关但是系统没有检索到的文档（Ｃ）

不相关但是被系统检索到的文档（Ｄ）

召回率Ｒ：R=A/(A+C)

精度Ｐ： P=A/(A+B).

3）F1-score

F1分数可以看作是模型准确率和召回率的一种加权平均，它的最大值是1，最小值是0。

参考：https://scikit-learn.org/stable/modules/generated/sklearn.metrics.classification_report.html

posted @ 2019-04-01 10:48 西瓜草莓甘蔗阅读(2266) 评论(1) 编辑收藏举报

刷新页面返回顶部

西瓜草莓甘蔗

量化预测质量之分类报告 sklearn.metrics.classification_report

公告