Model Evaluation

1. Confusion Matrix

Fact\Predict	Class A	Class B
Class A	True Positive	False Negative
Class B	False Positive	True Nagative

A confusion table for Class A

Positive/ Negative: if target class is A, then the predict A is Positve, Others are negative.

True (P/N): if Predict = Fact, then it's True.

2. Measures based on Confusion Matrix

a. Accuracy = TN+TP/ALL

　　comments: not good measure when data are unbalanced.

b. True Positive Rate/ recall/ sensitivity = TP / TP + FN

　　comments: use it when Positive results are important

c. True Negative Rate = TN / TN + FP

　　comments: use it when Negative Results are important

R for Confusion Matrix:

library(SDMTools)

confusion.matrix(svmmodel.truth,svmmodel.class)

3. ROC curve (bio-classification)

y: sensitivity

x: specificity

The bigger the Area of ROC is, the more accurate the model is.

4. Normalized Weighted Root Mean Squared Logarithmic Error

Submissions are evaluated on the Normalized Weighted Root Mean Squared Logarithmic Error (NWRMSLE), calculated as follows:

N W R M S L E = \sum n i = 1 w i ( ln ( y ^ i + 1 ) - ln ( y

where for row i, ${\hat{y}}_{i}$

The weights, $w_{i}$

This metric is suitable when predicting values across a large range of orders of magnitudes. It avoids penalizing large differences in prediction when both the predicted and the true number are large: predicting 5 when the true value is 50 is penalized more than predicting 500 when the true value is 545.

posted @ 2017-07-16 11:10 付小同阅读(597) 评论(0) 编辑收藏举报

会员力量，点亮园子希望

刷新页面返回顶部

Model Evaluation

1. Confusion Matrix

2. Measures based on Confusion Matrix

3. ROC curve (bio-classification)

4. Normalized Weighted Root Mean Squared Logarithmic Error

公告