Class imbalance in classification problems
1. Undersampling
- Random undersampling
- EasyEnsemble
- BalanceCascade
Exploratory Undersampling for Class-Imbalance Learning
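
The undersampling ideas above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the algorithm from the paper: `random_undersample` and `easy_ensemble_subsets` are hypothetical helper names, and the second function only shows the EasyEnsemble-style step of drawing several independent balanced subsets. Training one classifier per subset and combining them is left out, as is BalanceCascade's step of discarding majority samples that are already classified correctly.

```python
import random
from collections import Counter


def random_undersample(X, y, seed=0):
    """Randomly drop samples so every class is as small as the rarest class."""
    rng = random.Random(seed)
    by_class = {}
    for idx, label in enumerate(y):
        by_class.setdefault(label, []).append(idx)
    n_min = min(len(idxs) for idxs in by_class.values())
    keep = []
    for idxs in by_class.values():
        # Sampling without replacement; the rarest class is kept in full.
        keep.extend(rng.sample(idxs, n_min))
    keep.sort()
    return [X[i] for i in keep], [y[i] for i in keep]


def easy_ensemble_subsets(X, y, n_subsets=3, seed=0):
    """Draw several independent balanced subsets (one per ensemble member)."""
    return [random_undersample(X, y, seed=seed + k) for k in range(n_subsets)]


# Toy data: 10 majority samples (label 0) and 2 minority samples (label 1).
X = [[float(i)] for i in range(12)]
y = [0] * 10 + [1] * 2

X_bal, y_bal = random_undersample(X, y)
print(Counter(y_bal))  # both classes now have 2 samples
```

Because each subset discards a different part of the majority class, using several subsets recovers some of the information that a single random undersample throws away.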
2. Oversampling
- Random oversampling
- SMOTE
- Borderline-SMOTE
SMOTE: Synthetic Minority Over-sampling Technique
Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning
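
As a rough sketch of the oversampling methods listed above: random oversampling duplicates existing minority samples, while SMOTE creates new points by interpolating between a minority sample and one of its nearest minority neighbours. The function names and the parameters `n_new`, `k` and `seed` are assumptions made for this example, and the SMOTE version is simplified (it picks base samples at random instead of generating a fixed number per sample). Borderline-SMOTE would apply the same interpolation only to minority samples that lie near the class boundary, which is not shown here.

```python
import math
import random


def random_oversample(minority, n_new, seed=0):
    """Random oversampling: duplicate minority samples drawn with replacement."""
    rng = random.Random(seed)
    return [list(rng.choice(minority)) for _ in range(n_new)]


def smote(minority, n_new, k=3, seed=0):
    """Simplified SMOTE: interpolate between a sample and a near neighbour."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [j for j in range(len(minority)) if j != i]
        others.sort(key=lambda j: math.dist(base, minority[j]))
        neighbour = minority[rng.choice(others[:k])]
        # The new point lies on the segment between base and neighbour.
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


minority = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
for point in smote(minority, n_new=3):
    print(point)
```

Unlike plain duplication, the interpolated points are not exact copies, which tends to reduce overfitting to the few minority samples available.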
3. Cost-sensitive learning
- AdaCost
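
To make the idea of cost-sensitive learning concrete, here is a small sketch of a cost-based decision rule. It is not AdaCost (which folds misclassification costs into the boosting weight update); it only shows the simpler principle that unequal error costs shift the decision threshold. The function name and the cost values `cost_fp` and `cost_fn` are assumptions chosen for illustration.

```python
def cost_sensitive_predict(prob_pos, cost_fp=1.0, cost_fn=5.0):
    """Predict 1 when the expected cost of predicting 0 exceeds that of predicting 1.

    With p = P(y = 1 | x), predicting 0 costs p * cost_fn on average and
    predicting 1 costs (1 - p) * cost_fp, so the break-even threshold is
    cost_fp / (cost_fp + cost_fn).
    """
    threshold = cost_fp / (cost_fp + cost_fn)
    return [1 if p > threshold else 0 for p in prob_pos]


probs = [0.05, 0.2, 0.4, 0.7]
print(cost_sensitive_predict(probs))                # [0, 1, 1, 1]
print(cost_sensitive_predict(probs, cost_fn=1.0))   # [0, 0, 0, 1]
```

When missing a minority (positive) sample is five times as costly as a false alarm, the threshold drops from 0.5 to about 0.17, so more samples are flagged as positive.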
4. Evaluation metrics
- F1-score
- G-Mean
- roc_auc_score
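
The three metrics above can be computed by hand for the binary case, which helps show why they suit imbalanced data better than plain accuracy. The helper names below are assumptions for this sketch; `roc_auc_score` in the list presumably refers to the scikit-learn function of that name, and `roc_auc` here is a small pure-Python stand-in based on the pairwise-ranking definition of AUC.

```python
import math


def confusion(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels, with 1 as the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def f1(y_true, y_pred):
    """Harmonic mean of precision and recall."""
    tp, fp, fn, _ = confusion(y_true, y_pred)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def g_mean(y_true, y_pred):
    """Geometric mean of sensitivity (recall) and specificity."""
    tp, fp, fn, tn = confusion(y_true, y_pred)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return math.sqrt(sensitivity * specificity)


def roc_auc(y_true, scores):
    """Probability that a random positive is scored above a random negative."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


y_true = [0, 0, 0, 0, 1, 1]
scores = [0.1, 0.2, 0.3, 0.6, 0.4, 0.9]
y_pred = [1 if s >= 0.5 else 0 for s in scores]

print(f1(y_true, y_pred))                # 0.5
print(round(g_mean(y_true, y_pred), 4))  # 0.6124
print(roc_auc(y_true, scores))           # 0.875
```

F1 and G-Mean depend on the chosen threshold, while ROC AUC is computed from the raw scores and summarizes ranking quality across all thresholds.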