算法工程漫谈

机器学习算法

  • Classification 分类:性别分类、年龄分类

  • Clustering 聚类:常用地址聚类

  • Regression 回归:销量预测,续航里程预测

  • Collaborative Filtering 协同过滤(User-based, Item-based):个性化推荐

流程

pipeline相关
https://spark.apache.org/docs/latest/ml-pipeline.html

pmml相关
https://spark.apache.org/docs/latest/mllib-pmml-model-export.html

示例代码

分类

https://github.com/dmlc/xgboost/blob/master/jvm-packages/xgboost4j-example/src/main/scala/ml/dmlc/xgboost4j/scala/example/spark/SparkMLlibPipeline.scala

回归

https://docs.databricks.com/_static/notebooks/xgboost-regression.html

posted @ 2022-05-31 12:39  匠人先生  阅读(53)  评论(0编辑  收藏  举报