I recently used LightGBM's Dataset, so here are some notes:
1. Description:
class lightgbm.Dataset(data, label=None, reference=None, weight=None, group=None, init_score=None, silent=False, feature_name='auto', categorical_feature='auto', params=None, free_raw_data=True)
Bases: object
Dataset in LightGBM.
Construct Dataset.
Parameters:
- data (string, numpy array, pandas DataFrame, scipy.sparse or list of numpy arrays) – Data source of Dataset. If string, it represents the path to txt file.
- label (list, numpy 1-D array, pandas one-column DataFrame/Series or None, optional (default=None)) – Label of the data.
- reference (Dataset or None, optional (default=None)) – If this is Dataset for validation, training data should be used as reference.
- weight (list, numpy 1-D array, pandas Series or None, optional (default=None)) – Weight for each instance.
- group (list, numpy 1-D array, pandas Series or None, optional (default=None)) – Group/query size for Dataset.
- init_score (list, numpy 1-D array, pandas Series or None, optional (default=None)) – Init score for Dataset.
- silent (bool, optional (default=False)) – Whether to print messages during construction.
- feature_name (list of strings or 'auto', optional (default="auto")) – Feature names. If ‘auto’ and data is pandas DataFrame, data columns names are used.
- categorical_feature (list of strings or int, or 'auto', optional (default="auto")) – Categorical features. If list of int, interpreted as indices. If list of strings, interpreted as feature names (need to specify feature_name as well). If ‘auto’ and data is pandas DataFrame, pandas categorical columns are used. All values in categorical features should be less than int32 max value (2147483647). All negative values in categorical features will be treated as missing values.
- params (dict or None, optional (default=None)) – Other parameters.
- free_raw_data (bool, optional (default=True)) – If True, raw data is freed after constructing inner Dataset.
The result is a Dataset object.
2. Usage:
Following the description above, pass in your own data. In my case both data and label were pandas DataFrames.