
Summary: import pandas as pd import numpy as np import matplotlib.pyplot as plt # Prepare the data data = pd.read_csv("./digit recognizor.csv") x = data.iloc[:,1:] # feature matrix Read more
posted @ 2023-04-08 21:08 ThankCAT Views (22) Comments (0) Recommendations (0)
Summary: import pandas as pd data = pd.read_csv("./digit recognizor.csv") x = data.iloc[:,1:] y = data.iloc[:,0] x.shape (42000, 784) Variance filtering VarianceThreshold fr Read more
posted @ 2023-04-08 21:07 ThankCAT Views (27) Comments (0) Recommendations (0)
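The summary above is cut off just as it reaches variance filtering. As a rough sketch of what `VarianceThreshold` does, here is a small example on a made-up array rather than the digit data. The array `X` and the choice of `threshold=0.0` are assumptions for illustration, not taken from the post.

```python
import numpy as np
from sklearn.feature_selection import VarianceThreshold

# Hypothetical toy feature matrix (not the digit data):
# columns 0 and 3 are constant, so their variance is 0.
X = np.array([
    [0, 2, 0, 3],
    [0, 1, 4, 3],
    [0, 1, 1, 3],
])

# threshold=0.0 (the default) drops features whose variance is zero.
selector = VarianceThreshold(threshold=0.0)
X_filtered = selector.fit_transform(X)

# Boolean mask marking which of the original columns were kept.
kept = selector.get_support()

print(X_filtered.shape)
print(kept)
```

On a 784-column pixel matrix like the one in the summary, the same call would presumably remove pixels that never change across samples, but how many columns that is depends on the data.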
Summary: Binarization and binning sklearn.preprocessing.Binarizer from sklearn.preprocessing import Binarizer import pandas as pd data = pd.read_csv("./data_full", index_col=0) Read more
posted @ 2023-04-08 21:07 ThankCAT Views (10) Comments (0) Recommendations (0)
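The summary names `sklearn.preprocessing.Binarizer` but stops before using it. A minimal sketch of the transformer follows, assuming a hypothetical single numeric column and an arbitrary cut-off of 30. Neither comes from `./data_full`.

```python
import numpy as np
from sklearn.preprocessing import Binarizer

# Hypothetical numeric column, shaped (n_samples, 1) as scikit-learn expects.
ages = np.array([[22.0], [38.0], [26.0], [35.0]])

# Values strictly greater than the threshold become 1, all others become 0.
binarizer = Binarizer(threshold=30)
age_flags = binarizer.fit_transform(ages)

print(age_flags.ravel())
```

`Binarizer` works on 2-D input, so a single pandas column would need to be passed as a one-column frame or reshaped with `.reshape(-1, 1)` first.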
Summary: Handling missing values import pandas as pd import numpy as np df = pd.read_csv("./Narrativedata.csv", index_col=0) df.info() <class 'pandas.core.frame.DataFrame'> Int6 Read more
posted @ 2023-04-08 21:06 ThankCAT Views (34) Comments (0) Recommendations (0)
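The summary ends before showing how the missing values are handled, so the post's actual method is not visible here. One common option in scikit-learn is `SimpleImputer`. The sketch below uses it on a made-up frame; the column name `Age`, its values, and the median strategy are all assumptions for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer

# Hypothetical frame with one missing entry (not Narrativedata.csv).
df = pd.DataFrame({"Age": [22.0, np.nan, 26.0, 35.0]})

# Count missing values per column before imputing.
missing_before = int(df["Age"].isna().sum())

# Replace NaN with the median of the observed values in the column.
imputer = SimpleImputer(strategy="median")
df["Age"] = imputer.fit_transform(df[["Age"]]).ravel()

print(missing_before)
print(df["Age"].tolist())
```

For categorical columns, `strategy="most_frequent"` or `strategy="constant"` with a `fill_value` may be a better fit than the median.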