Summary: `df = df.filter(["entryName", "classifyId"], axis=1)`; `df = df.drop('B', axis=1)`
posted @ 2021-10-12 14:46 hifalee Views (287) Comments (0) Recommendations (0)
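The two calls in this summary do opposite jobs: `filter` keeps a named set of columns, while `drop` removes one. A minimal sketch, assuming pandas is installed; the sample frame and its values are hypothetical, and only the column names come from the summary.

```python
import pandas as pd

# Hypothetical sample frame; only the column names come from the summary.
df = pd.DataFrame({
    "entryName": ["a", "b"],
    "classifyId": [1, 2],
    "B": [10, 20],
})

# Keep only the listed columns; names missing from the frame are skipped.
kept = df.filter(["entryName", "classifyId"], axis=1)

# Remove one column by name; raises KeyError if "B" does not exist.
dropped = df.drop("B", axis=1)

print(kept.columns.tolist())
print(dropped.columns.tolist())
```

On this sample both results hold the same two columns, which is why either call can be used to trim a frame depending on whether the keep list or the drop list is shorter.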
Summary: Write: `data = {'name': 'lin', 'email': 'xiaoqinglin2018@gmail.com'}`; `with open('json_note.json', 'w') as f:` `# writing JSON data` `json.dump(data, f,ensure_asc` …
posted @ 2021-08-30 14:45 hifalee Views (27) Comments (0) Recommendations (0)
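A short round-trip sketch of writing and reading JSON with the standard library. The summary is cut off mid-argument, so `ensure_ascii=False`, `indent=2`, the temporary path, and the sample data are all assumptions chosen for illustration, not the post's actual code.

```python
import json
import os
import tempfile

# Hypothetical data; the non-ASCII value shows what ensure_ascii=False changes.
data = {"name": "lin", "note": "中文"}

# Write into a temporary directory so the example leaves no files behind in the cwd.
path = os.path.join(tempfile.mkdtemp(), "json_note.json")

with open(path, "w", encoding="utf-8") as f:
    # ensure_ascii=False (an assumption) writes non-ASCII text as-is
    # instead of \uXXXX escapes.
    json.dump(data, f, ensure_ascii=False, indent=2)

with open(path, "r", encoding="utf-8") as f:
    loaded = json.load(f)

with open(path, "r", encoding="utf-8") as f:
    raw = f.read()

print(loaded == data)
```

Opening the file with an explicit `encoding="utf-8"` matters once `ensure_ascii=False` is used, because the platform default encoding may not be able to represent the text.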
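Summary: Export all installed packages: `pip freeze > requirements.txt`. Export only the packages the project depends on: `pip install pipreqs`, then `pipreqs ./`. Install: `pip install -r requirements.txt`
posted @ 2021-08-09 10:16 hifalee Views (34) Comments (0) Recommendations (0)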
Summary: `user_list = df['UserId'].value_counts()`; `user_list = user_list[user_list >= 5].index.tolist()`; `df = df[df['UserId'].isin(user_list)]`; `df.to_csv(w_f1,index=` …
posted @ 2021-01-28 15:44 hifalee Views (59) Comments (0) Recommendations (0)
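The pattern in this summary keeps only the rows of users who appear at least five times. A minimal sketch, assuming pandas is installed; the sample frame is hypothetical, and the final `to_csv` call is left out because its arguments are truncated in the summary.

```python
import pandas as pd

# Hypothetical sample: u1 has 5 rows, u2 has 2, u3 has 6.
df = pd.DataFrame({
    "UserId": ["u1"] * 5 + ["u2"] * 2 + ["u3"] * 6,
    "score": range(13),
})

# Count the rows per user, then keep the ids that occur at least 5 times.
counts = df["UserId"].value_counts()
active_users = counts[counts >= 5].index.tolist()

# Keep only the rows that belong to those users.
filtered = df[df["UserId"].isin(active_users)]

print(sorted(filtered["UserId"].unique()))
```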
Summary: `df = pd.read_csv(r_f1)`; `df['machine_Id'] = df['UserId']+df['knowledge_encoding']`; `df.rename(columns={"UserId": "old_UserId","machine_Id":"UserId"},inpla` …
posted @ 2021-01-27 16:30 hifalee Views (458) Comments (0) Recommendations (0)
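A small sketch of the same idea: build a combined id from two columns, then swap the column names in a single `rename`. It assumes pandas is installed and that both columns hold strings (with numeric columns, `+` would add the values instead of concatenating). The sample values are hypothetical, and the result is reassigned rather than guessing at the truncated final argument.

```python
import pandas as pd

# Hypothetical sample; both columns are strings so "+" concatenates.
df = pd.DataFrame({
    "UserId": ["u1", "u2"],
    "knowledge_encoding": ["k01", "k02"],
})

# Combine the two columns into a new id column.
df["machine_Id"] = df["UserId"] + df["knowledge_encoding"]

# Both renames are applied in one pass, so the old UserId column is kept
# under a new name while machine_Id takes over the name UserId.
df = df.rename(columns={"UserId": "old_UserId", "machine_Id": "UserId"})

print(df.columns.tolist())
```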
Summary: `import pandas as pd`; `# write`; `list = [[1, 2, 3], [4, 5, 6], [7, 9, 9]]`; `name = ['one', 'two', 'three']`; `test = pd.DataFrame(columns=name, data=list)` `# the data has three columns, column` …
posted @ 2020-11-26 15:56 hifalee Views (115) Comments (0) Recommendations (0)
Summary: Reading CSV: `with open("test.csv", "r", encoding="utf-8") as f:` `reader = csv.reader(f)` `rows = [row for row in reader]`. Writing CSV: `with open(file, 'w', new` …
posted @ 2020-11-26 11:27 hifalee Views (311) Comments (0) Recommendations (0)
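A round-trip sketch of writing and then reading a CSV file with the standard `csv` module. The write call in the summary is truncated, so `newline=""`, the temporary path, and the sample rows are assumptions used for illustration.

```python
import csv
import os
import tempfile

# Hypothetical rows; csv.reader returns every field as a string.
rows_out = [["name", "score"], ["lin", "90"], ["lee", "85"]]

path = os.path.join(tempfile.mkdtemp(), "test.csv")

# newline="" lets the csv module control line endings itself, which avoids
# blank lines between rows on Windows.
with open(path, "w", newline="", encoding="utf-8") as f:
    writer = csv.writer(f)
    writer.writerows(rows_out)

with open(path, "r", newline="", encoding="utf-8") as f:
    reader = csv.reader(f)
    rows = [row for row in reader]

print(rows == rows_out)
```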
Summary: `KFold(n_splits=5, shuffle=True, random_state=3)`
posted @ 2020-11-25 15:27 hifalee Views (73) Comments (0) Recommendations (0)
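A minimal sketch of how this `KFold` configuration is typically consumed, assuming scikit-learn and NumPy are installed. The ten-row array is hypothetical; only the constructor arguments come from the summary.

```python
import numpy as np
from sklearn.model_selection import KFold

# Hypothetical data: 10 samples with 2 features each.
X = np.arange(20).reshape(10, 2)

# shuffle=True randomizes which samples land in each fold;
# random_state=3 makes that shuffle reproducible.
kf = KFold(n_splits=5, shuffle=True, random_state=3)

# split() yields one (train indices, test indices) pair per fold.
folds = [(train_idx, test_idx) for train_idx, test_idx in kf.split(X)]

for train_idx, test_idx in folds:
    print(len(train_idx), len(test_idx))
```

Each sample appears in exactly one test fold, so with 10 samples and 5 splits every fold tests on 2 samples and trains on the other 8.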
Summary: Read a CSV file: `data = pd.read_csv(f1, nrows=20)`; `s = data.loc[:, :]`; `s = s.loc[:, ['machine_id', 'question_id', 'answer_right']]`; `s = s.values`; `s = s.tolist()`. Read …
posted @ 2020-11-25 15:12 hifalee Views (70) Comments (0) Recommendations (0)
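A compact sketch of the same read, select, and convert steps, assuming pandas is installed. An in-memory CSV stands in for the file `f1`, and its contents (including the `extra` column) are hypothetical; `nrows=2` is used here only to keep the sample small.

```python
import io

import pandas as pd

# Hypothetical CSV content standing in for the file read in the summary.
csv_text = (
    "machine_id,question_id,answer_right,extra\n"
    "m1,q1,1,x\n"
    "m2,q2,0,y\n"
    "m3,q3,1,z\n"
)

# nrows limits how many data rows are parsed (the header is not counted).
data = pd.read_csv(io.StringIO(csv_text), nrows=2)

# Select the three columns of interest and convert them to a list of rows.
records = data[["machine_id", "question_id", "answer_right"]].values.tolist()

print(len(records))
```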
Summary: Preprocessing class imbalance in data from an Excel file: `data = pd.read_excel("file")`; `list_label = []`; `train_list, dev_list, test_list = [], [], []`; `data_value = data.values`; `for i in range(len` …
posted @ 2020-11-25 15:06 hifalee Views (142) Comments (0) Recommendations (0)