June 2019 Archive

Summary: How to put more weight on certain features in machine learning? First of all, you should probably not do it. The whole concept of machine learning is Read more
posted @ 2019-06-24 09:17 andy_0212 Views (201) Comments (0) Recommendations (0)
Summary: file_names = fnmatch.filter(os.listdir(folder), '*.jpeg') # take care of folder sequence, 1.jpeg, 2.jpeg, 3.jpeg coll.sort(key=lambda x:int(x[:-5])) o Read more
posted @ 2019-06-18 16:13 andy_0212 Views (394) Comments (0) Recommendations (0)
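The excerpt above is cut off, so what follows is only a minimal sketch of the idea it shows: list the `*.jpeg` files in a folder and order them by the number in the file name rather than lexicographically. The excerpt filters into `file_names` but sorts `coll`; this sketch assumes both refer to the same list. The function name `list_jpegs_in_numeric_order` and the demo folder are hypothetical, not from the post.

```python
import fnmatch
import os
import tempfile


def list_jpegs_in_numeric_order(folder):
    """Return the '*.jpeg' file names in `folder`, ordered by numeric stem.

    Hypothetical helper: assumes every matching name is '<integer>.jpeg'.
    """
    file_names = fnmatch.filter(os.listdir(folder), '*.jpeg')
    # A plain sort is lexicographic ('10.jpeg' sorts before '2.jpeg'),
    # so sort on the integer in front of the extension instead.
    file_names.sort(key=lambda name: int(os.path.splitext(name)[0]))
    return file_names


# Small demonstration in a throwaway directory.
with tempfile.TemporaryDirectory() as demo_folder:
    for stem in (10, 2, 1):
        open(os.path.join(demo_folder, f"{stem}.jpeg"), "w").close()
    ordered = list_jpegs_in_numeric_order(demo_folder)

print(ordered)  # ['1.jpeg', '2.jpeg', '10.jpeg']
```

Using `os.path.splitext` instead of the slice `x[:-5]` is a design choice of this sketch: it behaves the same for `.jpeg` and does not depend on the extension being exactly five characters long.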
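Summary: I have recently been studying similarity between samples in order to do clustering better. Related material follows: Applications of TF-IDF and Cosine Similarity (1): Automatic Keyword Extraction; Applications of TF-IDF and Cosine Similarity (2): Finding Similar Articles; Summary of Similarity (Distance) Measures; Principles and Implementations of Common Similarity Measures; Similarity Measures in Machine Learning; Large-scale short-text clust Read more
posted @ 2019-06-11 10:38 andy_0212 Views (848) Comments (0) Recommendations (0)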
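The reading list above centers on TF-IDF and cosine similarity as a way to compare samples before clustering. As a rough illustration, and not code from any of the linked articles, here is a small standard-library sketch. The helper names, the whitespace tokenization, the unsmoothed `log(N / df)` weighting, and the three toy documents are all assumptions made for this example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Map each tokenized, non-empty document to a {term: tf-idf} dict.

    Uses tf = count / document length and idf = log(N / document frequency),
    one simple variant among several in common use.
    """
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero vector is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Toy documents, tokenized on whitespace.
texts = [
    "machine learning clustering",
    "text clustering with tf idf",
    "cooking pasta at home",
]
vectors = tfidf_vectors([text.split() for text in texts])

sim_01 = cosine_similarity(vectors[0], vectors[1])  # share "clustering"
sim_02 = cosine_similarity(vectors[0], vectors[2])  # share no terms
print(sim_01 > sim_02, sim_02)
```

A clustering step could then use `1 - cosine_similarity(...)` as a distance between samples, though which clustering method the post settled on is not stated in the excerpt.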
Summary: What is the fastest way to upload a big CSV file in a notebook to work with Python pandas? Read more
posted @ 2019-06-05 15:38 andy_0212 Views (529) Comments (0) Recommendations (0)
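The excerpt only poses the question, so the post's own answer is not shown here. As one commonly used approach, offered as an assumption and not as the post's recommendation, `pandas.read_csv` can be told to read only the needed columns, use compact dtypes, and stream the file in chunks. The inline CSV text and column names below are made up for illustration; a real notebook would pass a file path instead of `io.StringIO`.

```python
import io

import pandas as pd

# Stand-in for a large file on disk (hypothetical columns).
csv_text = "id,price,note\n1,9.5,a\n2,3.0,b\n3,7.25,c\n"

chunks = pd.read_csv(
    io.StringIO(csv_text),
    usecols=["id", "price"],                    # skip columns that are not needed
    dtype={"id": "int32", "price": "float32"},  # smaller types use less memory
    chunksize=2,                                # yield DataFrames of up to 2 rows
)

# Each chunk could be filtered or aggregated here before combining.
frame = pd.concat(chunks, ignore_index=True)
print(frame.shape)  # (3, 2)
```

Whether this is actually the fastest option depends on the file and the machine, so it is worth timing against a plain `pd.read_csv(path)` call on the real data.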