cup_leo

2021年11月29日

摘要： Spark与Pandas中DataFrame对比（详细） Pandas Spark 工作方式单机single machine tool，没有并行机制parallelism不支持Hadoop，处理大量数据有瓶颈分布式并行计算框架，内建并行机制parallelism，所有的数据和操作自动并行分布在各阅读全文

posted @ 2021-11-29 09:46 cup_leo 阅读(183) 评论(0) 推荐(0) 编辑

2021年11月26日

Python中操作SQLAlchemy,SQLAlchemy中文技术文档 mysql

摘要： Python中操作SQLAlchemy,SQLAlchemy中文技术文档 - 简书 (jianshu.com) 阅读全文

posted @ 2021-11-26 13:55 cup_leo 阅读(100) 评论(0) 推荐(0) 编辑

文本摘要 Rouge 指标计算 python

摘要： #Rouge gram-1,2,4,SU4 def Rouge_1(pred, true): #结巴精准分词 gram_1_true = list(jieba.cut(true)) gram_1_pred = list(jieba.cut(pred)) #统计参考摘要长度、预测摘要长度、预测正确的摘阅读全文

posted @ 2021-11-26 12:46 cup_leo 阅读(744) 评论(0) 推荐(0) 编辑

2021年11月25日

python 求众数含重复的众数

摘要： def mode(arr): dict_cnt = {} for x in arr: dict_cnt[x] = dict_cnt.get(x,0)+1 max_cnt = max(dict_cnt.values()) most_values = [k for k,v in dict_cnt.ite 阅读全文

posted @ 2021-11-25 11:37 cup_leo 阅读(500) 评论(0) 推荐(0) 编辑

2021年11月23日

比较实用的计算短文本相似度的算法

摘要：现在工业界有哪些比较实用的计算短文本相似度的算法或者模型？ https://www.zhihu.com/question/342548427/answer/1790309097 https://blog.csdn.net/ling620/article/details/95956211 阅读全文

posted @ 2021-11-23 11:04 cup_leo 阅读(233) 评论(0) 推荐(0) 编辑

2021年11月19日

hive web工具

摘要： yanagishima/yanagishima: Web UI for Trino, Hive and SparkSQL (github.com) Yanagishima is an open-source Web application for Trino, Hive and Spark. 阅读全文

posted @ 2021-11-19 15:01 cup_leo 阅读(82) 评论(0) 推荐(0) 编辑

2021年11月18日

spark 参数调优建议

摘要： spark参数调优阅读全文

posted @ 2021-11-18 14:17 cup_leo 阅读(8) 评论(0) 推荐(0) 编辑

2021年11月12日

图谱实战参考

摘要：基于医疗知识图谱的自动问答从无到有搭建一个以疾病为中心的一定规模医药领域知识图谱，并以该知识图谱完成自动问答与分析服务。 self-implement of disease centered Medical graph from zero to full and sever as question 阅读全文

posted @ 2021-11-12 17:31 cup_leo 阅读(22) 评论(0) 推荐(0) 编辑

三元组抽取调研

摘要： 1、依存关系法：基于依存句法分析的事实三元组抽取如何抽取实体关系？——基于依存句法分析的事实三元组抽取 - 编程、生活、技巧、认知、理财 (guanngxu.com) 论文:基于依存分析的开放式中文实体关系抽取方法 - 豆丁网 (docin.com) 论文 code：fact_triple_ex 阅读全文

posted @ 2021-11-12 13:46 cup_leo 阅读(185) 评论(0) 推荐(0) 编辑

中文知识图谱-基于规则的关系抽取-领域词抽取

摘要：中文知识图谱-领域词抽取 - 知乎 (zhihu.com) 中文知识图谱-基于规则的关系抽取中文知识图谱-基于规则的关系抽取 - 知乎 (zhihu.com) 阅读全文

posted @ 2021-11-12 13:34 cup_leo 阅读(153) 评论(0) 推荐(0) 编辑

公告