随笔档案「2023年11月」 - michaelchengjl

不同解码策略

摘要：不同解码策略 https://www.cnblogs.com/miners/p/14950681.html https://blog.csdn.net/taoqick/article/details/123897960 https://zhuanlan.zhihu.com/p/442557114 阅读全文

posted @ 2023-11-22 16:02 michaelchengjl 阅读(23) 评论(0) 推荐(0)

huggingface下载的.arrow数据集读取与使用说明

摘要：huggingface下载的.arrow数据集读取与使用说明 from datasets import load_from_disk from datasets import load_dataset dataset_cnn = load_dataset("ccdv/cnn_dailymail", 阅读全文

posted @ 2023-11-21 16:58 michaelchengjl 阅读(1456) 评论(0) 推荐(1)

NLP QA数据集

摘要：NLP QA数据集数据文档背景描述 CNN/Daily Mail（简称CNN/DM）作为单文本摘要语料库，每篇摘要包含多个摘要句。数据集最初是从美国有限新闻网（CNN）和每日邮报网（Daily Mail）收集的约100万条新闻数据作为机器阅读理解语料库。后来进行简单改动，形成用于单文本生成式摘要阅读全文

posted @ 2023-11-21 11:08 michaelchengjl 阅读(341) 评论(0) 推荐(0)

大模型部署加速

摘要：大模型部署加速 https://zhuanlan.zhihu.com/p/659571962 https://github.com/internlm/lmdeploy https://github.com/InternLM/lmdeploy/blob/main/docs/zh_cn/kv_int8. 阅读全文

posted @ 2023-11-03 15:41 michaelchengjl 阅读(150) 评论(0) 推荐(0)

vLLM 部署大模型

摘要：vLLM 部署大模型 https://github.com/vllm-project/vllm/tree/v0.2.0 https://vllm.readthedocs.io/en/latest/getting_started/installation.html https://vllm.readt 阅读全文

posted @ 2023-11-03 15:30 michaelchengjl 阅读(1150) 评论(0) 推荐(0)

LLM推理优化

摘要：LLM推理优化 https://blog.csdn.net/LF_AI/article/details/133054474?spm=1001.2014.3001.5502 阅读全文

posted @ 2023-11-03 15:27 michaelchengjl 阅读(52) 评论(0) 推荐(0)

Error loading wikitext data raise NotImplementedError(f"Loading a dataset cached in a {type(self._fs).__name__} is not supported.")

摘要：Error loading wikitext data raise NotImplementedError(f"Loading a dataset cached in a {type(self._fs).name} is not supported.") QA I was trying to loa 阅读全文

posted @ 2023-11-01 17:09 michaelchengjl 阅读(553) 评论(0) 推荐(0)

michaelchengjl

11 2023 档案

公告