摘要: ## LLM inference workflow **Generative Inference**. A typical LLM generative inference task consists of two stages: i) the prefill stage which takes a 阅读全文
posted @ 2023-08-23 18:57 鸽鸽的书房 阅读(112) 评论(0) 推荐(0) 编辑