Stay Hungry,Stay Foolish!

摘要: CUDA Refresher: The CUDA Programming Model https://developer.nvidia.com/blog/cuda-refresher-cuda-programming-model/ To execute any CUDA program, there 阅读全文
posted @ 2024-07-20 20:28 lightsong 阅读(2) 评论(0) 推荐(0) 编辑
摘要: vLLM https://github.com/vllm-project/vllm https://docs.vllm.ai/en/latest/ 推理和服务,但是更加偏向推理。 vLLM is a fast and easy-to-use library for LLM inference and 阅读全文
posted @ 2024-07-20 12:27 lightsong 阅读(5) 评论(0) 推荐(0) 编辑
Life Is Short, We Need Ship To Travel