Post archive for October 21, 2024 - marti88414

October 21, 2024

[Paper Reading Notes] An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Summary: FastV is a plug-and-play inference acceleration method for large vision-language models that rely on visual tokens. It can reach a 45% theoretical FLOPs reduction without harming performance by pruning redundant visual tokens in deep layers. (From the official repo.)
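As a rough illustration of what "pruning redundant visual tokens" could look like, here is a minimal sketch. The summary does not say how redundancy is measured, so the sketch assumes each token already has an importance score (for example, the average attention it receives at some layer). The function name `select_tokens_to_keep`, the `scores` list, and the `keep_ratio` parameter are hypothetical and are not taken from the official repo.

```python
import math


def select_tokens_to_keep(scores, visual_positions, keep_ratio=0.5):
    """Return the sorted positions of tokens that survive pruning.

    scores: one importance score per token in the sequence (assumed input).
    visual_positions: positions of the visual tokens within the sequence.
    keep_ratio: fraction of visual tokens to keep (hypothetical parameter).
    Non-visual tokens (system prompt, text) are always kept.
    """
    visual = set(visual_positions)
    n_keep = math.ceil(len(visual) * keep_ratio)
    # Rank visual tokens from highest to lowest score and keep the top ones.
    ranked = sorted(visual, key=lambda i: scores[i], reverse=True)
    kept_visual = set(ranked[:n_keep])
    # Preserve the original token order in the output.
    return [i for i in range(len(scores)) if i not in visual or i in kept_visual]


# Toy sequence: positions 1-4 are visual tokens, 0 and 5 are text tokens.
scores = [0.30, 0.02, 0.10, 0.01, 0.07, 0.50]
kept = select_tokens_to_keep(scores, visual_positions=[1, 2, 3, 4])
print(kept)  # [0, 2, 4, 5]
```

In a real model the later layers would then run only on the kept positions, which is where the compute saving would come from. The 45% figure quoted above is the repo's own claim and is not derived from this sketch.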

Posted 2024-10-21 15:47 by marti88414
