Loading

摘要: FastV is a plug-and-play inference acceleration method for large vision language models relying on visual tokens. It could reach 45% theoretical FLOPs reduction without harming the performance through pruning redundant visual tokens in deep layers. -- from official repo 阅读全文
posted @ 2024-10-21 15:47 marti88414 阅读(4) 评论(0) 推荐(0) 编辑