Summary: Abstract. Background: adversarial images or prompts can jailbreak multimodal large language models (MLLMs) and cause unaligned behaviors. This paper reports that, in a multi-agent + MLLM environment …
posted @ 2025-02-04 19:02 雪溯