摘要: Abstract Task: Defense LLM from prompt injection attacks Tool: TaskTracker Methods: use activation deltas( the difference in activations before and af 阅读全文
posted @ 2024-12-13 15:58 雪溯 阅读(1) 评论(0) 推荐(0) 编辑