CLIP
1、clip分析
原文:https://openai.com/research/multimodal-neurons 翻译:https://blog.csdn.net/tMb8Z9Vdm66wH68VX1/article/details/115314884
https://github.com/openai/CLIP-featurevis/issues/5
2、文本注意力可视化
https://zhuanlan.zhihu.com/p/357459957
https://github.com/haofanwang/natural-language-joint-query-search
3、 https://github.com/openai/CLIP/issues/108
4、CLIP可视化:
- 1、How does CLIP think without the prompts? · Issue #242 · openai/CLIP (github.com)
It sounds like you're attempting to maximize the image embeddings by searching text embeddings.
Researchers did that here: https://distill.pub/2021/multimodal-neurons/
But people were running into issues trying to figure out how to reproduce it: openai/CLIP-featurevis#2