Summary:
Fusion of Detected Objects in Text for Visual Question Answering 2022-03-18 16:29:58 Paper: https://aclanthology.org/D19-1219/ Code: https://github.co
Summary:
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation 2022-03-18 10:04:06 Paper: https://proceedings.neurips.cc/pa