摘要:
ActBERT: Learning Global-Local Video-Text Representations 2022-03-17 16:41:43 Paper: http://openaccess.thecvf.com/content_CVPR_2020/papers/Zhu_ActBERT 阅读全文
摘要:
12-in-1: Multi-Task Vision and Language Representation Learning 2022-03-17 09:45:41 Paper: https://openaccess.thecvf.com/content_CVPR_2020/papers/Lu_1 阅读全文