03 2022 档案

VL-BERT: PRE-TRAINING OF GENERIC VISUALLINGUISTIC REPRESENTATIONS

摘要：VL-BERT: PRE-TRAINING OF GENERIC VISUALLINGUISTIC REPRESENTATIONS 2022-03-30 20:35:13 Paper: https://openreview.net/forum?id=SygXPaEYvH Code: https:// 阅读全文

posted @ 2022-03-30 20:37 AHU-WangXiao 阅读(75) 评论(0) 推荐(0) 编辑

Unicoder-VL: A Universal Encoder for Vision and Language by Cross-Modal Pre-Training

摘要：Unicoder-VL: A Universal Encoder for Vision and Language by Cross-Modal Pre-Training 2022-03-22 14:22:12 Paper: https://ojs.aaai.org/index.php/AAAI/ar 阅读全文

posted @ 2022-03-22 14:23 AHU-WangXiao 阅读(345) 评论(0) 推荐(0) 编辑

U-ViusalBERT --- Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions

摘要：Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions 2022-03-20 17:34:51 Paper: https://arxiv.org/pdf/2010.12831.pdf Cod 阅读全文

posted @ 2022-03-20 17:38 AHU-WangXiao 阅读(130) 评论(0) 推荐(0) 编辑

Visualbert --- A simple and performant baseline for vision and language

摘要：Visualbert: A simple and performant baseline for vision and language 2022-03-20 15:19:04 Paper: https://arxiv.org/pdf/1908.03557 1. Background and Mot 阅读全文

posted @ 2022-03-20 15:27 AHU-WangXiao 阅读(404) 评论(0) 推荐(0) 编辑

Fusion of Detected Objects in Text for Visual Question Answering

摘要：Fusion of Detected Objects in Text for Visual Question Answering 2022-03-18 16:29:58 Paper: https://aclanthology.org/D19-1219/ Code: https://github.co 阅读全文

posted @ 2022-03-18 16:31 AHU-WangXiao 阅读(83) 评论(0) 推荐(0) 编辑

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

摘要：Align before Fuse: Vision and Language Representation Learning with Momentum Distillation 2022-03-18 10:04:06 Paper: https://proceedings.neurips.cc/pa 阅读全文

posted @ 2022-03-18 10:13 AHU-WangXiao 阅读(1244) 评论(0) 推荐(0) 编辑

ActBERT: Learning Global-Local Video-Text Representations

摘要：ActBERT: Learning Global-Local Video-Text Representations 2022-03-17 16:41:43 Paper: http://openaccess.thecvf.com/content_CVPR_2020/papers/Zhu_ActBERT 阅读全文

posted @ 2022-03-17 16:51 AHU-WangXiao 阅读(153) 评论(0) 推荐(0) 编辑

12-in-1: Multi-Task Vision and Language Representation Learning

摘要：12-in-1: Multi-Task Vision and Language Representation Learning 2022-03-17 09:45:41 Paper: https://openaccess.thecvf.com/content_CVPR_2020/papers/Lu_1 阅读全文

posted @ 2022-03-17 14:28 AHU-WangXiao 阅读(303) 评论(0) 推荐(0) 编辑

ALIGN: Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

摘要：Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision 2022-03-16 21:02:21 Paper: http://proceedings.mlr.press/v139 阅读全文

posted @ 2022-03-16 21:20 AHU-WangXiao 阅读(616) 评论(0) 推荐(0) 编辑

Connecting Vision and Language with Localized Narratives

该文被密码保护。

posted @ 2022-03-07 19:55 AHU-WangXiao 阅读(0) 评论(0) 推荐(0) 编辑

公告

昵称： AHU-WangXiao
园龄： 9年5个月
粉丝： 431
关注： 25

+加关注

2025年3月

日

一

二

三

四

五

六

The Blog of Xiao Wang

Associate Professor, School of Computer Science and Technology, Anhui University, Email: xiaowang@ahu.edu.cn

03 2022 档案

公告

搜索

常用链接

最新随笔

我的标签

积分与排名

随笔分类 (1028)

随笔档案 (909)

相册 (1)

Other Links

阅读排行榜

评论排行榜

推荐排行榜

最新评论