随笔分类 - Vision and Language
该文被密码保护。
摘要:Capsule-based Object Tracking with Natural Language Specification 2021-12-18 19:28:39 Paper: https://dl.acm.org/doi/abs/10.1145/3474085.3475349 1. Bac
阅读全文
摘要:Grounding-Tracking-Integration2020-05-19 11:00:57 Paper: https://arxiv.org/pdf/1912.06316 本文提出一种 tracking-by-language 的算法,来根据文本描述进行目标跟踪。思路比较直观,将该任务分为三
阅读全文
摘要:What’s new for Transformers at the ICLR 2020 Conference? 2020-05-07 10:51:22 Source: https://towardsdatascience.com/whats-new-for-transformers-at-the-
阅读全文
摘要:Video Object Grounding using Semantic Roles in Language Description 2020-03-25 17:44:59 Paper:https://arxiv.org/pdf/2003.10606.pdf Code: https://githu
阅读全文
摘要:Attention is All you need 2020-03-22 00:29:11 Paper: https://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf Doc: https://huggingface.co/trans
阅读全文
该文被密码保护。
摘要:ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks 2020-03-12 23:10:53 Paper: NeurIPS 2019 Code: https:/
阅读全文
摘要:Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video 2020-03-08 14:29:35 Paper: https://arxiv.org/pdf/1906.02549.pdf Code: https://
阅读全文
摘要:Visual Semantic Reasoning for Image-Text Matching 2020-03-06 15:17:02 Paper: https://arxiv.org/pdf/1909.02701.pdf Code: https://github.com/KunpengLi19
阅读全文
摘要:Stacked Cross Attention for Image-Text Matching 2020-03-06 15:13:08 Paper: https://arxiv.org/pdf/1803.08024.pdf Code: https://github.com/kuanghuei/SCA
阅读全文
该文被密码保护。
该文被密码保护。
该文被密码保护。
摘要:Learning Conditioned Graph Structures for Interpretable Visual Question Answering 2019-05-29 00:29:43 Paper:http://papers.nips.cc/paper/8054-learning-
阅读全文
摘要:Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association2018-09-29 19:36:43 Paper:http://opena
阅读全文