Generalized pyramid co-attention with learnable aggregation net for video question answering-MedSci.cn

Generalized pyramid co-attention with learnable aggregation net for video question answering

Gao, LL; Chen, TM; Li, XP; Zeng, PP; Zhao, L; Li, YF

Gao, LL (corresponding author), Univ Elect Sci & Technol China, Ctr Future Media, Chengdu, Peoples R China.

PATTERN RECOGNITION, 2021; 120 ():

Abstract

Video based visual question answering (V-VQA) remains challenging at the intersection of vision and language. In this paper, we propose a novel archit......

Full Text Link

Links

期刊讨论 | 中国SCI论文 | 期刊主页 | 投稿经验 | 杂志官网 | 投稿链接 | 作者需知 | PMC链接 | Pubmed全文检索

科室
- - 订阅+
  - 更多科室
工具
服务