Abstract
Although Transformer can be powerful for modeling visual relations and describing complicated patterns, it could still perform unsatisfactorily for vi......
小提示:本篇文献需要登录阅读全文,点击跳转登录