[Journal]
  • 《》 2022, Vol. 44, No. 12, Pt. 2

Abstract: Text-based Visual Question Answering (TextVQA) is a recently raised challenge requiring models to read text in images and answer natural language questions by jointly reasoning over the question, textual information and visual con...
