• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Chen, Liqing (Chen, Liqing.) [1] | Zhuo, Yifan (Zhuo, Yifan.) [2] | Wu, Yingjie (Wu, Yingjie.) [3] | Wang, Yilei (Wang, Yilei.) [4] (Scholars:王一蕾) | Zheng, Xianghan (Zheng, Xianghan.) [5] (Scholars:郑相涵)

Indexed by:

EI Scopus

Abstract:

Visual Question Answering (VQA) tasks must provide correct answers to the questions posed by given images. Such requirement has been a wide concern since this task was presented. VQA consists of four steps: image feature extraction, question text feature extraction, multi-modal feature fusion and answer reasoning. During multimodal feature fusion, outer product calculation is used in existing models, which leads to excessive model parameters, high training overhead, and slow convergence. To avoid these problems, we applied the Variational Autoencoder (VAE) method to calculate the probability distribution of the hidden variables of image and question text. Furthermore, we designed a question feature hierarchy method based on the traditional attention mechanism model and VAE. The objective is to investigate deep questions and image correlation features to improve the accuracy of VQA tasks. © Springer Nature Switzerland AG 2019.

Keyword:

Computer vision Extraction Feature extraction Image enhancement Learning systems Probability distributions

Community:

  • [ 1 ] [Chen, Liqing]College of Mathematics and Computer Science, Fuzhou University, Fuzhou; Fujian Province, China
  • [ 2 ] [Zhuo, Yifan]College of Mathematics and Computer Science, Fuzhou University, Fuzhou; Fujian Province, China
  • [ 3 ] [Wu, Yingjie]College of Mathematics and Computer Science, Fuzhou University, Fuzhou; Fujian Province, China
  • [ 4 ] [Wang, Yilei]College of Mathematics and Computer Science, Fuzhou University, Fuzhou; Fujian Province, China
  • [ 5 ] [Zheng, Xianghan]College of Mathematics and Computer Science, Fuzhou University, Fuzhou; Fujian Province, China

Reprint 's Address:

  • 王一蕾

    [wang, yilei]college of mathematics and computer science, fuzhou university, fuzhou; fujian province, china

Show more details

Version:

Related Keywords:

Related Article:

Source :

ISSN: 0302-9743

Year: 2019

Volume: 11858 LNCS

Page: 657-669

Language: English

0 . 4 0 2

JCR@2005

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 6

Online/Total:413/9692136
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1