• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Guo, Wenzhong (Guo, Wenzhong.) [1] (Scholars:郭文忠) | Wang, Jianwen (Wang, Jianwen.) [2] | Wang, Shiping (Wang, Shiping.) [3] (Scholars:王石平)

Indexed by:

EI Scopus SCIE

Abstract:

Multimodal representation learning, which aims to narrow the heterogeneity gap among different modalities, plays an indispensable role in the utilization of ubiquitous multimodal data. Due to the powerful representation ability with multiple levels of abstraction, deep learning-based multimodal representation learning has attracted much attention in recent years. In this paper, we provided a comprehensive survey on deep multimodal representation learning which has never been concentrated entirely. To facilitate the discussion on how the heterogeneity gap is narrowed, according to the underlying structures in which different modalities are integrated, we category deep multimodal representation learning methods into three frameworks: joint representation, coordinated representation, and encoder-decoder. Additionally, we review some typical models in this area ranging from conventional models to newly developed technologies. This paper highlights on the key issues of newly developed technologies, such as encoder-decoder model, generative adversarial networks, and attention mechanism in a multimodal representation learning perspective, which, to the best of our knowledge, have never been reviewed previously, even though they have become the major focuses of much contemporary research. For each framework or model, we discuss its basic structure, learning objective, application scenes, key issues, advantages, and disadvantages, such that both novel and experienced researchers can benefit from this survey. Finally, we suggest some important directions for future work.

Keyword:

deep multimodal fusion multimodal adversarial learning multimodal deep learning Multimodal representation learning multimodal translation

Community:

  • [ 1 ] [Guo, Wenzhong]Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350116, Fujian, Peoples R China
  • [ 2 ] [Wang, Jianwen]Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350116, Fujian, Peoples R China
  • [ 3 ] [Wang, Shiping]Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350116, Fujian, Peoples R China
  • [ 4 ] [Guo, Wenzhong]Fuzhou Univ, Key Lab Network Comp & Intelligent Informat Proc, Fuzhou 350116, Fujian, Peoples R China
  • [ 5 ] [Wang, Jianwen]Fuzhou Univ, Key Lab Network Comp & Intelligent Informat Proc, Fuzhou 350116, Fujian, Peoples R China
  • [ 6 ] [Wang, Shiping]Fuzhou Univ, Key Lab Network Comp & Intelligent Informat Proc, Fuzhou 350116, Fujian, Peoples R China
  • [ 7 ] [Wang, Jianwen]Fujian Normal Univ, Coll Math & Informat, Fuzhou 350117, Fujian, Peoples R China
  • [ 8 ] [Wang, Jianwen]Fujian Normal Univ, Fujian Prov Engn Technol Res Ctr Publ Serv Big Da, Fuzhou 350117, Fujian, Peoples R China

Reprint 's Address:

  • 王石平

    [Wang, Shiping]Fuzhou Univ, Coll Math & Comp Sci, Fuzhou 350116, Fujian, Peoples R China;;[Wang, Shiping]Fuzhou Univ, Key Lab Network Comp & Intelligent Informat Proc, Fuzhou 350116, Fujian, Peoples R China

Show more details

Related Keywords:

Source :

IEEE ACCESS

ISSN: 2169-3536

Year: 2019

Volume: 7

Page: 63373-63394

3 . 7 4 5

JCR@2019

3 . 4 0 0

JCR@2023

ESI Discipline: ENGINEERING;

ESI HC Threshold:150

JCR Journal Grade:1

CAS Journal Grade:2

Cited Count:

WoS CC Cited Count: 238

SCOPUS Cited Count: 315

ESI Highly Cited Papers on the List: 6 Unfold All

  • 2025-1
  • 2024-11
  • 2024-9
  • 2024-7
  • 2024-5
  • 2024-3

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Online/Total:295/10048853
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1