Abstract:
Multimodal sentiment analysis is an actively growing research area that utilizes language, acoustic, and visual signals to predict sentiment tendency. Compared with language, acoustic and visual features carry a more pronounced personal style, which may degrade the model's generalization capability. This issue is exacerbated in a speaker-independent setting, where the model encounters samples from unseen speakers during testing. To mitigate the impact of personal style, we propose SIMR, a framework for learning speaker-independent multimodal representations. The framework separates the nonverbal inputs into a style encoding and a content representation with the aid of informative cross-modal correlations. Moreover, when integrating complementary cross-modal information, classical transformer-based approaches are inherently inclined to discover compatible cross-modal interactions while ignoring incompatible ones. In contrast, we propose to locate both simultaneously through an enhanced cross-modal transformer module. Experimental results show that the proposed model achieves state-of-the-art performance on several datasets.
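To make the notion of modeling both compatible and incompatible cross-modal interactions concrete, the sketch below shows one way a dual-branch cross-modal attention block could look. It is a minimal illustration only: the module name, the dual-softmax scoring (softmax over the similarity scores and over their negation), and all dimensions are assumptions for exposition, not the actual SIMR architecture described in the paper.

```python
# Minimal sketch (NOT the paper's SIMR module): cross-modal attention that
# scores both "compatible" (similar) and "incompatible" (dissimilar)
# query-key pairs. All names and sizes are hypothetical.
import torch
import torch.nn as nn
import torch.nn.functional as F

class DualCrossModalAttention(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.q_proj = nn.Linear(dim, dim)   # queries from the target modality
        self.k_proj = nn.Linear(dim, dim)   # keys from the source modality
        self.v_proj = nn.Linear(dim, dim)   # values from the source modality
        self.out = nn.Linear(2 * dim, dim)  # fuse the two interaction branches

    def forward(self, target: torch.Tensor, source: torch.Tensor) -> torch.Tensor:
        # target: (batch, len_t, dim), source: (batch, len_s, dim)
        q = self.q_proj(target)
        k = self.k_proj(source)
        v = self.v_proj(source)
        scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
        compat = F.softmax(scores, dim=-1) @ v     # attends to similar source positions
        incompat = F.softmax(-scores, dim=-1) @ v  # attends to dissimilar source positions
        return self.out(torch.cat([compat, incompat], dim=-1))

# Usage example: attend from language features to acoustic features.
block = DualCrossModalAttention(dim=64)
lang = torch.randn(8, 20, 64)      # language sequence
acoustic = torch.randn(8, 50, 64)  # acoustic sequence
fused = block(lang, acoustic)      # shape: (8, 20, 64)
```

In this reading, the second branch surfaces the most dissimilar source positions, which is one plausible way to expose "incompatible" interactions that a standard cross-modal transformer would down-weight; the paper's enhanced module may differ.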
Source: INFORMATION SCIENCES
ISSN: 0020-0255
Year: 2023
Volume: 628
Pages: 208-225
ESI Discipline: COMPUTER SCIENCE;
ESI HC Threshold:32
CAS Journal Grade:1
Cited Count:
WoS CC Cited Count: 5
SCOPUS Cited Count: 5
ESI Highly Cited Papers on the List: 0