Abstract:
Emotion Recognition in Conversation (ERC) has emerged as a pivotal topic in human-computer interaction, attracting increasing attention. Although previous research has made progress, most approaches treat every modality equally, failing to differentiate the emotional information carried by different modalities and thus struggling to exploit the complementary and associative information within multimodal data. To address this issue, this paper proposes a Cross-Modal Fusion Network with Gated Units (CFN-GU). CFN-GU comprises two main components: the Single-Modal Transformer and the Learnable Fusion Strategy With Gate (LG-Fusion). The Single-Modal Transformer models contextual information for each unimodal feature sequence, extracting rich contextual emotional cues. LG-Fusion then adaptively learns a weight for each modality's features, capturing how much each modality contributes to the emotional information, and the three modalities are fused according to these learned weights. CFN-GU achieves an F1 score of 64.3% on IEMOCAP, effectively improving ERC performance and outperforming all benchmark baselines. © 2024 IEEE.
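The abstract gives no implementation details, so the following is a minimal sketch of what a learnable gated fusion over three modality streams could look like, assuming PyTorch encoder outputs of equal dimension. The module name (GatedFusion), the dimension d_model, and the softmax-normalized per-modality gates are illustrative assumptions, not the authors' published LG-Fusion code.

import torch
import torch.nn as nn

class GatedFusion(nn.Module):
    """Hypothetical sketch of a learnable gated fusion over three
    modality streams (text, audio, visual). Not the authors' code."""

    def __init__(self, d_model: int = 256):
        super().__init__()
        # One gate network per modality; each scores its stream's contribution.
        self.gates = nn.ModuleList([nn.Linear(d_model, 1) for _ in range(3)])
        self.proj = nn.Linear(d_model, d_model)

    def forward(self, text, audio, visual):
        # Each input: (batch, seq_len, d_model), e.g. single-modal Transformer outputs.
        streams = [text, audio, visual]
        # Per-modality scalar scores, normalized across the three modalities.
        scores = torch.cat([g(s) for g, s in zip(self.gates, streams)], dim=-1)
        weights = torch.softmax(scores, dim=-1)  # (batch, seq_len, 3)
        # Weighted sum of the three streams, then a final projection.
        fused = sum(weights[..., i:i + 1] * s for i, s in enumerate(streams))
        return self.proj(fused)

# Usage with dummy features standing in for three single-modal encoders.
b, t, d = 4, 20, 256
fusion = GatedFusion(d_model=d)
out = fusion(torch.randn(b, t, d), torch.randn(b, t, d), torch.randn(b, t, d))
print(out.shape)  # torch.Size([4, 20, 256])

Normalizing the gates with a softmax across modalities (rather than independent sigmoids) is one design choice consistent with the abstract's description of weighing each modality's contribution; the paper may use a different gating formulation.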
Year: 2024
Page: 14-21
Language: English