• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Chen, Y. (Chen, Y..) [1] | Du, C. (Du, C..) [2] | Zi, Y. (Zi, Y..) [3] | Xiong, S. (Xiong, S..) [4] | Lu, X. (Lu, X..) [5] (Scholars:卢孝强)

Indexed by:

Scopus

Abstract:

Remote Sensing (RS) audio-visual cross-modal retrieval is a challenging task in the search of meaningful RS information. Nevertheless, the impact of multi-scale features and associated redundant information in the RS images cannot be overlooked in the retrieval task. In addition, how to deal with the completely different physical expressions of different modal information is crucial for cross-modal retrieval tasks. To tackle these issues, we propose a Scale-aware Adaptive Refinement and Cross Interaction (SARCI) network. The Quaternion-attention Dominated Multi-scale Visual Refinement (QDMVR) module in SARCI is suggested to learn multi-scale visual features and further optimize features containing redundant information for different scale features. To better integrate channel attention and spatial attention for adaptively learning of meaningful visual semantics, we propose the Symmetric Quaternion Attention (SQA) within the QDMVR module to enhance RS visual features. The SQA mechanism acts on both high-level and low-level features to explore salient RS vision information across different scales. In order to allow information from different modalities to interact more valuably, we propose the Instruction-based Cross Learning Module (ICLM) to perform cross-modal feature interaction based on the characteristic of the two modalities. SARCI network demonstrates state-of-the-art performance on three public RS cross-modal datasets: Sydney, UCM and RSICD audio-visual datasets. The code is available at: https://github.com/WUTCM-Lab/SARCI. IEEE

Keyword:

Artificial intelligence Buildings cross-modal feature interaction Image retrieval Information retrieval Remote Sensing (RS) audio-visual cross-modal retrieval Scale-aware Adaptive Refinement and Cross Interaction (SARCI) Task analysis Technological innovation Visualization

Community:

  • [ 1 ] [Chen Y.]School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, China
  • [ 2 ] [Du C.]School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, China
  • [ 3 ] [Zi Y.]School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, China
  • [ 4 ] [Xiong S.]School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, China
  • [ 5 ] [Lu X.]College of Physics and Information Engineering, Fuzhou University, Fuzhou, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

IEEE Transactions on Geoscience and Remote Sensing

ISSN: 0196-2892

Year: 2024

Volume: 62

Page: 1-1

7 . 5 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Affiliated Colleges:

Online/Total:150/10041981
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1