• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Chen, Yaxiong (Chen, Yaxiong.) [1] | Huang, Jirui (Huang, Jirui.) [2] | Xiong, Shengwu (Xiong, Shengwu.) [3] | Lu, Xiaoqiang (Lu, Xiaoqiang.) [4]

Indexed by:

EI

Abstract:

In recent years, with the continuous advancement of remote sensing (RS) technology and text processing techniques, there has been a growing abundance of RS images and associated textual data. Combining RS images with their corresponding textual data allows for integrated analysis and retrieval, which holds significant practical implications across multiple application domains, including geographic information systems (GIS), environmental monitoring, and agricultural management. RS images have the characteristics of multitargets and multiscales, and the textual descriptions of these targets are not fully utilized, leading to a decrease in retrieval accuracy. Previous methods have struggled to balance intermodality information interaction and intramodality feature fusion, and they have paid little attention to the consistency of distribution within modalities. In light of this, this article proposes a symmetric multilevel guidance network (SMLGN) for cross-modal retrieval in RS. SMLGN first introduces fusion guidance between local and global within modalities and fine-grained bidirectional guidance between modalities, allowing for the learning of a common semantic space. Furthermore, to address the distribution differences of different modalities within the common semantic space, we design an adversarial joint learning framework and a multiobjective loss function to optimize the SMLGN method and achieve consistency in data distribution. The experimental results demonstrate that the SMLGN method performs well in the task of cross-modal retrieval between RS images and textual data. It effectively integrates the information from both modalities, improving the accuracy and reliability of the retrieval process. © 1980-2012 IEEE.

Keyword:

Image analysis Information management Learning systems Modal analysis Remote sensing Search engines Semantics Text processing

Community:

  • [ 1 ] [Chen, Yaxiong]Wuhan University of Technology, Chongqing Research Institute, Chongqing; 401122, China
  • [ 2 ] [Chen, Yaxiong]Wuhan University of Technology, Sanya Science and Education Innovation Park, Sanya; 572000, China
  • [ 3 ] [Chen, Yaxiong]Shanghai Artificial Intelligence Laboratory, Shanghai; 200232, China
  • [ 4 ] [Huang, Jirui]Wuhan University of Technology, Chongqing Research Institute, Chongqing; 401122, China
  • [ 5 ] [Huang, Jirui]Wuhan University of Technology, Sanya Science and Education Innovation Park, Sanya; 572000, China
  • [ 6 ] [Huang, Jirui]Shanghai Artificial Intelligence Laboratory, Shanghai; 200232, China
  • [ 7 ] [Xiong, Shengwu]Wuhan University of Technology, Chongqing Research Institute, Chongqing; 401122, China
  • [ 8 ] [Xiong, Shengwu]Wuhan University of Technology, Sanya Science and Education Innovation Park, Sanya; 572000, China
  • [ 9 ] [Xiong, Shengwu]Shanghai Artificial Intelligence Laboratory, Shanghai; 200232, China
  • [ 10 ] [Lu, Xiaoqiang]Fuzhou University, College of Physics and Information Engineering, Fuzhou; 350108, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Source :

IEEE Transactions on Geoscience and Remote Sensing

ISSN: 0196-2892

Year: 2024

Volume: 62

Page: 1-17

7 . 5 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 9

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Affiliated Colleges:

Online/Total:472/9743015
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1