• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Huang, Jinghao (Huang, Jinghao.) [1] | Chen, Yaxiong (Chen, Yaxiong.) [2] | Xiong, Shengwu (Xiong, Shengwu.) [3] | Lu, Xiaoqiang (Lu, Xiaoqiang.) [4]

Indexed by:

EI

Abstract:

The cross-modal drone image-text (DIT) retrieval task involves using either text or drone images as queries to retrieve relevant drone images or corresponding text. The primary challenge stems from the diverse and intricate nature of drone images, making effective alignment between image and text challenging. In response, we propose an innovative approach called visual contextual semantic reasoning (VCSR), aimed at precisely aligning information across different modalities. VCSR employs textual cues to guide rich semantic reasoning within the visual context, reducing redundancy in visual information. Furthermore, the method captures drone image information relevant to the text, revealing subtle correspondences between drone image regions and textual content. To enhance visual semantic learning, context region learning (CRL) term and consistency semantic alignment (CSA) terms are introduced for stronger guidance, further intensifying the cross-modal interaction between textual and visual data, resulting in more robust feature representation. Extensive experiments conducted on two self-constructed DIT datasets demonstrate that VCSR outperforms alternative methods in terms of DIT retrieval performance. The codes are accessible at https://github.com/huangjh98/VCSR. © 1980-2012 IEEE.

Keyword:

Drones Image coding Image retrieval Job analysis Latent semantic analysis Modal analysis Semantics Semantic Segmentation Target drones

Community:

  • [ 1 ] [Huang, Jinghao]The School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan; 430070, China
  • [ 2 ] [Huang, Jinghao]Wuhan University of Technology, Sanya Science and Education Innovation Park, Sanya; 572000, China
  • [ 3 ] [Huang, Jinghao]Chongqing Research Institute, Wuhan University of Technology, Chongqing; 401122, China
  • [ 4 ] [Chen, Yaxiong]The School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan; 430070, China
  • [ 5 ] [Chen, Yaxiong]Wuhan University of Technology, Sanya Science and Education Innovation Park, Sanya; 572000, China
  • [ 6 ] [Chen, Yaxiong]Shanghai Artificial Intelligence Laboratory, Shanghai; 200232, China
  • [ 7 ] [Chen, Yaxiong]Wuhan Huaxia Institute of Technology, School of Information Engineering, Wuhan; 430223, China
  • [ 8 ] [Chen, Yaxiong]Qiongtai Normal University, School of Information Science and Technology, Haikou; 571127, China
  • [ 9 ] [Xiong, Shengwu]The School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan; 430070, China
  • [ 10 ] [Xiong, Shengwu]Wuhan University of Technology, Sanya Science and Education Innovation Park, Sanya; 572000, China
  • [ 11 ] [Xiong, Shengwu]Shanghai Artificial Intelligence Laboratory, Shanghai; 200232, China
  • [ 12 ] [Xiong, Shengwu]Wuhan Huaxia Institute of Technology, School of Information Engineering, Wuhan; 430223, China
  • [ 13 ] [Xiong, Shengwu]Qiongtai Normal University, School of Information Science and Technology, Haikou; 571127, China
  • [ 14 ] [Lu, Xiaoqiang]Fuzhou University, College of Physics and Information Engineering, Fuzhou; 350108, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

IEEE Transactions on Geoscience and Remote Sensing

ISSN: 0196-2892

Year: 2024

Volume: 62

7 . 5 0 0

JCR@2023

CAS Journal Grade:1

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Affiliated Colleges:

Online/Total:304/10372580
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1