• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Huang, J. (Huang, J..) [1] | Chen, Y. (Chen, Y..) [2] | Xiong, S. (Xiong, S..) [3] | Lu, X. (Lu, X..) [4] (Scholars:卢孝强)

Indexed by:

Scopus

Abstract:

The cross-modal drone image-text (DIT) retrieval task involves using either text or drone images as queries to retrieve relevant drone images or corresponding text. The primary challenge stems from the diverse and intricate nature of drone images, making effective alignment between image and text challenging. In response, we propose an innovative approach called Visual Contextual Semantic Reasoning (VCSR), aimed at precisely aligning information across different modalities. VCSR employs textual cues to guide rich semantic reasoning within the visual context, reducing redundancy in visual information. Furthermore, the method captures drone image information relevant to the text, revealing subtle correspondences between drone image regions and textual content. To enhance visual semantic learning, context region learning term and consistency semantic alignment term are introduced for stronger guidance, further intensifying the cross-modal interaction between textual and visual data, resulting in more robust feature representation. Extensive experiments conducted on two self-constructed DIT datasets demonstrate that VCSR outperforms alternative methods in terms of DIT retrieval performance. The codes are accessible at https://github.com/huangjh98/VCSR. IEEE

Keyword:

Cognition Cross-modal drone retrieval Drones Hidden Markov models Semantic aligment Semantics Task analysis Technological innovation visual contextual semantic reasoning Visualization

Community:

  • [ 1 ] [Huang J.]School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, China
  • [ 2 ] [Chen Y.]Sanya Science and Education Innovation Park, Wuhan University of Technology, Sanya, China
  • [ 3 ] [Xiong S.]Sanya Science and Education Innovation Park, Wuhan University of Technology, Sanya, China
  • [ 4 ] [Lu X.]College of Physics and Information Engineering, Fuzhou University, Fuzhou, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Source :

IEEE Transactions on Geoscience and Remote Sensing

ISSN: 0196-2892

Year: 2024

Volume: 62

Page: 1-1

7 . 5 0 0

JCR@2023

CAS Journal Grade:1

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Affiliated Colleges:

Online/Total:261/10372406
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1