Visual Contextual Semantic Reasoning for Cross-Modal Drone Image-Text Retrieval - Details

author：

Huang, Jinghao (Huang, Jinghao.) ^[1] | Chen, Yaxiong (Chen, Yaxiong.) ^[2] | Xiong, Shengwu (Xiong, Shengwu.) ^[3] | Lu, Xiaoqiang (Lu, Xiaoqiang.) ^[4]

Indexed by：

Abstract：

The　cross-modal　drone　image-text　(DIT)　retrieval　task　involves　using　either　text　or　drone　images　as　queries　to　retrieve　relevant　drone　images　or　corresponding　text.　The　primary　challenge　stems　from　the　diverse　and　intricate　nature　of　drone　images,　making　effective　alignment　between　image　and　text　challenging.　In　response,　we　propose　an　innovative　approach　called　visual　contextual　semantic　reasoning　(VCSR),　aimed　at　precisely　aligning　information　across　different　modalities.　VCSR　employs　textual　cues　to　guide　rich　semantic　reasoning　within　the　visual　context,　reducing　redundancy　in　visual　information.　Furthermore,　the　method　captures　drone　image　information　relevant　to　the　text,　revealing　subtle　correspondences　between　drone　image　regions　and　textual　content.　To　enhance　visual　semantic　learning,　context　region　learning　(CRL)　term　and　consistency　semantic　alignment　(CSA)　terms　are　introduced　for　stronger　guidance,　further　intensifying　the　cross-modal　interaction　between　textual　and　visual　data,　resulting　in　more　robust　feature　representation.　Extensive　experiments　conducted　on　two　self-constructed　DIT　datasets　demonstrate　that　VCSR　outperforms　alternative　methods　in　terms　of　DIT　retrieval　performance.　The　codes　are　accessible　at　https://github.com/huangjh98/VCSR.　©　1980-2012　IEEE.

Keyword：

Drones Image coding Image retrieval Job analysis Latent semantic analysis Modal analysis Semantics Semantic Segmentation Target drones

Community：

[ 1 ] [Huang, Jinghao]The School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan; 430070, China
[ 2 ] [Huang, Jinghao]Wuhan University of Technology, Sanya Science and Education Innovation Park, Sanya; 572000, China
[ 3 ] [Huang, Jinghao]Chongqing Research Institute, Wuhan University of Technology, Chongqing; 401122, China
[ 4 ] [Chen, Yaxiong]The School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan; 430070, China
[ 5 ] [Chen, Yaxiong]Wuhan University of Technology, Sanya Science and Education Innovation Park, Sanya; 572000, China
[ 6 ] [Chen, Yaxiong]Shanghai Artificial Intelligence Laboratory, Shanghai; 200232, China
[ 7 ] [Chen, Yaxiong]Wuhan Huaxia Institute of Technology, School of Information Engineering, Wuhan; 430223, China
[ 8 ] [Chen, Yaxiong]Qiongtai Normal University, School of Information Science and Technology, Haikou; 571127, China
[ 9 ] [Xiong, Shengwu]The School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan; 430070, China
[ 10 ] [Xiong, Shengwu]Wuhan University of Technology, Sanya Science and Education Innovation Park, Sanya; 572000, China
[ 11 ] [Xiong, Shengwu]Shanghai Artificial Intelligence Laboratory, Shanghai; 200232, China
[ 12 ] [Xiong, Shengwu]Wuhan Huaxia Institute of Technology, School of Information Engineering, Wuhan; 430223, China
[ 13 ] [Xiong, Shengwu]Qiongtai Normal University, School of Information Science and Technology, Haikou; 571127, China
[ 14 ] [Lu, Xiaoqiang]Fuzhou University, College of Physics and Information Engineering, Fuzhou; 350108, China

Reprint 's Address：

Email：

Show more details

Related Keywords：

Improving Water Hyacinth Extraction from UAV Images Using Enhanced U-Net
2025，7th International Conference on Wireless Communications, Networking and Applications, WCNA 2023
Lightweight Multi-Scale UAV Identification Based on Radio Frequency Fingerprint
2025，2nd International Conference on Artificial Intelligence and Communication Technologies, ICAICT 2024
LVPTrack: High Performance Domain Adaptive UAV Tracking with Label Aligned Visual Prompt Tuning
2025，39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025
Functional Deployment of Drone Logistics
2020，2nd IEEE Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability 2020, ECBIOS 2020
Exploring Techniques to Mitigate Interference in Drone Communication Systems
2025，

Source ：

IEEE Transactions on Geoscience and Remote Sensing

ISSN： 0196-2892

Year： 2024

Volume： 62

7 . 5 0 0

JCR@2023

CAS Journal Grade：1

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to