Relevance-Guided Adaptive Learning for Remote Sensing Image–Text Retrieval - Details

author：

Chen, X. (Chen, X..) ^[1] | Zheng, X. (Zheng, X..) ^[2] | Lu, X. (Lu, X..) ^[3]

Indexed by：

Scopus

Abstract：

Remote　sensing　image–text　retrieval　aims　to　establish　semantic　alignment　between　images　and　texts　to　enable　accurate　cross-modal　retrieval.　Existing　methods　usually　extract　features　from　images　and　texts　independently,　aligning　them　in　a　shared　embedding　space　to　achieve　cross-modal　retrieval.　However,　these　methods　often　assume　complete　alignment　between　image–text　pairs,　overlooking　the　inherent　disparities　between　the　rich　visual　details　in　remote　sensing　images　and　the　abstract　nature　of　textual　descriptions.　These　disparities　result　in　image–text　pairs　only　sharing　partial　semantic　correlations,　rather　than　one-to-one　complete　alignment.　Such　incomplete　alignment　adversely　affects　model　training　and　retrieval　accuracy.　To　address　this　problem,　a　relevance-guided　adaptive　learning　method　is　proposed,　which　quantifies　and　leverages　the　relevance　of　image–text　pairs　to　refine　the　training　process　while　enhancing　retrieval　performance.　First,　the　proposed　method　introduces　an　image–text　relevance　measurement　mechanism　that　integrates　global　and　local　feature　distances　to　accurately　evaluate　the　degree　of　semantic　relevance　between　images　and　texts.　Second,　a　relevance-based　sample　division　strategy　is　proposed,　utilizing　a　Gaussian　Mixture　Model　to　dynamically　redivide　samples　into　positive　and　negative　pairs　according　to　the　measured　image–text　relevance.　This　strategy　refines　the　training　dataset,　reduces　noise,　and　enhances　the　effectiveness　of　model　learning.　Finally,　a　relevance-weighted　triplet　loss　is　designed　to　adaptively　adjust　the　contribution　of　sample　pairs　to　the　loss　function　based　on　their　relevance,　further　optimizing　model　training　and　enhancing　retrieval　accuracy.　Experimental　results　on　multiple　remote　sensing　image–text　retrieval　datasets　demonstrate　that　the　proposed　method　significantly　improves　retrieval　accuracy　and　performance.　©　1980-2012　IEEE.

Keyword：

incomplete alignment relevance-based sample division relevance-weighted triplet loss Remote sensing image–text retrieval semantic alignment

Community：

[ 1 ] [Chen X.]Fuzhou University, College of Physics and Information Engineering, Fuzhou, 350108, China
[ 2 ] [Zheng X.]Fuzhou University, College of Physics and Information Engineering, Fuzhou, 350108, China
[ 3 ] [Lu X.]Fuzhou University, College of Physics and Information Engineering, Fuzhou, 350108, China

Reprint 's Address：

Email：

Show more details

Related Keywords：

A Spatial and Semantic Alignment Fusion Network for SeaLand Port Segmentation
2025，IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING
Context-Aware Local-Global Semantic Alignment for Remote Sensing Image-Text Retrieval
2025，IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
Visual Contextual Semantic Reasoning for Cross-Modal Drone Image-Text Retrieval
2024，IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
Prototype rectification for zero-shot learning
2024，PATTERN RECOGNITION

Source ：

IEEE Transactions on Geoscience and Remote Sensing

ISSN： 0196-2892

Year： 2025

Volume： 63

7 . 5 0 0

JCR@2023

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to