Context-Aware Local-Global Semantic Alignment for Remote Sensing Image-Text Retrieval - Details

author：

Chen, Xiumei (Chen, Xiumei.) ^[1] | Zheng, Xiangtao (Zheng, Xiangtao.) ^[2] (Scholars：郑向涛) | Lu, Xiaoqiang (Lu, Xiaoqiang.) ^[3] (Scholars：卢孝强)

Indexed by：

EI Scopus SCIE

Abstract：

Remote　sensing　image-text　retrieval　(RSITR)　is　a　cross-modal　task　that　integrates　visual　and　textual　information,　attracting　significant　attention　in　remote　sensing　research.　Remote　sensing　images　typically　contain　complex　scenes　with　abundant　details,　presenting　significant　challenges　for　accurate　semantic　alignment　between　images　and　texts.　Despite　advances　in　the　field,　achieving　precise　alignment　in　such　intricate　contexts　remains　a　major　hurdle.　To　address　this　challenge,　this　article　introduces　a　novel　context-aware　local-global　semantic　alignment　(CLGSA)　method.　The　proposed　method　consists　of　two　key　modules:　the　local　key　feature　alignment　(LKFA)　module　and　the　cross-sample　global　semantic　alignment　(CGSA)　module.　The　LKFA　module　incorporates　a　local　image　masking　and　reconstruction　task　to　improve　the　alignment　between　image　and　text　features.　Specifically,　this　module　masks　certain　regions　of　the　image　and　uses　text　context　information　to　guide　the　reconstruction　of　the　masked　areas,　enhancing　the　alignment　of　local　semantics　and　ensuring　more　accurate　retrieval　of　region-specific　content.　The　CGSA　module　employs　a　hard　sample　triplet　loss　to　improve　global　semantic　consistency.　By　prioritizing　difficult　samples　during　training,　this　module　refines　feature　space　distributions,　helping　the　model　better　capture　global　semantics　across　the　entire　image-text　pair.　A　series　of　extensive　experiments　demonstrates　the　effectiveness　of　the　proposed　method.　The　method　achieves　an　mR　score　of　32.07%　on　the　RSICD　dataset　and　46.63%　on　the　RSITMD　dataset,　outperforming　baseline　methods　and　confirming　the　robustness　and　accuracy　of　the　approach.

Keyword：

Accuracy Cross modal retrieval Feature extraction Hard sample triplet loss Image reconstruction local image masking Remote sensing remote sensing image-text retrieval (RSITR) semantic alignment Semantics Sensors text-guided reconstruction Training Transformers Visualization

Community：

[ 1 ] [Chen, Xiumei]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350108, Peoples R China
[ 2 ] [Zheng, Xiangtao]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350108, Peoples R China
[ 3 ] [Lu, Xiaoqiang]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350108, Peoples R China

Reprint 's Address：

郑向涛
[Zheng, Xiangtao]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou 350108, Peoples R China

Email：

xiangtaoz@gmail.com

Show more details

Version：

Context-Aware Local–Global Semantic Alignment for Remote Sensing Image–Text Retrieval
2025，IEEE Transactions on Geoscience and Remote Sensing
Context-Aware Local–Global Semantic Alignment for Remote Sensing Image–Text Retrieval
2025，IEEE Transactions on Geoscience and Remote Sensing
Context-Aware Local–Global Semantic Alignment for Remote Sensing Image–Text Retrieval
2025，IEEE Transactions on Geoscience and Remote Sensing

Related Keywords：

Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning From Very High Resolution Satellite Images
2025，IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING
Bilinear Parallel Fourier Transformer for Multimodal Remote Sensing Classification
2025，IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
A Spatial and Semantic Alignment Fusion Network for SeaLand Port Segmentation
2025，IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING
VGRSS: Datasets and Models for Visual Grounding in Remote Sensing Ship Images
2025，IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

Source ：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

ISSN： 0196-2892

Year： 2025

Volume： 63

7 . 5 0 0

JCR@2023

CAS Journal Grade：1

Cited Count：

WoS CC Cited Count： 1

SCOPUS Cited Count： 3

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

物理与信息工程学院、微电子学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to