Abstract:
Remote-sensing image-text (RSIT) retrieval uses either textual descriptions or remote-sensing images (RSIs) as queries to retrieve relevant RSIs or corresponding text descriptions. Many traditional cross-modal RSIT retrieval methods overlook the importance of capturing salient information and of establishing the prior similarity between RSIs and texts, which degrades cross-modal retrieval performance. In this article, we address these challenges with a novel approach, multiscale salient image-guided text alignment (MSITA), which learns salient information by aligning text with images for effective cross-modal RSIT retrieval. MSITA first incorporates a multiscale fusion module and a salient learning module to extract salient information. It then introduces an image-guided text alignment (IGTA) mechanism that uses image information to guide the alignment of texts, capturing fine-grained correspondences between RSI regions and textual descriptions. Beyond these components, a novel loss function enhances the similarity across modalities and reinforces the prior similarity between RSIs and texts. Extensive experiments on four widely adopted RSIT datasets confirm that MSITA significantly outperforms other state-of-the-art methods in cross-modal RSIT retrieval.
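The record contains no implementation details, but as a rough illustration of the IGTA idea described in the abstract, the sketch below uses image region features to guide a cross-attention re-weighting of text tokens. It is a minimal, hypothetical rendering: the class name, dimensions, and the cosine-similarity readout are assumptions for illustration, not the authors' released code.

# Hypothetical sketch of an image-guided text alignment (IGTA) step:
# image region features act as queries that re-weight text token
# features via cross-attention. All names and sizes are illustrative
# assumptions, not the paper's actual implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ImageGuidedTextAlignment(nn.Module):
    def __init__(self, dim: int = 512):
        super().__init__()
        self.q_proj = nn.Linear(dim, dim)  # image regions -> queries
        self.k_proj = nn.Linear(dim, dim)  # text tokens  -> keys
        self.v_proj = nn.Linear(dim, dim)  # text tokens  -> values

    def forward(self, img_regions, text_tokens):
        # img_regions: (B, R, D) region features; text_tokens: (B, T, D)
        q = self.q_proj(img_regions)                   # (B, R, D)
        k = self.k_proj(text_tokens)                   # (B, T, D)
        v = self.v_proj(text_tokens)                   # (B, T, D)
        # scaled dot-product attention: each region attends over tokens
        attn = torch.softmax(q @ k.transpose(1, 2) / q.size(-1) ** 0.5, dim=-1)
        # text representation aligned to each image region: (B, R, D)
        return attn @ v

# Toy usage: region/token counts and the similarity readout are assumed.
igta = ImageGuidedTextAlignment(dim=512)
img = torch.randn(2, 36, 512)   # e.g., 36 region features per image
txt = torch.randn(2, 20, 512)   # e.g., 20 token features per caption
aligned = igta(img, txt)
# Region-wise cosine similarity, averaged per pair; a contrastive-style
# loss over matched/unmatched pairs could be built on such a score.
sim = F.cosine_similarity(aligned, img, dim=-1).mean(dim=1)  # (B,)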
Source:
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
ISSN: 0196-2892
Year: 2024
Volume: 62
Impact Factor: 7.500 (JCR@2023)