Integrating Spatial Details With Long-Range Contexts for Semantic Segmentation of Very High-Resolution Remote-Sensing Images

Author：

Long, Jiang ^[1] | Li, Mengmeng ^[2] (Scholars：李蒙蒙) | Wang, Xiaoqin ^[3] (Scholars：汪小钦)

Indexed by：

EI Scopus SCIE

Abstract：

This letter presents a cross-learning network (i.e., CLCFormer) integrating fine-grained spatial details within long-range global contexts, based on convolutional neural networks (CNNs) and transformers, for semantic segmentation of very high-resolution (VHR) remote-sensing images. More specifically, CLCFormer comprises two parallel encoders, derived from the CNN and the transformer, and a CNN decoder. The encoders are built on SwinV2 and EfficientNet-B3 backbones, from which the extracted semantic features are aggregated at multiple levels using a bilateral feature fusion module (BiFFM). First, we used attention gate (ATG) modules to enhance feature representation, improving segmentation results for objects with various shapes and sizes. Second, we used an attention residual (ATR) module to refine the learning of spatial features, alleviating boundary blurring of occluded objects. Finally, we developed a new strategy, called auxiliary supervise strategy (ASS), for model optimization to further improve segmentation performance. Our method was tested on the WHU, Inria, and Potsdam datasets, and compared with CNN-based and transformer-based methods. Results showed that our method achieved state-of-the-art performance on the WHU building dataset (92.31% IoU), Inria building dataset (83.71% IoU), and Potsdam dataset (80.27% MIoU). We concluded that CLCFormer is a flexible, robust, and effective method for the semantic segmentation of VHR images. The code of the proposed model is available at https://github.com/long123524/CLCFormer.
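The IoU and MIoU figures quoted in the abstract are overlap metrics between a predicted label map and the reference labels. The following is a minimal sketch, assuming the standard definition (per-class intersection over union, averaged over the classes present for MIoU). It is not the authors' evaluation code; the function names and the toy label maps are hypothetical.

```python
def iou_per_class(pred, truth, num_classes):
    """Return per-class IoU for two label maps given as nested lists.

    A class that appears in neither map gets None instead of a score.
    """
    inter = [0] * num_classes
    union = [0] * num_classes
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p == t:
                # Pixel counts once toward both intersection and union.
                inter[p] += 1
                union[p] += 1
            else:
                # Mismatch: the pixel enlarges the union of both classes.
                union[p] += 1
                union[t] += 1
    return [i / u if u else None for i, u in zip(inter, union)]


def mean_iou(pred, truth, num_classes):
    """Average the IoU over the classes that occur in either map."""
    scores = [s for s in iou_per_class(pred, truth, num_classes) if s is not None]
    return sum(scores) / len(scores)


# Toy 2x3 label maps (hypothetical data): 0 = background, 1 = building.
truth = [[0, 1, 1],
         [0, 0, 1]]
pred = [[0, 1, 0],
        [0, 0, 1]]

print(iou_per_class(pred, truth, 2))  # background 3/4, building 2/3
print(mean_iou(pred, truth, 2))
```

For a binary building dataset the reported IoU would correspond to the building class alone, while a multi-class dataset such as Potsdam is summarized by the mean over classes.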

Keyword：

Auxiliary supervise; Buildings; CLCFormer; Convolution; Convolutional neural networks; convolutional neural networks (CNNs); Feature extraction; Semantics; semantic segmentation; Tiles; transformer; Transformers; very high-resolution (VHR) images

Affiliation：

[ 1 ] [Long, Jiang]Fuzhou Univ, Acad Digital China Fujian, Key Lab Spatial Data Min & Informat Sharing, Minist Educ, Fuzhou 350002, Peoples R China
[ 2 ] [Li, Mengmeng]Fuzhou Univ, Acad Digital China Fujian, Key Lab Spatial Data Min & Informat Sharing, Minist Educ, Fuzhou 350002, Peoples R China
[ 3 ] [Wang, Xiaoqin]Fuzhou Univ, Acad Digital China Fujian, Key Lab Spatial Data Min & Informat Sharing, Minist Educ, Fuzhou 350002, Peoples R China

Reprint Address：

李蒙蒙
[Li, Mengmeng]Fuzhou Univ, Acad Digital China Fujian, Key Lab Spatial Data Min & Informat Sharing, Minist Educ, Fuzhou 350002, Peoples R China

Email：

205527028@fzu.edu.cn |
mli@fzu.edu.cn |
wangxq@fzu.edu.cn

Version：

Integrating spatial details with long-range contexts for semantic segmentation of very high resolution remote sensing images
2023，IEEE Geoscience and Remote Sensing Letters
Integrating Spatial Details with Long-Range Contexts for Semantic Segmentation of Very High-Resolution Remote-Sensing Images
2023，IEEE Geoscience and Remote Sensing Letters

Related Articles：

HSINet: A Hybrid Semantic Integration Network for Medical Image Segmentation
2025，19th Chinese Conference on Image and Graphics Technologies and Applications, IGTA 2024
Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning From Very High Resolution Satellite Images
2025，IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING
SegFormer-Based Cotton Planting Areas Extraction from High-Resolution Remote Sensing Images
2023，11th International Conference on Agro-Geoinformatics, Agro-Geoinformatics 2023
Attention-Guided CNN-Transformer Hybrid Network for Hyperspectral Image Classification
2023，7th Asian Conference on Artificial Intelligence Technology, ACAIT 2023

Source ：

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS

ISSN： 1545-598X

Year： 2023

Volume： 20

Impact Factor： 4.0 (JCR@2023)

ESI Discipline： GEOSCIENCES;

ESI HC Threshold：26

JCR Journal Grade：1

CAS Journal Grade：3

Cited Count：

WoS CC Cited Count： 18

SCOPUS Cited Count： 14

ESI Highly Cited Papers on the List： 0

WanFang Cited Count：

Chinese Cited Count：

Affiliated Colleges：

Academy of Digital China (Fujian); data not explicitly attributed to a specific college or department
