• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Wang, Wenlong (Wang, Wenlong.) [1] | Yu, Peng (Yu, Peng.) [2] | Li, Mengmeng (Li, Mengmeng.) [3] | Zhong, Xiaojing (Zhong, Xiaojing.) [4] | He, Yuanrong (He, Yuanrong.) [5] | Su, Hua (Su, Hua.) [6] | Zhou, Yunxuan (Zhou, Yunxuan.) [7]

Indexed by:

Scopus SCIE

Abstract:

Building extraction from remote sensing imagery is vital for various human activities. But it is challenging due to diverse building appearances and complex backgrounds. Research shows the importance of both global context and spatial details for accurate building extraction. Therefore, methods integrating convolutional neural networks (CNNs) and visual transformers (ViTs) are popular nowadays. However, current methods combining these two methods inadequately merge their features and only perform decoding once, leading to issues like unclear boundaries, internal voids, and susceptibility to non-building elements in complex scenarios with low inter-class and high intra-class variability. To address these issues, this paper introduces a novel extraction method called TDFNet. We first replace ViT with V-Mamba, which has linear complexity, and combine it with CNN for feature extraction. A bidirectional fusion module (BFM) is then designed to comprehensively integrate spatial details and global information, thereby enabling accurate identification of boundaries between adjacent buildings, and maintaining the structural integrity of buildings to avoid internal holes. During the decoding process, we propose an Encoder-Decoder Fusion Module (EDFM) to initially merge features from different stages of the encoder and decoder, thereby diminishing the model's susceptibility to non-building elements with features similar to those of buildings, and consequently reducing the incidence of erroneous extractions. Subsequently, a twice decoding strategy is implemented to enhance the learning of multi-scale features significantly, thereby mitigating the impact of tree occlusions and shadows. Our method yields the state-of-the-art (SOTA) performance on three public building datasets.

Keyword:

Building extraction remote sensing twice decoding V-Mamba

Community:

  • [ 1 ] [Wang, Wenlong]Xiamen Univ Technol, Coll Comp & Informat Engn, Xiamen, Peoples R China
  • [ 2 ] [Yu, Peng]Xiamen Univ Technol, Coll Comp & Informat Engn, Xiamen, Peoples R China
  • [ 3 ] [He, Yuanrong]Xiamen Univ Technol, Coll Comp & Informat Engn, Xiamen, Peoples R China
  • [ 4 ] [Li, Mengmeng]Fuzhou Univ, Acad Digital China, Key Lab Spatial Data Min & Informat Sharing, Minist Educ, Fuzhou, Peoples R China
  • [ 5 ] [Su, Hua]Fuzhou Univ, Acad Digital China, Key Lab Spatial Data Min & Informat Sharing, Minist Educ, Fuzhou, Peoples R China
  • [ 6 ] [Zhong, Xiaojing]Jimei Univ, Coll Harbour & Coastal Engn, Xiamen Key Lab Green & Smart Coastal Engn, Xiamen, Peoples R China
  • [ 7 ] [Zhou, Yunxuan]East China Normal Univ, State Key Lab Estuarine & Coastal Res, Shanghai, Peoples R China

Reprint 's Address:

  • [Yu, Peng]Xiamen Univ Technol, Coll Comp & Informat Engn, Xiamen, Peoples R China

Show more details

Related Keywords:

Source :

GEO-SPATIAL INFORMATION SCIENCE

ISSN: 1009-5020

Year: 2025

4 . 4 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 4

Affiliated Colleges:

Online/Total:645/11562744
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1