• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索
High Impact Results & Cited Count Trend for Year Keyword Cloud and Partner Relationship

Query:

学者姓名:李蒙蒙

Refining:

Co-

Submit Unfold

Language

Submit

Clean All

Sort by:
Default
  • Default
  • Title
  • Year
  • WOS Cited Count
  • Impact factor
  • Ascending
  • Descending
< Page ,Total 5 >
Dual Fine-Grained network with frequency Transformer for change detection on remote sensing images SCIE
期刊论文 | 2025 , 136 | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION
Abstract&Keyword Cite Version(1)

Abstract :

Change detection is a fundamental yet challenging task in remote sensing, crucial for monitoring urban expansion, land use changes, and environmental dynamics. However, compared with common color images, objects in remote sensing images exhibit minimal interclass variation and significant intraclass variation in the spectral dimension, with obvious scale inconsistency in the spatial dimension. Change detection complexity presents significant challenges, including differentiating similar objects, accounting for scale variations, and identifying pseudo changes. This research introduces a dual fine-grained network with a frequency Transformer (named as FTransDF-Net) to address the above issues. Specifically, for small-scale and approximate spectral ground objects, the network employs an encoder-decoder architecture consisting of dual fine-grained gated (DFG) modules. This enables the extraction and fusion of fine-grained level information in dual dimensions of features, facilitating a comprehensive analysis of their differences and correlations. As a result, a dynamic fusion representation of salient information is achieved. Additionally, we develop a lightweight frequency transformer (LFT) with minimal parameters for detecting large-scale ground objects that undergo significant changes over time. This is achieved by incorporating a frequency attention (FA) module, which utilizes Fourier transform to model long-range dependencies and combines global adaptive attentive features with multi-level fine-grained features. Our comparative experiments across four publicly available datasets demonstrate that FTransDF-Net reaches advanced results. Importantly, it outperforms the leading comparison method by 1.23% and 2.46% regarding IoU metrics concerning CDD and DSIFN, respectively. Furthermore, efficacy for each module is substantiated through ablation experiments. The code is accessible on https://github.com/LeeThrzz/FTrans-DF-Net.

Keyword :

Change detection Change detection Dual fine-grained Dual fine-grained Frequency transformer Frequency transformer Remote sensing Remote sensing

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Li, Zhen , Zhang, Zhenxin , Li, Mengmeng et al. Dual Fine-Grained network with frequency Transformer for change detection on remote sensing images [J]. | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION , 2025 , 136 .
MLA Li, Zhen et al. "Dual Fine-Grained network with frequency Transformer for change detection on remote sensing images" . | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION 136 (2025) .
APA Li, Zhen , Zhang, Zhenxin , Li, Mengmeng , Zhang, Liqiang , Peng, Xueli , He, Rixing et al. Dual Fine-Grained network with frequency Transformer for change detection on remote sensing images . | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION , 2025 , 136 .
Export to NoteExpress RIS BibTex

Version :

Dual Fine-Grained network with frequency Transformer for change detection on remote sensing images Scopus
期刊论文 | 2025 , 136 | International Journal of Applied Earth Observation and Geoinformation
Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning From Very High Resolution Satellite Images SCIE
期刊论文 | 2025 , 18 , 976-994 | IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING
Abstract&Keyword Cite Version(3)

Abstract :

Building type information indicates the functional properties of buildings and plays a crucial role in smart city development and urban socioeconomic activities. Existing methods for classifying building types often face challenges in accurately distinguishing buildings between types while maintaining well-delineated boundaries, especially in complex urban environments. This study introduces a novel framework, i.e., CNN-Transformer cross-attention feature fusion network (CTCFNet), for building type classification from very high resolution remote sensing images. CTCFNet integrates convolutional neural networks (CNNs) and Transformers using an interactive cross-encoder fusion module that enhances semantic feature learning and improves classification accuracy in complex scenarios. We develop an adaptive collaboration optimization module that applies human visual attention mechanisms to enhance the feature representation of building types and boundaries simultaneously. To address the scarcity of datasets in building type classification, we create two new datasets, i.e., the urban building type (UBT) dataset and the town building type (TBT) dataset, for model evaluation. Extensive experiments on these datasets demonstrate that CTCFNet outperforms popular CNNs, Transformers, and dual-encoder methods in identifying building types across various regions, achieving the highest mean intersection over union of 78.20% and 77.11%, F1 scores of 86.83% and 88.22%, and overall accuracy of 95.07% and 95.73% on the UBT and TBT datasets, respectively. We conclude that CTCFNet effectively addresses the challenges of high interclass similarity and intraclass inconsistency in complex scenes, yielding results with well-delineated building boundaries and accurate building types.

Keyword :

Accuracy Accuracy Architecture Architecture Buildings Buildings Building type classification Building type classification CNN-transformer networks CNN-transformer networks cross-encoder cross-encoder Earth Earth Feature extraction Feature extraction feature interaction feature interaction Optimization Optimization Remote sensing Remote sensing Semantics Semantics Transformers Transformers very high resolution remote sensing very high resolution remote sensing Visualization Visualization

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Zhang, Shaofeng , Li, Mengmeng , Zhao, Wufan et al. Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning From Very High Resolution Satellite Images [J]. | IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING , 2025 , 18 : 976-994 .
MLA Zhang, Shaofeng et al. "Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning From Very High Resolution Satellite Images" . | IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING 18 (2025) : 976-994 .
APA Zhang, Shaofeng , Li, Mengmeng , Zhao, Wufan , Wang, Xiaoqin , Wu, Qunyong . Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning From Very High Resolution Satellite Images . | IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING , 2025 , 18 , 976-994 .
Export to NoteExpress RIS BibTex

Version :

Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning from Very High Resolution Satellite Images Scopus
期刊论文 | 2025 , 18 , 976-994 | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning from Very High Resolution Satellite Images EI
期刊论文 | 2025 , 18 , 976-994 | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Building Type Classification Using CNN-Transformer Cross-Encoder Adaptive Learning From Very-High-Resolution Satellite Images Scopus
期刊论文 | 2024 | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Extracting vectorized agricultural parcels from high-resolution satellite images using a Point-Line-Region interactive multitask model SCIE
期刊论文 | 2025 , 231 | COMPUTERS AND ELECTRONICS IN AGRICULTURE
Abstract&Keyword Cite Version(2)

Abstract :

Precise information on agricultural parcels is crucial for effective farm management, crop mapping, and monitoring. Current techniques often encounter difficulties in automatically delineating vectorized parcels from remote sensing images, especially in irregular-shaped areas, making it challenging to derive closed and vectorized boundaries. To address this, we treat parcel delineation as identifying valid parcel vertices from remote sensing images to generate parcel polygons. We introduce a Point-Line-Region interactive multitask network (PLR-Net) that jointly learns semantic features of parcel vertices, boundaries, and regions through point-, line-, and region-related subtasks within a multitask learning framework. We derived an attraction field map (AFM) to enhance the feature representation of parcel boundaries and improve the detection of parcel regions while maintaining high geometric accuracy. The point-related subtask focuses on learning features of parcel vertices to obtain preliminary vertices, which are then refined based on detected boundary pixels to derive valid parcel vertices for polygon generation. We designed a spatial and channel excitation module for feature interaction to enhance interactions between points, lines, and regions. Finally, the generated parcel polygons are refined using the Douglas-Peucker algorithm to regularize polygon shapes. We evaluated PLR-Net using high-resolution GF-2 satellite images from the Shandong, Xinjiang, and Sichuan provinces of China and medium-resolution Sentinel-2 images from The Netherlands. Results showed that our method outperformed existing state-of-the-art techniques (e.g., BsiNet, SEANet, and Hisup) in pixel- and object-based geometric accuracy across all datasets, achieving the highest IoU and polygonal average precision on GF2 datasets (e.g., 90.84% and 82.00% in Xinjiang) and on the Sentinel-2 dataset (75.86% and 47.1%). Moreover, when trained on the Xinjiang dataset, the model successfully transferred to the Shandong dataset, achieving an IoU score of 83.98%. These results demonstrate that PLR-Net is an accurate, robust, and transferable method suitable for extracting vectorized parcels from diverse regions and types of remote sensing images. The source codes of our model are available at https://github.com/mengmengli01/PLR-Net-demo/tree/main.

Keyword :

Agricultural parcel delineation Agricultural parcel delineation Multitask neural networks Multitask neural networks PLR-Net PLR-Net Point-line-region interactive Point-line-region interactive Vectorized parcels Vectorized parcels

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Li, Mengmeng , Lu, Chengwen , Lin, Mengjing et al. Extracting vectorized agricultural parcels from high-resolution satellite images using a Point-Line-Region interactive multitask model [J]. | COMPUTERS AND ELECTRONICS IN AGRICULTURE , 2025 , 231 .
MLA Li, Mengmeng et al. "Extracting vectorized agricultural parcels from high-resolution satellite images using a Point-Line-Region interactive multitask model" . | COMPUTERS AND ELECTRONICS IN AGRICULTURE 231 (2025) .
APA Li, Mengmeng , Lu, Chengwen , Lin, Mengjing , Xiu, Xiaolong , Long, Jiang , Wang, Xiaoqin . Extracting vectorized agricultural parcels from high-resolution satellite images using a Point-Line-Region interactive multitask model . | COMPUTERS AND ELECTRONICS IN AGRICULTURE , 2025 , 231 .
Export to NoteExpress RIS BibTex

Version :

Extracting vectorized agricultural parcels from high-resolution satellite images using a Point-Line-Region interactive multitask model EI
期刊论文 | 2025 , 231 | Computers and Electronics in Agriculture
Extracting vectorized agricultural parcels from high-resolution satellite images using a Point-Line-Region interactive multitask model Scopus
期刊论文 | 2025 , 231 | Computers and Electronics in Agriculture
Extraction buildings from very high-resolution images with asymmetric siamese multitask networks and adversarial edge learning SCIE
期刊论文 | 2025 , 136 | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION
Abstract&Keyword Cite Version(1)

Abstract :

Building extraction from very high-resolution remote-sensing images still faces two main issues: (1) small buildings are severely omitted and the extracted building shapes have a low consistency with ground truths. (2) supervised deep-learning methods have poor performance in few-shot scenarios, limiting the practical application of these methods. To address the first issue, we propose an asymmetric Siamese multitask network integrating adversarial edge learning called ASMBR-Net for building extraction. It contains an efficient asymmetric Siamese feature extractor comprising pre-trained backbones of convolutional neural networks and Transformers under pre-training and fine-tuning paradigms. This extractor balances the local and global feature representation and reduces training costs. Adversarial edge-learning technology automatically integrates edge constraints and strengthens the modeling ability of small and complex building-shaped patterns. Aiming to overcome the second issue, we introduce a self-training framework and design an instance transfer strategy to generate reliable pseudo-samples. We examined the proposed method on the WHU and Massachusetts (MA) datasets and a self-constructed Dongying (DY) dataset, comparing it with state-of-the-art methods. The experimental results show that our method achieves the highest F1-score of 96.06%, 86.90%, and 84.98% on the WHU, MA, and DY datasets, respectively. Ablation experiments further verify the effectiveness of the proposed method. The code is available at: https://github.com/liuxuanguang/ASMBR-Net

Keyword :

Adversarial learning Adversarial learning Building extraction Building extraction Multitask learning Multitask learning Self-learning Self-learning VHR remote-sensing image VHR remote-sensing image

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Liu, Xuanguang , Li, Yujie , Dai, Chenguang et al. Extraction buildings from very high-resolution images with asymmetric siamese multitask networks and adversarial edge learning [J]. | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION , 2025 , 136 .
MLA Liu, Xuanguang et al. "Extraction buildings from very high-resolution images with asymmetric siamese multitask networks and adversarial edge learning" . | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION 136 (2025) .
APA Liu, Xuanguang , Li, Yujie , Dai, Chenguang , Zhang, Zhenchao , Ding, Lei , Li, Mengmeng et al. Extraction buildings from very high-resolution images with asymmetric siamese multitask networks and adversarial edge learning . | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION , 2025 , 136 .
Export to NoteExpress RIS BibTex

Version :

Extraction buildings from very high-resolution images with asymmetric siamese multitask networks and adversarial edge learning Scopus
期刊论文 | 2025 , 136 | International Journal of Applied Earth Observation and Geoinformation
Cross-modal feature interaction network for heterogeneous change detection SCIE
期刊论文 | 2025 | GEO-SPATIAL INFORMATION SCIENCE
Abstract&Keyword Cite Version(1)

Abstract :

Heterogeneous change detection is a task of considerable practical importance and significant challenge in remote sensing. Heterogeneous change detection involves identifying change areas using remote sensing images obtained from different sensors or imaging conditions. Recently, research has focused on feature space translation methods based on deep learning technology for heterogeneous images. However, these types of methods often lead to the loss of original image information, and the translated features cannot be efficiently compared, further limiting the accuracy of change detection. For these issues, we propose a cross-modal feature interaction network (CMFINet). Specifically, CMFINet introduces a cross-modal interaction module (CMIM), which facilitates the interaction between heterogeneous features through attention exchange. This approach promotes consistent representation of heterogeneous features while preserving image characteristics. Additionally, we design a differential feature extraction module (DFEM) to enhance the extraction of true change features from spatial and channel dimensions, facilitating efficient comparison after feature interaction. Extensive experiments conducted on the California, Toulouse, and Wuhan datasets demonstrate that CMFINet outperforms eight existing methods in identifying change areas in different scenes from multimodal images. Compared to the existing methods applied to the three datasets, CMFINet achieved the highest F1 scores of 83.93%, 75.65%, and 95.42%, and the highest mIoU values of 85.38%, 78.34%, and 94.87%, respectively. The results demonstrate the effectiveness and applicability of CMFINet in heterogeneous change detection.

Keyword :

attention mechanisms attention mechanisms Change detection Change detection CNN CNN feature interaction feature interaction heterogeneous remote sensing images heterogeneous remote sensing images

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Yang, Zhiwei , Wang, Xiaoqin , Lin, Haihan et al. Cross-modal feature interaction network for heterogeneous change detection [J]. | GEO-SPATIAL INFORMATION SCIENCE , 2025 .
MLA Yang, Zhiwei et al. "Cross-modal feature interaction network for heterogeneous change detection" . | GEO-SPATIAL INFORMATION SCIENCE (2025) .
APA Yang, Zhiwei , Wang, Xiaoqin , Lin, Haihan , Li, Mengmeng , Lin, Mengjing . Cross-modal feature interaction network for heterogeneous change detection . | GEO-SPATIAL INFORMATION SCIENCE , 2025 .
Export to NoteExpress RIS BibTex

Version :

Cross-modal feature interaction network for heterogeneous change detection Scopus
期刊论文 | 2025 | Geo-Spatial Information Science
Integrating Segment Anything Model Derived Boundary Prior and High-Level Semantics for Cropland Extraction From High-Resolution Remote Sensing Images SCIE
期刊论文 | 2024 , 21 | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS
Abstract&Keyword Cite

Abstract :

Visual foundation models (VFMs) pretrained on large-scale training datasets show robust zero-shot adaptability across many vision tasks. However, there still exist limitations in remote sensing processing tasks due to the variety and complexity of remote sensing images. In this letter, we propose a two-flow network (TFNet) based on multitask VFM, named TFNet, to extract croplands with well-delineated boundaries from high-resolution remote sensing images. TFNet consists of a mask flow and a boundary flow. It first uses a VFM as visual encoder to obtain universal semantic features regarding croplands and then aggregates them into the two flows. Next, a boundary prior-guided module (BPM) is developed to incorporate boundary semantics derived from the boundary flow into the mask flow, to refine the boundary details of croplands. We also develop a multibranch parallel fusion module (MPFM) that aggregates multiscale contextual information to improve the identification of cropland with varied sizes and shapes. Finally, a semantic consistency loss is introduced to further optimize the feature learning of cropland information. We conducted extensive experiments on Shandong (SD) and Xinjiang (XJ) datasets collected from Gaofen-2 (GF-2) satellites and compared our method with five existing methods. Experimental results show that the croplands extracted by our method have the fewest omissions and errors, achieving the highest attribute accuracy (intersection over union (IoU) of 0.863 and 0.945) and lowest geometric errors (global total classification (GTC) of 0.134 and 0.097) than other methods on the two datasets. Our method effectively distinguished croplands of varied sizes, shapes, and spectra, even in scenarios with limited samples. Code and datasets are available at https://github.com/long123524/TFNet.

Keyword :

Boundary prior Boundary prior cropland extraction cropland extraction high-resolution remote sensing images high-resolution remote sensing images limited samples limited samples two-flow network (TFNet) two-flow network (TFNet) visual foundation model (VFM) visual foundation model (VFM)

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Long, Jiang , Zhao, Hang , Li, Mengmeng et al. Integrating Segment Anything Model Derived Boundary Prior and High-Level Semantics for Cropland Extraction From High-Resolution Remote Sensing Images [J]. | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS , 2024 , 21 .
MLA Long, Jiang et al. "Integrating Segment Anything Model Derived Boundary Prior and High-Level Semantics for Cropland Extraction From High-Resolution Remote Sensing Images" . | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS 21 (2024) .
APA Long, Jiang , Zhao, Hang , Li, Mengmeng , Wang, Xiaoqin , Lu, Chengwen . Integrating Segment Anything Model Derived Boundary Prior and High-Level Semantics for Cropland Extraction From High-Resolution Remote Sensing Images . | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS , 2024 , 21 .
Export to NoteExpress RIS BibTex

Version :

Mapping tobacco planting areas in smallholder farmlands using Phenological-Spatial-Temporal LSTM from time-series Sentinel-1 SAR images SCIE
期刊论文 | 2024 , 129 | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION
WoS CC Cited Count: 2
Abstract&Keyword Cite Version(1)

Abstract :

Phenological information on crop growth aids in identifying crop types from remote sensing images, but its incorporation into classification models is insufficiently exploited, especially in deep learning frameworks. This study presents a new model, Phenological-Temporal-Spatial LSTM (PST-LSTM), for mapping tobacco planting areas in smallholder farmlands using time-series Sentinel-1 Synthetic Aperture Radar (SAR) images. The PSTLSTM model is built on a multi-modal learning framework that fuses phenological information with deep spatial-temporal features. We applied the model to extract tobacco planting areas in Ninghua, Pucheng, and Shanghang Counties, in Fujian Province, and Luoping County in Yunnan Province, China. We compared PSTLSTM with existing methods based on phenological rules and Dynamic Time Warping (DTW) methods, and analyzed its strength in feature fusion. Results showed that our model outperformed these methods, achieving an overall accuracy (OA) of 93.16% compared to 86.69% and 85.93% for the phenological rules and DTW methods, respectively, in the Ninghua area. PST-LSTM effectively integrated time-series data with phenological information derived from different strategies at the feature level and performed better than existing feature fusion methods (based upon fuzzy sets) at the decision level. It also demonstrated a better spatial transferability than other methods when applied to different study areas, achieving an OA of 90.95%, 91.41%, and 80.75% for the Pucheng, Shanghang, and Luoping areas, respectively, using training samples from Ninghua. We conclude that PST-LSTM can effectively extract tobacco planting areas in smallholder farming from time-series SAR images and has the potential for mapping other crop types.

Keyword :

LSTM) LSTM) Phenological knowledge Phenological knowledge Phenological-Temporal-Spatial LSTM (PST Phenological-Temporal-Spatial LSTM (PST Smallholder farming Smallholder farming Time -series SAR images Time -series SAR images Tobacco mapping Tobacco mapping

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Li, Mengmeng , Feng, Xiaomin , Belgiu, Mariana . Mapping tobacco planting areas in smallholder farmlands using Phenological-Spatial-Temporal LSTM from time-series Sentinel-1 SAR images [J]. | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION , 2024 , 129 .
MLA Li, Mengmeng et al. "Mapping tobacco planting areas in smallholder farmlands using Phenological-Spatial-Temporal LSTM from time-series Sentinel-1 SAR images" . | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION 129 (2024) .
APA Li, Mengmeng , Feng, Xiaomin , Belgiu, Mariana . Mapping tobacco planting areas in smallholder farmlands using Phenological-Spatial-Temporal LSTM from time-series Sentinel-1 SAR images . | INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION , 2024 , 129 .
Export to NoteExpress RIS BibTex

Version :

Mapping tobacco planting areas in smallholder farmlands using Phenological-Spatial-Temporal LSTM from time-series Sentinel-1 SAR images Scopus
期刊论文 | 2024 , 129 | International Journal of Applied Earth Observation and Geoinformation
Semantic change detection using a hierarchical semantic graph interaction network from high-resolution remote sensing images SCIE
期刊论文 | 2024 , 211 , 318-335 | ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING
WoS CC Cited Count: 6
Abstract&Keyword Cite Version(2)

Abstract :

Current semantic change detection (SCD) methods face challenges in modeling temporal correlations (TCs) between bitemporal semantic features and difference features. These methods lead to inaccurate detection results, particularly for complex SCD scenarios. This paper presents a hierarchical semantic graph interaction network (HGINet) for SCD from high-resolution remote sensing images. This multitask neural network combines semantic segmentation and change detection tasks. For semantic segmentation, we construct a multilevel perceptual aggregation network with a pyramidal architecture. It extracts semantic features that discriminate between different categories at multiple levels. We model the correlations between bitemporal semantic features using a TC module that enhances the identification of unchanged areas. For change detection, we design a semantic difference interaction module based on a graph convolutional network. It measures the interactions among bitemporal semantic features, their corresponding difference features, and the combination of both. Extensive experiments on four datasets, namely SECOND, HRSCD, Fuzhou, and Xiamen, show that HGINet performs better in identifying changed areas and categories across various scenarios and regions than nine existing methods. Compared with the existing methods applied on the four datasets, it achieves the highest F1scd values of 59.48%, 64.12%, 64.45%, and 84.93%, and SeK values of 19.34%, 14.55%, 18.28%, and 51.12%, respectively. Moreover, HGINet mitigates the influence of fake changes caused by seasonal effects, producing results with well-delineated boundaries and shapes. Furthermore, HGINet trained on the Fuzhou dataset is successfully transferred to the Xiamen dataset, demonstrating its effectiveness and robustness in identifying changed areas and categories from high-resolution remote sensing images. The code of our paper is accessible at https://github.com/long123524/HGINet-torch.

Keyword :

Hierarchical semantic graph interaction network Hierarchical semantic graph interaction network High-resolution remote sensing images High-resolution remote sensing images Semantic change detection Semantic change detection Semantic difference interaction Semantic difference interaction Temporal correlations Temporal correlations

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Long, Jiang , Li, Mengmeng , Wang, Xiaoqin et al. Semantic change detection using a hierarchical semantic graph interaction network from high-resolution remote sensing images [J]. | ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING , 2024 , 211 : 318-335 .
MLA Long, Jiang et al. "Semantic change detection using a hierarchical semantic graph interaction network from high-resolution remote sensing images" . | ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING 211 (2024) : 318-335 .
APA Long, Jiang , Li, Mengmeng , Wang, Xiaoqin , Stein, Alfred . Semantic change detection using a hierarchical semantic graph interaction network from high-resolution remote sensing images . | ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING , 2024 , 211 , 318-335 .
Export to NoteExpress RIS BibTex

Version :

Semantic change detection using a hierarchical semantic graph interaction network from high-resolution remote sensing images EI
期刊论文 | 2024 , 211 , 318-335 | ISPRS Journal of Photogrammetry and Remote Sensing
Semantic change detection using a hierarchical semantic graph interaction network from high-resolution remote sensing images Scopus
期刊论文 | 2024 , 211 , 318-335 | ISPRS Journal of Photogrammetry and Remote Sensing
TBSCD-Net: A Siamese Multitask Network Integrating Transformers and Boundary Regularization for Semantic Change Detection From VHR Satellite Images SCIE
期刊论文 | 2024 , 21 | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS
Abstract&Keyword Cite Version(2)

Abstract :

Semantic change detection (SCD) from very high-resolution (VHR) images involves two key challenges: 1) the global features of bitemporal images tend to be extracted insufficiently, leading to imprecise land cover semantic classification results; and 2) the detected changed objects exhibit ambiguous boundaries, resulting in low geometric accuracy. To address these two issues, we propose an SCD method called TBSCD-Net based on a multitask learning framework to simultaneously identify different types of semantic changes and regularize change boundaries. First, we construct a hybrid encoder combining transformer and convolutional neural network (CNN) (TCEncoder) to enhance the extraction of global context information. A bitemporal semantic linkage module (Bi-SLM) is embedded into the TCEncoder to enhance the semantic correlations between bitemporal images. Second, we introduce a boundary-region joint extractor based on Laplacian operators (LOBRE) to regularize the changed objects. We evaluated the effectiveness of the proposed method using the SECOND dataset and a Fuzhou GF-2 SCD dataset (FZ-SCD) and compared it with seven existing methods. The proposed method performed better than the other evaluated methods as it achieved 24.42% separation kappa (Sek) and 20.18% global total classification error (GTC) on the SECOND dataset and 23.10% Sek and 23.15% GTC on the FZ-SCD dataset. The results of ablation studies on the FZ-SCD dataset also verified the effectiveness of the developed modules for SCD.

Keyword :

Boundary regularization Boundary regularization Decoding Decoding Feature extraction Feature extraction Land surface Land surface Laplace equations Laplace equations multitask learning multitask learning semantic change detection (SCD) semantic change detection (SCD) Semantics Semantics Siamese neural network Siamese neural network Task analysis Task analysis Transformers Transformers very high-resolution (VHR) satellite images very high-resolution (VHR) satellite images

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Liu, Xuanguang , Dai, Chenguang , Zhang, Zhenchao et al. TBSCD-Net: A Siamese Multitask Network Integrating Transformers and Boundary Regularization for Semantic Change Detection From VHR Satellite Images [J]. | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS , 2024 , 21 .
MLA Liu, Xuanguang et al. "TBSCD-Net: A Siamese Multitask Network Integrating Transformers and Boundary Regularization for Semantic Change Detection From VHR Satellite Images" . | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS 21 (2024) .
APA Liu, Xuanguang , Dai, Chenguang , Zhang, Zhenchao , Li, Mengmeng , Wang, Hanyun , Ji, Hongliang et al. TBSCD-Net: A Siamese Multitask Network Integrating Transformers and Boundary Regularization for Semantic Change Detection From VHR Satellite Images . | IEEE GEOSCIENCE AND REMOTE SENSING LETTERS , 2024 , 21 .
Export to NoteExpress RIS BibTex

Version :

TBSCD-Net: A Siamese Multitask Network Integrating Transformers and Boundary Regularization for Semantic Change Detection From VHR Satellite Images EI
期刊论文 | 2024 , 21 , 1-5 | IEEE Geoscience and Remote Sensing Letters
TBSCD-Net: A Siamese Multi-task Network Integrating Transformers and Boundary Regularization for Semantic Change Detection from VHR Satellite Images Scopus
期刊论文 | 2024 , 21 , 1-1 | IEEE Geoscience and Remote Sensing Letters
Cross-Domain Urban Land Use Classification via Scenewise Unsupervised Multisource Domain Adaptation With Transformer SCIE
期刊论文 | 2024 , 17 , 10051-10066 | IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING
WoS CC Cited Count: 1
Abstract&Keyword Cite Version(2)

Abstract :

Current land use classification models based on very high-resolution (VHR) remote sensing images often suffer from high sample dependence and poor transferability. To address these challenges, we propose an unsupervised multisource domain adaptation framework for cross-domain land use classification that eliminates the need for repeatedly using source domain data. Our method uses the Swin Transformer as the backbone of the source domain model to extract features from multiple source domain samples. The model is trained on source domain samples, and unlabeled target domain samples are then used for target domain model training. To minimize the feature discrepancies between the source and target domains, we use a weighted information maximization loss and self-supervised pseudolabels to alleviate cross-domain classification noise. We conducted experiments on four public scene datasets and four new land use scene datasets created from different VHR images in four Chinese cities. Results show that our method outperformed three existing single-source cross-domain methods (i.e., DANN, DeepCORAL, and DSAN) and four multisource cross-domain methods (i.e., M3SDA, PTMDA, MFSAN, and SHOT), achieving the highest average classification accuracy and strong stability. We conclude that our method has high potential for practical applications in cross-domain land use classification using VHR images.

Keyword :

Adaptation models Adaptation models Computational modeling Computational modeling Cross-domain classification Cross-domain classification Data models Data models Feature extraction Feature extraction land use classification land use classification multisource domain adaptation multisource domain adaptation Remote sensing Remote sensing Swin transformer Swin transformer Transformers Transformers Urban areas Urban areas very high resolution (VHR) remote sensing images very high resolution (VHR) remote sensing images

Cite:

Copy from the list or Export to your reference management。

GB/T 7714 Li, Mengmeng , Zhang, Congcong , Zhao, Wufan et al. Cross-Domain Urban Land Use Classification via Scenewise Unsupervised Multisource Domain Adaptation With Transformer [J]. | IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING , 2024 , 17 : 10051-10066 .
MLA Li, Mengmeng et al. "Cross-Domain Urban Land Use Classification via Scenewise Unsupervised Multisource Domain Adaptation With Transformer" . | IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING 17 (2024) : 10051-10066 .
APA Li, Mengmeng , Zhang, Congcong , Zhao, Wufan , Zhou, Wen . Cross-Domain Urban Land Use Classification via Scenewise Unsupervised Multisource Domain Adaptation With Transformer . | IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING , 2024 , 17 , 10051-10066 .
Export to NoteExpress RIS BibTex

Version :

Cross-Domain Urban Land Use Classification Via Scene-Wise Unsupervised Multi-Source Domain Adaptation With Transformer Scopus
期刊论文 | 2024 , 17 , 1-16 | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Cross-Domain Urban Land Use Classification via Scenewise Unsupervised Multisource Domain Adaptation with Transformer EI
期刊论文 | 2024 , 17 , 10051-10066 | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
10| 20| 50 per page
< Page ,Total 5 >

Export

Results:

Selected

to

Format:
Online/Total:521/9699469
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1