FTransDeepLab: Multimodal Fusion Transformer-Based DeepLabv3+ for Remote Sensing Semantic Segmentation

Feng et al., 2025

Document ID: 6002085617372844893
Author: Feng H; Hu Q; Zhao P; Wang S; Ai M; Zheng D; Liu T
Publication year: 2025
Publication venue: IEEE Transactions on Geoscience and Remote Sensing

Snippet

High-resolution remote sensing images contain rich color and texture information, but due to the inherent limitations of 2-D data, achieving high-quality semantic segmentation remains a challenge. Multimodal data fusion technology has emerged as an effective approach to …

Classifications

- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
      - G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
        - G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
          - G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
      - G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
        - G06K9/62—Methods or arrangements for recognition using electronic means
          - G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
      - G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
        - G06K9/62—Methods or arrangements for recognition using electronic means
          - G06K9/6201—Matching; Proximity measures
            - G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
      - G06T2207/00—Indexing scheme for image analysis or image enhancement
        - G06T2207/10—Image acquisition modality
          - G06T2207/10024—Color image
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
          - G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
      - G06T2207/00—Indexing scheme for image analysis or image enhancement
        - G06T2207/20—Special algorithmic details
          - G06T2207/20112—Image segmentation details
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
          - G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
            - G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
              - G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
      - G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
        - G06K9/20—Image acquisition
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
      - G06T7/00—Image analysis
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
      - G06T2207/00—Indexing scheme for image analysis or image enhancement
        - G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
        - G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
      - G06T11/00—2D [Two Dimensional] image generation
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
      - G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects

Similar Documents

Publication	Publication Date	Title
Zhang et al.	2019	A review of deep learning-based semantic segmentation for point cloud
Li et al.	2024	Boundary-enhanced dual-stream network for semantic segmentation of high-resolution remote sensing images
Zhao et al.	2022	Multi-source collaborative enhanced for remote sensing images semantic segmentation
Cheng et al.	2023	A survey on image semantic segmentation using deep learning techniques
Feng et al.	2025	FTransDeepLab: Multimodal Fusion Transformer-Based DeepLabv3+ for Remote Sensing Semantic Segmentation
Wu et al.	2022	Aggregate interactive learning for RGB-D salient object detection
Wang et al.	2021	Learning efficient multi-task stereo matching network with richer feature information
Wang et al.	2024	STCD: efficient Siamese transformers-based change detection method for remote sensing images
Wu et al.	2024	C3TB-YOLOv5: Integrated YOLOv5 with transformer for object detection in high-resolution remote sensing images
Wang et al.	2022	Global contextual guided residual attention network for salient object detection
Zhou et al.	2023	GAF-Net: geometric contextual feature aggregation and adaptive fusion for large-scale point cloud semantic segmentation
Jie et al.	2020	Photovoltaic power station identification using refined encoder–decoder network with channel attention and chained residual dilated convolutions
Zhang et al.	2024	Hvdistill: Transferring knowledge from images to point clouds via unsupervised hybrid-view distillation
Hou et al.	2024	BFFNet: a bidirectional feature fusion network for semantic segmentation of remote sensing objects
Wang et al.	2023	HQDec: self-supervised monocular depth estimation based on a high-quality decoder
Li et al.	2024	AFENet: An Attention-Focused Feature Enhancement Network for the Efficient Semantic Segmentation of Remote Sensing Images
Sang et al.	2024	A Lightweight Network With Latent Representations for UAV Thermal Image Super-Resolution
Guo et al.	2025	CoFiNet: Unveiling camouflaged objects with multi-scale finesse
Qu et al.	2024	MDSC-Net: multi-directional spatial connectivity for road extraction in remote sensing images
Huang et al.	2025	Multi-Scale Semantic Segmentation of Remote Sensing Images Based on Edge Optimization
Zhang et al.	2024	P-msdiff: Parallel multi-scale diffusion for remote sensing image segmentation
Wang et al.	2025	MP-FocalUNet: Multiscale parallel focal self-attention U-Net for medical image segmentation
CN116503746B (en)	2023-09-12	Infrared small target detection method based on multi-layer nested non-full mapping U-shaped network
CN118628723A (en)	2024-09-10	Hyperspectral target detection method based on spectral discrimination information extraction and block-level sample simulation
Zhang et al.	2024	Swin‐fisheye: Object detection for fisheye images