Feng et al., 2025 - Google Patents
FTransDeepLab: Multimodal Fusion Transformer-Based DeepLabv3+ for Remote Sensing Semantic SegmentationFeng et al., 2025
- Document ID
- 6002085617372844893
- Author
- Feng H
- Hu Q
- Zhao P
- Wang S
- Ai M
- Zheng D
- Liu T
- Publication year
- Publication venue
- IEEE Transactions on Geoscience and Remote Sensing
External Links
Snippet
High-resolution remote sensing images contain rich color and texture information, but due to the inherent limitations of 2-D data, achieving high-quality semantic segmentation remains a challenge. Multimodal data fusion technology has emerged as an effective approach to …
- 230000004927 fusion 0 title abstract description 82
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Zhang et al. | A review of deep learning-based semantic segmentation for point cloud | |
| Li et al. | Boundary-enhanced dual-stream network for semantic segmentation of high-resolution remote sensing images | |
| Zhao et al. | Multi-source collaborative enhanced for remote sensing images semantic segmentation | |
| Cheng et al. | A survey on image semantic segmentation using deep learning techniques | |
| Feng et al. | FTransDeepLab: Multimodal Fusion Transformer-Based DeepLabv3+ for Remote Sensing Semantic Segmentation | |
| Wu et al. | Aggregate interactive learning for RGB-D salient object detection | |
| Wang et al. | Learning efficient multi-task stereo matching network with richer feature information | |
| Wang et al. | STCD: efficient Siamese transformers-based change detection method for remote sensing images | |
| Wu et al. | C3TB-YOLOv5: Integrated YOLOv5 with transformer for object detection in high-resolution remote sensing images | |
| Wang et al. | Global contextual guided residual attention network for salient object detection | |
| Zhou et al. | GAF-Net: geometric contextual feature aggregation and adaptive fusion for large-scale point cloud semantic segmentation | |
| Jie et al. | Photovoltaic power station identification using refined encoder–decoder network with channel attention and chained residual dilated convolutions | |
| Zhang et al. | Hvdistill: Transferring knowledge from images to point clouds via unsupervised hybrid-view distillation | |
| Hou et al. | BFFNet: a bidirectional feature fusion network for semantic segmentation of remote sensing objects | |
| Wang et al. | HQDec: self-supervised monocular depth estimation based on a high-quality decoder | |
| Li et al. | AFENet: An Attention-Focused Feature Enhancement Network for the Efficient Semantic Segmentation of Remote Sensing Images. | |
| Sang et al. | A Lightweight Network With Latent Representations for UAV Thermal Image Super-Resolution | |
| Guo et al. | CoFiNet: Unveiling camouflaged objects with multi-scale finesse | |
| Qu et al. | MDSC-Net: multi-directional spatial connectivity for road extraction in remote sensing images | |
| Huang et al. | Multi-Scale Semantic Segmentation of Remote Sensing Images Based on Edge Optimization | |
| Zhang et al. | P-msdiff: Parallel multi-scale diffusion for remote sensing image segmentation | |
| Wang et al. | MP-FocalUNet: Multiscale parallel focal self-attention U-Net for medical image segmentation | |
| CN116503746B (en) | Infrared small target detection method based on multi-layer nested non-full mapping U-shaped network | |
| CN118628723A (en) | Hyperspectral target detection method based on spectral discrimination information extraction and block-level sample simulation | |
| Zhang et al. | Swin‐fisheye: Object detection for fisheye images |