Menini et al., 2021 - Google Patents
A real-time online learning framework for joint 3d reconstruction and semantic segmentation of indoor scenesMenini et al., 2021
View PDF- Document ID
- 3887832665637444389
- Author
- Menini D
- Kumar S
- Oswald M
- Sandström E
- Sminchisescu C
- Van Gool L
- Publication year
- Publication venue
- IEEE Robotics and Automation Letters
External Links
Snippet
This letter presents a real-time online vision framework to jointly recover an indoor scene's 3D structure and semantic label. Given noisy depth maps, a camera trajectory, and 2D semantic labels at train time, the proposed deep neural network based approach learns to …
- 230000011218 segmentation 0 title description 28
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding, e.g. from bit-mapped to non bit-mapped
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Menini et al. | A real-time online learning framework for joint 3d reconstruction and semantic segmentation of indoor scenes | |
| Dai et al. | Shape completion using 3d-encoder-predictor cnns and shape synthesis | |
| Ost et al. | Neural scene graphs for dynamic scenes | |
| Han et al. | Image-based 3D object reconstruction: State-of-the-art and trends in the deep learning era | |
| CN112602116B (en) | Mapping object instances using video data | |
| Avetisyan et al. | End-to-end cad model retrieval and 9dof alignment in 3d scans | |
| Dai et al. | Scancomplete: Large-scale scene completion and semantic segmentation for 3d scans | |
| Dai et al. | 3dmv: Joint 3d-multi-view prediction for 3d semantic scene segmentation | |
| Nguyen et al. | A field model for repairing 3d shapes | |
| Wang et al. | Forknet: Multi-branch volumetric semantic completion from a single depth image | |
| Smith et al. | Layered motion segmentation and depth ordering by tracking edges | |
| JP2025512722A (en) | Systems and methods for generalized scene reconstruction - Patents.com | |
| US11682166B2 (en) | Fitting 3D primitives to a high-resolution point cloud | |
| WO2023091249A1 (en) | Neural semantic fields for generalizable semantic segmentation of 3d scenes | |
| Liu et al. | High-quality textured 3D shape reconstruction with cascaded fully convolutional networks | |
| Vizzo et al. | Make it dense: Self-supervised geometric scan completion of sparse 3d lidar scans in large outdoor environments | |
| CN111738092B (en) | Method for recovering occluded human body posture sequence based on deep learning | |
| Golla et al. | Temporal upsampling of point cloud sequences by optimal transport for plant growth visualization | |
| Chen et al. | Circle: Convolutional implicit reconstruction and completion for large-scale indoor scene | |
| Wu et al. | Leveraging single-view images for unsupervised 3D point cloud completion | |
| Tung et al. | MF3D: Model-free 3D semantic scene parsing | |
| CN119919486A (en) | A multi-view object posture estimation and posture optimization method and electronic device | |
| Xiong et al. | Self‐supervised depth completion with multi‐view geometric constraints | |
| Wang et al. | Online scene semantic understanding based on sparsely correlated network for AR | |
| Hong et al. | Real-Time 3D Visual Perception by Cross-Dimensional Refined Learning |