Menini et al., 2021 - Google Patents

A real-time online learning framework for joint 3d reconstruction and semantic segmentation of indoor scenes

Menini et al., 2021

Document ID: 3887832665637444389
Author: Menini D; Kumar S; Oswald M; Sandström E; Sminchisescu C; Van Gool L
Publication year: 2021
Publication venue: IEEE Robotics and Automation Letters

External Links

Cited by

Snippet

This letter presents a real-time online vision framework to jointly recover an indoor scene's 3D structure and semantic label. Given noisy depth maps, a camera trajectory, and 2D semantic labels at train time, the proposed deep neural network based approach learns to …

Continue reading at arxiv.org (PDF) (other versions)

230000011218 segmentation 0 title description 28

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding, e.g. from bit-mapped to non bit-mapped
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS

Similar Documents

Publication	Publication Date	Title
Menini et al.	2021	A real-time online learning framework for joint 3d reconstruction and semantic segmentation of indoor scenes
Dai et al.	2017	Shape completion using 3d-encoder-predictor cnns and shape synthesis
Ost et al.	2021	Neural scene graphs for dynamic scenes
Han et al.	2019	Image-based 3D object reconstruction: State-of-the-art and trends in the deep learning era
CN112602116B (en)	2025-07-29	Mapping object instances using video data
Avetisyan et al.	2019	End-to-end cad model retrieval and 9dof alignment in 3d scans
Dai et al.	2018	Scancomplete: Large-scale scene completion and semantic segmentation for 3d scans
Dai et al.	2018	3dmv: Joint 3d-multi-view prediction for 3d semantic scene segmentation
Nguyen et al.	2016	A field model for repairing 3d shapes
Wang et al.	2019	Forknet: Multi-branch volumetric semantic completion from a single depth image
Smith et al.	2004	Layered motion segmentation and depth ordering by tracking edges
JP2025512722A (en)	2025-04-22	Systems and methods for generalized scene reconstruction - Patents.com
US11682166B2 (en)	2023-06-20	Fitting 3D primitives to a high-resolution point cloud
WO2023091249A1 (en)	2023-05-25	Neural semantic fields for generalizable semantic segmentation of 3d scenes
Liu et al.	2019	High-quality textured 3D shape reconstruction with cascaded fully convolutional networks
Vizzo et al.	2022	Make it dense: Self-supervised geometric scan completion of sparse 3d lidar scans in large outdoor environments
CN111738092B (en)	2024-03-29	Method for recovering occluded human body posture sequence based on deep learning
Golla et al.	2020	Temporal upsampling of point cloud sequences by optimal transport for plant growth visualization
Chen et al.	2022	Circle: Convolutional implicit reconstruction and completion for large-scale indoor scene
Wu et al.	2023	Leveraging single-view images for unsupervised 3D point cloud completion
Tung et al.	2017	MF3D: Model-free 3D semantic scene parsing
CN119919486A (en)	2025-05-02	A multi-view object posture estimation and posture optimization method and electronic device
Xiong et al.	2023	Self‐supervised depth completion with multi‐view geometric constraints
Wang et al.	2024	Online scene semantic understanding based on sparsely correlated network for AR
Hong et al.	2024	Real-Time 3D Visual Perception by Cross-Dimensional Refined Learning