
Menini et al., 2021 - Google Patents

A real-time online learning framework for joint 3d reconstruction and semantic segmentation of indoor scenes

Document ID
3887832665637444389
Author
Menini D
Kumar S
Oswald M
Sandström E
Sminchisescu C
Van Gool L
Publication year
2021
Publication venue
IEEE Robotics and Automation Letters

Snippet

This letter presents a real-time online vision framework to jointly recover an indoor scene's 3D structure and semantic label. Given noisy depth maps, a camera trajectory, and 2D semantic labels at train time, the proposed deep neural network based approach learns to …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6217 Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46 Extraction of features or characteristics of the image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50 Computer-aided design
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10 Complex mathematical operations
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/30 Subject of image; Context of image processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00 Three dimensional [3D] modelling, e.g. data description of 3D objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00 2D [Two Dimensional] image generation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00 Image coding, e.g. from bit-mapped to non bit-mapped
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/00 3D [Three Dimensional] image rendering
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06Q DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS

Similar Documents

Publication Publication Date Title
Menini et al. A real-time online learning framework for joint 3d reconstruction and semantic segmentation of indoor scenes
Dai et al. Shape completion using 3d-encoder-predictor cnns and shape synthesis
Ost et al. Neural scene graphs for dynamic scenes
Han et al. Image-based 3D object reconstruction: State-of-the-art and trends in the deep learning era
CN112602116B (en) Mapping object instances using video data
Avetisyan et al. End-to-end cad model retrieval and 9dof alignment in 3d scans
Dai et al. Scancomplete: Large-scale scene completion and semantic segmentation for 3d scans
Dai et al. 3dmv: Joint 3d-multi-view prediction for 3d semantic scene segmentation
Nguyen et al. A field model for repairing 3d shapes
Wang et al. Forknet: Multi-branch volumetric semantic completion from a single depth image
Smith et al. Layered motion segmentation and depth ordering by tracking edges
JP2025512722A (en) Systems and methods for generalized scene reconstruction
US11682166B2 (en) Fitting 3D primitives to a high-resolution point cloud
WO2023091249A1 (en) Neural semantic fields for generalizable semantic segmentation of 3d scenes
Liu et al. High-quality textured 3D shape reconstruction with cascaded fully convolutional networks
Vizzo et al. Make it dense: Self-supervised geometric scan completion of sparse 3d lidar scans in large outdoor environments
CN111738092B (en) Method for recovering occluded human body posture sequence based on deep learning
Golla et al. Temporal upsampling of point cloud sequences by optimal transport for plant growth visualization
Chen et al. Circle: Convolutional implicit reconstruction and completion for large-scale indoor scene
Wu et al. Leveraging single-view images for unsupervised 3D point cloud completion
Tung et al. MF3D: Model-free 3D semantic scene parsing
CN119919486A (en) A multi-view object posture estimation and posture optimization method and electronic device
Xiong et al. Self‐supervised depth completion with multi‐view geometric constraints
Wang et al. Online scene semantic understanding based on sparsely correlated network for AR
Hong et al. Real-Time 3D Visual Perception by Cross-Dimensional Refined Learning