Ai et al., 2022 - Google Patents

Deep learning for omnidirectional vision: A survey and new perspectives

Ai et al., 2022

Document ID: 13107992851952866487
Author: Ai H; Cao Z; Zhu J; Bai H; Chen Y; Wang L
Publication year: 2022
Publication venue: arXiv preprint arXiv:2205.10468

External Links

Cited by

Snippet

Omnidirectional image (ODI) data is captured with a 360x180 field-of-view, which is much wider than the pinhole cameras and contains richer spatial information than the conventional planar images. Accordingly, omnidirectional vision has attracted booming …

Continue reading at arxiv.org (PDF) (other versions)

230000004438 eyesight 0 title abstract description 54

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements

Similar Documents

Publication	Publication Date	Title
Ai et al.	2022	Deep learning for omnidirectional vision: A survey and new perspectives
Hu et al.	2022	Deep depth completion from extremely sparse data: A survey
Fan et al.	2021	Point 4d transformer networks for spatio-temporal modeling in point cloud videos
Tosi et al.	2024	How nerfs and 3d gaussian splatting are reshaping slam: a survey
CN112771539B (en)	2023-08-25	Use 3D data predicted from 2D images using neural networks for 3D modeling applications
Wang et al.	2020	Predicting camera viewpoint improves cross-dataset generalization for 3d human pose estimation
Whelan et al.	2015	Real-time large-scale dense RGB-D SLAM with volumetric fusion
Xie et al.	2022	Recent advances in conventional and deep learning-based depth completion: A survey
AU2016266968A1 (en)	2017-11-23	Modelling a three-dimensional space
Yun et al.	2022	Panoramic vision transformer for saliency detection in 360∘ videos
Zhang et al.	2022	Wide-area crowd counting: Multi-view fusion networks for counting in large scenes
Ai et al.	2025	A survey of representation learning, optimization strategies, and applications for omnidirectional vision
Zhu et al.	2024	3d gaussian splatting in robotics: A survey
Pintore et al.	2024	Deep panoramic depth prediction and completion for indoor scenes
CN104463962B (en)	2017-02-22	Three-dimensional scene reconstruction method based on GPS information video
Azzarelli et al.	2025	Intelligent Cinematography: a review of AI research for cinematographic production
CN116721139A (en)	2023-09-08	Generating depth images of image data
Chen et al.	2024	Towards weather-robust 3D human body reconstruction: Millimeter-wave radar-based dataset, benchmark, and multi-modal fusion
Huang et al.	2025	Vipe: Video pose engine for 3d geometric perception
Li et al.	2021	Monocular 3-D Object Detection Based on Depth-Guided Local Convolution for Smart Payment in D2D Systems
Tanner et al.	2022	Large-scale outdoor scene reconstruction and correction with vision
Wang et al.	2021	Papooling: Graph-based position adaptive aggregation of local geometry in point clouds
Jiang et al.	2024	Romnistereo: Recurrent omnidirectional stereo matching
Tian	2023	Effective image enhancement and fast object detection for improved UAV applications
CN117455972A (en)	2024-01-26	UAV ground target positioning method based on monocular depth estimation