Deep learning for omnidirectional vision: A survey and new perspectives
Ai et al., 2022
- Document ID
- 13107992851952866487
- Author
- Ai H
- Cao Z
- Zhu J
- Bai H
- Chen Y
- Wang L
- Publication year
- 2022
- Publication venue
- arXiv preprint arXiv:2205.10468
Snippet
Omnidirectional image (ODI) data is captured with a 360°×180° field of view, which is much wider than that of pinhole cameras, and contains richer spatial information than conventional planar images. Accordingly, omnidirectional vision has attracted booming …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Ai et al. | Deep learning for omnidirectional vision: A survey and new perspectives | |
| Hu et al. | Deep depth completion from extremely sparse data: A survey | |
| Fan et al. | Point 4D transformer networks for spatio-temporal modeling in point cloud videos | |
| Tosi et al. | How NeRFs and 3D Gaussian splatting are reshaping SLAM: A survey | |
| CN112771539B (en) | Use 3D data predicted from 2D images using neural networks for 3D modeling applications | |
| Wang et al. | Predicting camera viewpoint improves cross-dataset generalization for 3D human pose estimation | |
| Whelan et al. | Real-time large-scale dense RGB-D SLAM with volumetric fusion | |
| Xie et al. | Recent advances in conventional and deep learning-based depth completion: A survey | |
| AU2016266968A1 (en) | Modelling a three-dimensional space | |
| Yun et al. | Panoramic vision transformer for saliency detection in 360° videos | |
| Zhang et al. | Wide-area crowd counting: Multi-view fusion networks for counting in large scenes | |
| Ai et al. | A survey of representation learning, optimization strategies, and applications for omnidirectional vision | |
| Zhu et al. | 3D Gaussian splatting in robotics: A survey | |
| Pintore et al. | Deep panoramic depth prediction and completion for indoor scenes | |
| CN104463962B (en) | Three-dimensional scene reconstruction method based on GPS information video | |
| Azzarelli et al. | Intelligent Cinematography: a review of AI research for cinematographic production | |
| CN116721139A (en) | Generating depth images of image data | |
| Chen et al. | Towards weather-robust 3D human body reconstruction: Millimeter-wave radar-based dataset, benchmark, and multi-modal fusion | |
| Huang et al. | ViPE: Video pose engine for 3D geometric perception | |
| Li et al. | Monocular 3-D Object Detection Based on Depth-Guided Local Convolution for Smart Payment in D2D Systems | |
| Tanner et al. | Large-scale outdoor scene reconstruction and correction with vision | |
| Wang et al. | PAPooling: Graph-based position adaptive aggregation of local geometry in point clouds | |
| Jiang et al. | RomniStereo: Recurrent omnidirectional stereo matching | |
| Tian | Effective image enhancement and fast object detection for improved UAV applications | |
| CN117455972A (en) | UAV ground target positioning method based on monocular depth estimation |