Yun et al., 2022 - Google Patents
Panoramic vision transformer for saliency detection in 360∘ videosYun et al., 2022
View PDF- Document ID
- 3748316331390349636
- Author
- Yun H
- Lee S
- Kim G
- Publication year
- Publication venue
- European Conference on Computer Vision
External Links
Snippet
Abstract 360∘ video saliency detection is one of the challenging benchmarks for 360∘ video understanding since non-negligible distortion and discontinuity occur in the projection of any format of 360∘ videos, and capture-worthy viewpoint in the omnidirectional sphere is …
- 230000004438 eyesight 0 title abstract description 27
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Yun et al. | Panoramic vision transformer for saliency detection in 360∘ videos | |
| Sakaridis et al. | Map-guided curriculum domain adaptation and uncertainty-aware evaluation for semantic nighttime image segmentation | |
| Li et al. | Omnifusion: 360 monocular depth estimation via geometry-aware fusion | |
| Ai et al. | Deep learning for omnidirectional vision: A survey and new perspectives | |
| Zhang et al. | A new haze removal approach for sky/river alike scenes based on external and internal clues | |
| WO2020092276A1 (en) | Video recognition using multiple modalities | |
| CN115577768A (en) | Semi-supervised model training method and device | |
| Zhang et al. | Saliency Prediction Network for $360^\circ $ Videos | |
| Meng et al. | Exploiting Random RGB and Sparse Features for Camera Pose Estimation. | |
| Sharjeel et al. | Real time drone detection by moving camera using COROLA and CNN algorithm | |
| Hou et al. | Towards real-time embodied AI agent: a bionic visual encoding framework for mobile robotics | |
| Sun et al. | GGC-SLAM: A VSLAM system based on predicted static probability of feature points in dynamic environments | |
| Wang et al. | Gated image-adaptive network for driving-scene object detection under nighttime conditions | |
| Yang et al. | Towards generic 3d tracking in RGBD videos: Benchmark and baseline | |
| Jain et al. | Generating bird’s eye view from egocentric rgb videos | |
| Zhou et al. | Multi-modal LiDAR point cloud semantic segmentation with salience refinement and boundary perception | |
| Muddamsetty et al. | Salient objects detection in dynamic scenes using color and texture features | |
| Qiao et al. | OARPD: occlusion-aware rotated people detection in overhead fisheye images | |
| Nkrumah et al. | EC-WAMI: Event camera-based pose optimization in remote sensing and wide-area motion imagery | |
| Huang et al. | Deep Multimodal Fusion Autoencoder for Saliency Prediction of RGB‐D Images | |
| Tan et al. | Transformer-based multi-level attention integration network for video saliency prediction | |
| Zeng et al. | SwinEFT: A robust and powerful Swin transformer based event frame tracker | |
| Kong et al. | Self-supervised indoor 360-degree depth estimation via structural regularization | |
| Yun et al. | Panoramic Vision Transformer for Saliency Detection in 360 {\deg} Videos | |
| Rao et al. | A Dual-Path Approach for Gaze Following in Fisheye Meeting Scenes |