Yun et al., 2022 - Google Patents

Panoramic vision transformer for saliency detection in 360∘ videos

Yun et al., 2022

Document ID: 3748316331390349636
Author: Yun H; Lee S; Kim G
Publication year: 2022
Publication venue: European Conference on Computer Vision

External Links

Cited by

Snippet

Abstract 360∘ video saliency detection is one of the challenging benchmarks for 360∘ video understanding since non-negligible distortion and discontinuity occur in the projection of any format of 360∘ videos, and capture-worthy viewpoint in the omnidirectional sphere is …

Continue reading at www.ecva.net (PDF) (other versions)

230000004438 eyesight 0 title abstract description 27

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass

Similar Documents

Publication	Publication Date	Title
Yun et al.	2022	Panoramic vision transformer for saliency detection in 360∘ videos
Sakaridis et al.	2020	Map-guided curriculum domain adaptation and uncertainty-aware evaluation for semantic nighttime image segmentation
Li et al.	2022	Omnifusion: 360 monocular depth estimation via geometry-aware fusion
Ai et al.	2022	Deep learning for omnidirectional vision: A survey and new perspectives
Zhang et al.	2020	A new haze removal approach for sky/river alike scenes based on external and internal clues
WO2020092276A1 (en)	2020-05-07	Video recognition using multiple modalities
CN115577768A (en)	2023-01-06	Semi-supervised model training method and device
Zhang et al.	2019	Saliency Prediction Network for $360^\circ $ Videos
Meng et al.	2016	Exploiting Random RGB and Sparse Features for Camera Pose Estimation.
Sharjeel et al.	2021	Real time drone detection by moving camera using COROLA and CNN algorithm
Hou et al.	2024	Towards real-time embodied AI agent: a bionic visual encoding framework for mobile robotics
Sun et al.	2024	GGC-SLAM: A VSLAM system based on predicted static probability of feature points in dynamic environments
Wang et al.	2025	Gated image-adaptive network for driving-scene object detection under nighttime conditions
Yang et al.	2022	Towards generic 3d tracking in RGBD videos: Benchmark and baseline
Jain et al.	2021	Generating bird’s eye view from egocentric rgb videos
Zhou et al.	2024	Multi-modal LiDAR point cloud semantic segmentation with salience refinement and boundary perception
Muddamsetty et al.	2018	Salient objects detection in dynamic scenes using color and texture features
Qiao et al.	2024	OARPD: occlusion-aware rotated people detection in overhead fisheye images
Nkrumah et al.	2024	EC-WAMI: Event camera-based pose optimization in remote sensing and wide-area motion imagery
Huang et al.	2021	Deep Multimodal Fusion Autoencoder for Saliency Prediction of RGB‐D Images
Tan et al.	2024	Transformer-based multi-level attention integration network for video saliency prediction
Zeng et al.	2023	SwinEFT: A robust and powerful Swin transformer based event frame tracker
Kong et al.	2022	Self-supervised indoor 360-degree depth estimation via structural regularization
Yun et al.	2022	Panoramic Vision Transformer for Saliency Detection in 360 {\deg} Videos
Rao et al.	2023	A Dual-Path Approach for Gaze Following in Fisheye Meeting Scenes