Yun et al., 2022 - Google Patents

Panoramic vision transformer for saliency detection in 360° videos

Document ID
3748316331390349636
Author
Yun H
Lee S
Kim G
Publication year
2022
Publication venue
European Conference on Computer Vision

Snippet

Abstract: 360° video saliency detection is one of the challenging benchmarks for 360° video understanding since non-negligible distortion and discontinuity occur in the projection of any format of 360° videos, and capture-worthy viewpoint in the omnidirectional sphere is …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46 Extraction of features or characteristics of the image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6217 Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10024 Color image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6267 Classification techniques
    • G06K9/6268 Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6201 Matching; Proximity measures
    • G06K9/6202 Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10016 Video; Image sequence
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221 Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00624 Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30781 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F17/30784 Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
    • G06F17/30799 Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/20 Analysis of motion
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00 Subject matter not provided for in other groups of this subclass

Similar Documents

Publication Publication Date Title
Yun et al. Panoramic vision transformer for saliency detection in 360° videos
Sakaridis et al. Map-guided curriculum domain adaptation and uncertainty-aware evaluation for semantic nighttime image segmentation
Li et al. Omnifusion: 360 monocular depth estimation via geometry-aware fusion
Ai et al. Deep learning for omnidirectional vision: A survey and new perspectives
Zhang et al. A new haze removal approach for sky/river alike scenes based on external and internal clues
WO2020092276A1 (en) Video recognition using multiple modalities
CN115577768A (en) Semi-supervised model training method and device
Zhang et al. Saliency Prediction Network for 360° Videos
Meng et al. Exploiting Random RGB and Sparse Features for Camera Pose Estimation.
Sharjeel et al. Real time drone detection by moving camera using COROLA and CNN algorithm
Hou et al. Towards real-time embodied AI agent: a bionic visual encoding framework for mobile robotics
Sun et al. GGC-SLAM: A VSLAM system based on predicted static probability of feature points in dynamic environments
Wang et al. Gated image-adaptive network for driving-scene object detection under nighttime conditions
Yang et al. Towards generic 3d tracking in RGBD videos: Benchmark and baseline
Jain et al. Generating bird’s eye view from egocentric rgb videos
Zhou et al. Multi-modal LiDAR point cloud semantic segmentation with salience refinement and boundary perception
Muddamsetty et al. Salient objects detection in dynamic scenes using color and texture features
Qiao et al. OARPD: occlusion-aware rotated people detection in overhead fisheye images
Nkrumah et al. EC-WAMI: Event camera-based pose optimization in remote sensing and wide-area motion imagery
Huang et al. Deep Multimodal Fusion Autoencoder for Saliency Prediction of RGB‐D Images
Tan et al. Transformer-based multi-level attention integration network for video saliency prediction
Zeng et al. SwinEFT: A robust and powerful Swin transformer based event frame tracker
Kong et al. Self-supervised indoor 360-degree depth estimation via structural regularization
Yun et al. Panoramic Vision Transformer for Saliency Detection in 360° Videos
Rao et al. A Dual-Path Approach for Gaze Following in Fisheye Meeting Scenes