Abstract
In the non-deep learning-based salient object detection methods known so far, the detection effect and robustness based on the background detection method are good. However, results are not desirable in small objects and complex scene images. This paper proposes a salient object detection algorithm, which employs a fusion framework to fuse background and frequency domain features to improve the accuracy of salient object detection. First, an improved background model is proposed for salient object detection to extract the background feature of the image. Simultaneously, the frequency domain features are obtained by the proposed frequency domain algorithm, which combines global information and local details by the Gaussian pyramid algorithm and different filters. Then, within our fusion framework, the fusion operations are guided by the self-attention mechanism to fuse background and multi-scale frequency domain features to obtain the self-attention maps. Finally, this paper introduces a fusion algorithm to derive the final saliency map from the self-attention maps. The results demonstrate that the proposed method consistently outperforms state-of-the-art approaches in four evaluation metrics on six challenging and complicated datasets and improves the accuracy of salient object detection in complex and small object scene images.
Similar content being viewed by others
References
Borji A (2015) What is a salient object? a dataset and a baseline model for salient object detection. IEEE Transactions on Image Processing 24(2):742–756. https://doi.org/10.1109/TIP.2014.2383320
Jia-Ying WU, Sai Y, Jun DU, Hong-Da L (2019) Review of bottom-up salient object detection. Computer Science
Arbeláez P, Maire M, Fowlkes C, Malik J (2011) Contour detection and hierarchical image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 33:898–916
Nyo MT, Mebarek-Oudina F, Hlaing SS, Khan NA (2022) Otsu’s thresholding technique for mri image brain tumor segmentation. Multimedia Tools and Applications 1–13
Haralick RM, Shanmugam K, Dinstein I (1973) Textural features for image classification. IEEE Transactions on Systems, Man, and Cybernetics SMC-3(6):610–621. https://doi.org/10.1109/TSMC.1973.4309314
Huo L, Rao T, Zhang L (2019) Fused feature encoding in convolutional neural network. Multimedia Tools and Applications 78:1635–1648
Xu Q, Li M, Yu M (2019) Learning to rank with relational graph and pointwise constraint for cross-modal retrieval. Soft Computing 23:9413–9427
Pillai MS, Chaudhary G, Khari M, Crespo RG (2021) Real-time image enhancement for an automatic automobile accident detection through cctv using deep learning. Soft Computing 25(18):11929–11940
Rong Li, X Jilkov VP (2003) Survey of maneuvering target tracking. part i. dynamic models. IEEE Transactions on Aerospace and Electronic Systems 39(4):1333–1364. https://doi.org/10.1109/TAES.2003.1261132
Li C, Chen Z, Wu QJ, Liu C (2019) Saliency object detection: integrating reconstruction and prior. Machine Vision and Applications 30(3):397–406
Tong N, Lu H, Zhang Y, Ruan X (2015) Salient object detection via global and local cues. Pattern Recognition 48(10):3258–3267
Yuan Y, Li C, Kim J, Cai W, Feng DD (2017) Reversion correction and regularized random walk ranking for saliency detection. IEEE Transactions on Image Processing 27(3):1311–1322
Qin, Y, Lu, H, Xu, Y, Wang, H.: Saliency detection via cellular automata. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 110–119 (2015)
Perazzi F, Krähenbühl P, Pritch Y, Hornung A (2012) Saliency filters: Contrast based filtering for salient region detection. In: 2012 IEEE Conference on computer vision and pattern recognition, pp 733–740. IEEE
Xu Q, Wang F, Gong Y, Wang Z, Zeng K, Li Q, Luo X (2019) A novel edge-oriented framework for saliency detection enhancement. Image and Vision Computing 87:1–12
Li X, Zhao L, Wei L, Yang MH, Wu F, Zhuang Y, Ling H, Wang J (2016) Deepsaliency: Multi-task deep neural network model for salient object detection. IEEE Transactions on Image Processing 25(8):3919–3930
Li G, Yu Y (2015) Visual saliency based on multiscale deep features. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5455–5463
Lee G, Tai YW, Kim J (2016) Deep saliency with encoded low level distance map and high level features. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 660–668
Wei J, Wang S, Huang Q (2020) F3net: Fusion, feedback and focus for salient object detection. Proceedings of the AAAI Conference on artificial intelligence 34:12321–12328
Gu Y, Wang L, Wang Z, Liu Y, Cheng MM, Lu SP (2020) Pyramid constrained self-attention network for fast video salient object detection. Proceedings of the AAAI Conference on artificial intelligence 34:10869–10876
Liu S, Huang D, Wang Y (2019) Learning spatial fusion for single-shot object detection. arXiv:1911.09516
Wei Y, Wen F, Zhu W, Sun J (2012) Geodesic saliency using background priors. In: European conference on computer vision, pp 29–42. Springer
Zhu W, Liang S, Wei Y, Sun J (2014) Saliency optimization from robust background detection. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 2814–2821
Li J, Levine MD, An X, Xu X, He H (2012) Visual saliency based on scalespace analysis in the frequency domain. IEEE Transactions on Pattern Analysis and Machine Intelligence 35(4):996–1010
Li W, Yang X, Li C, Lu R, Xie X (2020) Fast visual saliency based on multi-scale difference of gaussians fusion in frequency domain. IET Image Processing 14(16):4039–4048
Liu Z, Yang X, Liu Y, Qian Z (2019) Smoke-detection framework for highdefinition video using fused spatial-and frequency-domain features. IEEE Access 7:89687–89701
Song S, Jia Z, Yang J, Kasabov N (2022) Salient detection via the fusion of background-based and multiscale frequency-domain features. Information Sciences 618:53–71
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. Advances in Neural Information Processing Systems 30:5998–6008
Zhang Y, Li K, Li K, Wang L, Zhong B, Fu Y (2018) Image superresolution using very deep residual channel attention networks. In: Proceedings of the european conference on computer vision (ECCV), pp 286–301
Wang F, Jiang M, Qian C, Yang S, Li C, Zhang H,Wang X, Tang X (2017) Residual attention network for image classification. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp. 3156–3164
Chen L, Zhang H, Xiao J, Nie L, Shao J, Liu W, Chua TS (2017) Sca-cnn: Spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 5659–5667
Wang W, Shen J, Shao L, Porikli F (2016) Correspondence driven saliency transfer. IEEE Transactions on Image Processing 25(11):5025–5034. https://doi.org/10.1109/TIP.2016.2601784
Zhang Z, Liang Y, Zheng J, Li K, Ding Z, Sun D (2019) Saliency optimization integrated robust background detection with global ranking. In: International conference on intelligent science and big data engineering, pp 517–528. Springer
Yan Y, Ren J, Sun G, Zhao H, Han J, Li X, Marshall S, Zhan J (2018) Unsupervised image saliency detection with gestalt-laws guided optimization and visual attention based renement. Pattern Recognition 79:65–78
Hou, X, Zhang, L.: Saliency detection: A spectral residual approach. In: 2007 IEEE Conference on Computer Vision and Pattern Recognition, pp 1–8 (2007)
Chenlei Guo, Qi Ma, Liming Zhang (2008) Spatio-temporal saliency detection using phase spectrum of quaternion fourier transform. In: 2008 IEEE Conference on computer vision and pattern recognition, pp 1–8. https://doi.org/10.1109/CVPR.2008.4587715
Wang W, Lai Q, Fu H, Shen J, Ling H, Yang R (2021) Salient object detection in the deep learning era: An in-depth survey. IEEE Transactions on pattern analysis and machine intelligence 1–1. https://doi.org/10.1109/TPAMI.2021.3051099
Chen S, Tan X, Wang B, Lu H, Hu X, Fu Y (2020) Reverse attentionbased residual network for salient object detection. IEEE Transactions on Image Processing 29:3763–3776
Li J, Pan Z, Liu Q, Cui Y, Sun Y (2020) Complementarity-aware attention network for salient object detection. IEEE Transactions on Cybernetics
Li G, Yu Y (2016) Visual saliency detection based on multiscale deep cnn features. IEEE Transactions on Image Processing 25(11):5012–5024
Xu Q, Wang Z, Wang F, Gong Y (2019) Multi-feature fusion cnns for drosophila embryo of interest detection. Physica A: Statistical Mechanics and its Applications 531:121808
Wang C, Dong S, Zhao X, Papanastasiou G, Zhang H, Yang G (2020) Saliencygan: Deep learning semisupervised salient object detection in the fog of iot. IEEE Transactions on Industrial Informatics 16(4):2667–2676. https://doi.org/10.1109/TII.2019.2945362
Kim KS, Zhang D, Kang MC, Ko SJ (2013) Improved simple linear iterative clustering superpixels. In: 2013 IEEE International symposium on consumer electronics (ISCE), pp 259–260. IEEE
Gonzalez RC, Woods RE (2008) Digital image processing. Prentice Hall. International 28(4):484–486
Ramsey JD, Sanchez-Romero R, Glymour C (2014) Non-gaussian methods and high-pass filters in the estimation of effective connections. Neuroimage 84:986–1006
Wang S, Li W, Wang Y, Jiang Y, Jiang S, Zhao R (2012) An improved difference of gaussian filter in face recognition. Journal of Multimedia 7(6):429–433
Adelson EH, Anderson CH, Bergen JR, Burt PJ, Ogden JM (1984) Pyramid methods in image processing. RCA Engineer 29(6):33–41
Wang X, Girshick R, Gupta A, He K (2018) Non-local neural networks. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 7794–7803
Cheng MM, Mitra NJ, Huang X, Torr PHS, Hu SM (2015) Global contrast based salient region detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 37(3):569–582
Shi J, Yan Q, Xu L, Jia J (2015) Hierarchical image saliency detection on extended cssd. IEEE Transactions on Pattern Analysis and Machine Intelligence 38(4):717–729
Alpert S, Galun M, Brandt A, Basri R (2011) Image segmentation by probabilistic bottom-up aggregation and cue integration. IEEE transactions on pattern analysis and machine intelligence 34(2):315–327
Movahedi V, Elder JH (2010) Design and perceptual validation of performance measures for salient object segmentation. In: 2010 IEEE Computer society conference on computer vision and pattern recognition-workshops, pp 49–56. IEEE
Yang C, Zhang L, Lu H, Ruan X, Yang MH (2013) Saliency detection via graph-based manifold ranking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3166–3173
Fan DP, Cheng MM, Liu JJ, Gao SH, Hou Q, Borji A (2018) Salient objects in clutter: Bringing salient object detection to the foreground. In: Proceedings of the european conference on computer vision (ECCV), pp 186–202
Acknowledgements
This research was funded by the National Natural Science Foundation of China with Grant U1803261 and 62261053, and Scientific research plan of universities in Xinjiang Uygur Autonomous Region under grant XJEDU2019Y006.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no Conflict of Interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Song, S., Jia, Z., Shi, F. et al. Saliency optimization fused background feature with frequency domain features. Multimed Tools Appl 83, 40509–40528 (2024). https://doi.org/10.1007/s11042-023-16760-5
Received:
Revised:
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1007/s11042-023-16760-5