Abstract
Siamese networks have recently attracted much attention due to balancing accuracy and speed in tracking. However, most Siamese trackers emphasize how to learn target features, ignoring background information. In this paper, a novel Siamese-based method via an adaptive distractor-aware and enhancement confidence evaluator is proposed. First, to make full use of the background information, a distractor model is designed, which is used to select the dominant feature channels of less interference for target representation. Additionally, to enable the target model to work well with challenging scenes, an adaptive weights-aware strategy is proposed, which can adjust the feature channel weights online according to the single-convolutional attention network with L2 loss. In addition, a dynamic template update strategy is proposed to adapt target appearance variations. The strategy evaluates the tracking results via a new correlation peak average energy (CPAE) evaluator, which has more confidence for good templates, guiding the fusion of reliable template samples and further enhancing the adaptability of the template and making the tracker more robust. Experimental results on widely used OTB100, TC-128, UAV123, VOT2016 and VOT2018 benchmarks verified that the proposed tracker achieved outstanding performance compared with state-of-the-art trackers.
Similar content being viewed by others
Data Availability
The datasets generated during and analysed during the current study are available from the corresponding author on reasonable request.
References
Voulodimos A, Doulamis N, Doulamis A, Protopapadakis E (2018) Deep learning for computer vision: a brief review. Comput Intell Neurosci 2018
He S, Xu R, Zhao Z, Zou T (2022) Vision-based neural formation tracking control of multiple autonomous vehicles with visibility and performance constraints. Neurocomput 492:651–663
Liu Y, Zhang Y, Hu M, Si P, Xia C (2017) Fast tracking via spatio-temporal context learning based on multi-color attributes and pca. In: 2017 IEEE international conference on information and automation (ICIA). IEEE, pp 398–403
Bertinetto L, Valmadre J, Golodetz S, Miksik O, Torr PH (2016) Staple: complementary learners for real-time tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1401–1409
Jia M, Gao Z, Guo Q, Lin Y, Gu X (2019) Sparse feature learning for correlation filter tracking toward 5g-enabled tactile internet. IEEE Trans Ind Inform 16(3):1904–1913
Ma C, Huang J-B, Yang X, Yang M-H (2015) Hierarchical convolutional features for visual tracking. In: Proceedings of the IEEE international conference on computer vision, pp 3074–3082
Zhang H, Chen J, Nie G, Hu S (2020) Uncertain motion tracking based on convolutional net with semantics estimation and region proposals. Pattern Recognit 102:107232
Tang F, Lu X, Zhang X, Hu S, Zhang H (2019) Deep feature tracking based on interactive multiple model. Neurocomput 333:29–40
Xing X, Qiu F, Xu X, Qing C, Wu Y (2017) Robust object tracking based on sparse representation and incremental weighted pca. Multimedia Tools Appl 76(2):2039–2057
Che M, Wang R, Lu Y, Li Y, Zhi H, Xiong C (2018) Channel pruning for visual tracking. In: Proceedings of the European conference on computer vision (ECCV) workshops, pp 0–0
Danelljan M, Bhat G, Shahbaz Khan F, Felsberg M (2017) Eco: efficient convolution operators for tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6638–6646
Li X, Ma C, Wu B, He Z, Yang M-H (2019) Target-aware deep tracking. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1369–1378
Rahman MM, Fiaz M, Jung SK (2020) Efficient visual tracking with stacked channel-spatial attention learning. IEEE Access 8:100857–100869
Mueller M, Smith N, Ghanem B (2017) Context-aware correlation filter tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1396–1404
Bhat G, Danelljan M, Gool LV, Timofte R (2019) Learning discriminative model prediction for tracking. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6182–6191
Lu X, Li J, He Z, Wang W, Wang H (2019) Distracter-aware tracking via correlation filter. Neurocomputing 348:134–144
Zhu Z, Wang Q, Li B, Wu W, Yan J, Hu W (2018) Distractor-aware siamese networks for visual object tracking. In: Proceedings of the European conference on computer vision (ECCV), pp 101–117
Zhang H, Cheng L, Zhang T, Wang Y, Zhang W, Zhang J (2022) Target-distractor aware deep tracking with discriminative enhancement learning loss. IEEE Trans Circ Syst Video Technol
Tao R, Gavves E, Smeulders AW (2016) Siamese instance search for tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1420–1429
Bertinetto L, Valmadre J, Henriques JF, Vedaldi A, Torr PH (2016) Fully-convolutional siamese networks for object tracking. In: European conference on computer vision. Springer, pp 850–865
Wang Q, Teng Z, Xing J, Gao J, Hu W, Maybank S (2018) Learning attentions: residual attentional siamese network for high performance online visual tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4854–4863
He A, Luo C, Tian X, Zeng W (2018) A twofold siamese network for real-time object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4834–4843
Valmadre J, Bertinetto L, Henriques J, Vedaldi A, Torr PH (2017) End-to-end representation learning for correlation filter based tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2805–2813
Li B, Yan J, Wu W, Zhu Z, Hu X (2018) High performance visual tracking with siamese region proposal network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8971–8980
Li B, Wu W, Wang Q, Zhang F, Xing J, Yan J (2019) Siamrpn++: evolution of siamese visual tracking with very deep networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4282–4291
Gao P, Yuan R, Wang F, Xiao L, Fujita H, Zhang Y (2020) Siamese attentional keypoint network for high performance visual tracking. Knowl-Based Syst 193:105448
Ma C, Huang J-B, Yang X, Yang M-H (2018) Robust visual tracking via hierarchical convolutional features. IEEE Trans Pattern Anal Mach Intell 41(11):2709–2723
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. Int Conf Learni Represent
Lu X, Tang F, Huo H, Fang T (2019) Learning channel-aware deep regression for object tracking. Pattern Recogn Lett 127:103–109
Huang W, Gu J, Ma X, Li Y (2020) End-to-end multitask siamese network with residual hierarchical attention for real-time object tracking. Appl Intell 50(6):1908–1921
Fiaz M, Mahmood A, Jung SK (2020) Learning soft mask based feature fusion with channel and spatial attention for robust visual object tracking. Sensors 20(14):4021
Guo D, Wang J, Zhao W, Cui Y, Wang Z, Chen S (2021) End-to-end feature fusion siamese network for adaptive visual tracking. IET Image Processi 15(1):91–100
Gao P, Zhang Q, Wang F, Xiao L, Fujita H, Zhang Y (2020) Learning reinforced attentional representation for end-to-end visual tracking. Inf Sci 517:52–67
Sosnovik I, Moskalev A, Smeulders AW (2021) Scale equivariance improves siamese tracking. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 2765–2774
Meng Y, Deng Z, Zhao K, Xu Y, Liu H (2021) Hierarchical correlation siamese network for real-time object tracking. Appl Intell 51(6):3202–3211
Guan H, An Z (2019) Robust online visual tracking via stable and adaptive memories. J Intell Fuzzy Syst 36(6):5521–5531
Zhang Z, Peng H (2019) Deeper and wider siamese networks for real-time visual tracking. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4591–4600
Zhang Z, Zhang Y, Cheng X, Lu G (2021) Siamese network for object tracking with multi-granularity appearance representations. Pattern Recogn 118:108003
Li C, Yang B (2019) Adaptive weighted cnn features integration for correlation filter tracking. IEEE Access 7:76416–76427
Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the european conference on computer vision (ECCV), pp 3–19
Henriques JF, Caseiro R, Martins P, Batista J (2014) High-speed tracking with kernelized correlation filters. IEEE Trans Pattern Anal Mach Intell 37(3):583–596
Wang M, Liu Y, Huang Z (2017) Large margin object tracking with circulant feature maps. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4021–4029
Yuan T, Yang W, Li Q, Wang Y (2021) An anchor-free siamese network with multi-template update for object tracking. Electronics 10(9):1067
Wu Y, Lim J, Yang M-H (2015) Object tracking benchmark. IEEE Trans Pattern Anal Mach Intell 37(09):1834–1848
Li P, Chen B, Ouyang W, Wang D, Yang X, Lu H (2019) Gradnet: gradient-guided network for visual object tracking. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6162–6171
Danelljan M, Hager G, Shahbaz Khan F, Felsberg M (2015) Convolutional features for correlation filter based visual tracking. In: Proceedings of the IEEE international conference on computer vision workshops, pp 58–66
Danelljan M, Hager G, Shahbaz Khan F, Felsberg M (2015) Learning spatially regularized correlation filters for visual tracking. In: Proceedings of the IEEE international conference on computer vision, pp 4310–4318
Danelljan M, Bhat G, Khan FS, Felsberg M (2019) Atom: accurate tracking by overlap maximization. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4660–4669
Liang P, Blasch E, Ling H (2015) Encoding color information for visual tracking: algorithms and benchmark. IEEE Trans Image Process 24(12):5630–5644
Guo D, Wang J, Cui Y, Wang Z, Chen S (2020) Siamcar: siamese fully convolutional classification and regression for visual tracking. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6269–6277
Zhang T, Xu C, Yang M-H (2017) Multi-task correlation particle filter for robust object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4335–4343
Song Y, Ma C, Gong L, Zhang J, Lau RW, Yang M-H (2017) Crest: convolutional residual learning for visual tracking. In: Proceedings of the IEEE international conference on computer vision, pp 2555–2564
Kiani Galoogahi H, Fagg A, Lucey S (2017) Learning background-aware correlation filters for visual tracking. In: Proceedings of the IEEE international conference on computer vision, pp 1135–1143
Mueller M, Smith N, Ghanem B (2016) A benchmark and simulator for uav tracking. In: European conference on computer vision. Springer, pp 445–461
Kristan M, Leonardis A, Matas J, Felsberg M, Pflugfelder R, Čehovin Zajc L, Vojir T, Bhat G, Lukezic A, Eldesokey A et al (2016) The visual object tracking vot2016 challenge results. In: Proceedings of the european conference on computer vision (ECCV) workshops
Danelljan M, Robinson A, Shahbaz Khan F, Felsberg M (2016) Beyond correlation filters: learning continuous convolution operators for visual tracking. In: European conference on computer vision. Springer, pp 472–488
Zhang H, Chen J, Nie G, Lin Y, Yang G, Zhang WC (2021) Light regression memory and multi-perspective object special proposals for abrupt motion tracking. Knowl-Based Syst 226:107127
Kristan M, Leonardis A, Matas J, Felsberg M, Pflugfelder R, Čehovin Zajc L, Vojir T, Bhat G, Lukezic A, Eldesokey A et al (2018) The sixth visual object tracking vot2018 challenge results. In: Proceedings of the european conference on computer vision (ECCV) workshops, pp 0–0
Guo Q, Feng W, Zhou C, Huang R, Wan L, Wang S (2017) Learning dynamic siamese network for visual object tracking. In: Proceedings of the IEEE international conference on computer vision, pp 1763–1771
Acknowledgements
This work is supported by the National Natural Science Foundation of China under Grant (62272423, 62102373, 62006213, 62072416, 61806181), Program for Science & Technology Innovation Talents in Universities of Henan Province, China (21HASTIT028), Natural Science Foundation of Henan Province, China (202300410495), Zhongyuan Science and Technology Innovation Leadership Program, China (214200510026).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, H., Zhu, L., Wu, H. et al. Adaptive distractor-aware for siamese tracking via enhancement confidence evaluator. Appl Intell 53, 29223–29241 (2023). https://doi.org/10.1007/s10489-023-05068-4
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1007/s10489-023-05068-4