这是indexloc提供的服务,不要输入任何密码
Skip to main content
Log in

Adaptive distractor-aware for siamese tracking via enhancement confidence evaluator

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Siamese networks have recently attracted much attention due to balancing accuracy and speed in tracking. However, most Siamese trackers emphasize how to learn target features, ignoring background information. In this paper, a novel Siamese-based method via an adaptive distractor-aware and enhancement confidence evaluator is proposed. First, to make full use of the background information, a distractor model is designed, which is used to select the dominant feature channels of less interference for target representation. Additionally, to enable the target model to work well with challenging scenes, an adaptive weights-aware strategy is proposed, which can adjust the feature channel weights online according to the single-convolutional attention network with L2 loss. In addition, a dynamic template update strategy is proposed to adapt target appearance variations. The strategy evaluates the tracking results via a new correlation peak average energy (CPAE) evaluator, which has more confidence for good templates, guiding the fusion of reliable template samples and further enhancing the adaptability of the template and making the tracker more robust. Experimental results on widely used OTB100, TC-128, UAV123, VOT2016 and VOT2018 benchmarks verified that the proposed tracker achieved outstanding performance compared with state-of-the-art trackers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+
from $39.99 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Data Availability

The datasets generated during and analysed during the current study are available from the corresponding author on reasonable request.

References

  1. Voulodimos A, Doulamis N, Doulamis A, Protopapadakis E (2018) Deep learning for computer vision: a brief review. Comput Intell Neurosci 2018

  2. He S, Xu R, Zhao Z, Zou T (2022) Vision-based neural formation tracking control of multiple autonomous vehicles with visibility and performance constraints. Neurocomput 492:651–663

  3. Liu Y, Zhang Y, Hu M, Si P, Xia C (2017) Fast tracking via spatio-temporal context learning based on multi-color attributes and pca. In: 2017 IEEE international conference on information and automation (ICIA). IEEE, pp 398–403

  4. Bertinetto L, Valmadre J, Golodetz S, Miksik O, Torr PH (2016) Staple: complementary learners for real-time tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1401–1409

  5. Jia M, Gao Z, Guo Q, Lin Y, Gu X (2019) Sparse feature learning for correlation filter tracking toward 5g-enabled tactile internet. IEEE Trans Ind Inform 16(3):1904–1913

    Article  Google Scholar 

  6. Ma C, Huang J-B, Yang X, Yang M-H (2015) Hierarchical convolutional features for visual tracking. In: Proceedings of the IEEE international conference on computer vision, pp 3074–3082

  7. Zhang H, Chen J, Nie G, Hu S (2020) Uncertain motion tracking based on convolutional net with semantics estimation and region proposals. Pattern Recognit 102:107232

    Article  Google Scholar 

  8. Tang F, Lu X, Zhang X, Hu S, Zhang H (2019) Deep feature tracking based on interactive multiple model. Neurocomput 333:29–40

    Article  Google Scholar 

  9. Xing X, Qiu F, Xu X, Qing C, Wu Y (2017) Robust object tracking based on sparse representation and incremental weighted pca. Multimedia Tools Appl 76(2):2039–2057

    Article  Google Scholar 

  10. Che M, Wang R, Lu Y, Li Y, Zhi H, Xiong C (2018) Channel pruning for visual tracking. In: Proceedings of the European conference on computer vision (ECCV) workshops, pp 0–0

  11. Danelljan M, Bhat G, Shahbaz Khan F, Felsberg M (2017) Eco: efficient convolution operators for tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6638–6646

  12. Li X, Ma C, Wu B, He Z, Yang M-H (2019) Target-aware deep tracking. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1369–1378

  13. Rahman MM, Fiaz M, Jung SK (2020) Efficient visual tracking with stacked channel-spatial attention learning. IEEE Access 8:100857–100869

    Article  Google Scholar 

  14. Mueller M, Smith N, Ghanem B (2017) Context-aware correlation filter tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1396–1404

  15. Bhat G, Danelljan M, Gool LV, Timofte R (2019) Learning discriminative model prediction for tracking. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6182–6191

  16. Lu X, Li J, He Z, Wang W, Wang H (2019) Distracter-aware tracking via correlation filter. Neurocomputing 348:134–144

    Article  Google Scholar 

  17. Zhu Z, Wang Q, Li B, Wu W, Yan J, Hu W (2018) Distractor-aware siamese networks for visual object tracking. In: Proceedings of the European conference on computer vision (ECCV), pp 101–117

  18. Zhang H, Cheng L, Zhang T, Wang Y, Zhang W, Zhang J (2022) Target-distractor aware deep tracking with discriminative enhancement learning loss. IEEE Trans Circ Syst Video Technol

  19. Tao R, Gavves E, Smeulders AW (2016) Siamese instance search for tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1420–1429

  20. Bertinetto L, Valmadre J, Henriques JF, Vedaldi A, Torr PH (2016) Fully-convolutional siamese networks for object tracking. In: European conference on computer vision. Springer, pp 850–865

  21. Wang Q, Teng Z, Xing J, Gao J, Hu W, Maybank S (2018) Learning attentions: residual attentional siamese network for high performance online visual tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4854–4863

  22. He A, Luo C, Tian X, Zeng W (2018) A twofold siamese network for real-time object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4834–4843

  23. Valmadre J, Bertinetto L, Henriques J, Vedaldi A, Torr PH (2017) End-to-end representation learning for correlation filter based tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2805–2813

  24. Li B, Yan J, Wu W, Zhu Z, Hu X (2018) High performance visual tracking with siamese region proposal network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8971–8980

  25. Li B, Wu W, Wang Q, Zhang F, Xing J, Yan J (2019) Siamrpn++: evolution of siamese visual tracking with very deep networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4282–4291

  26. Gao P, Yuan R, Wang F, Xiao L, Fujita H, Zhang Y (2020) Siamese attentional keypoint network for high performance visual tracking. Knowl-Based Syst 193:105448

    Article  Google Scholar 

  27. Ma C, Huang J-B, Yang X, Yang M-H (2018) Robust visual tracking via hierarchical convolutional features. IEEE Trans Pattern Anal Mach Intell 41(11):2709–2723

    Article  Google Scholar 

  28. Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. Int Conf Learni Represent

  29. Lu X, Tang F, Huo H, Fang T (2019) Learning channel-aware deep regression for object tracking. Pattern Recogn Lett 127:103–109

    Article  Google Scholar 

  30. Huang W, Gu J, Ma X, Li Y (2020) End-to-end multitask siamese network with residual hierarchical attention for real-time object tracking. Appl Intell 50(6):1908–1921

    Article  Google Scholar 

  31. Fiaz M, Mahmood A, Jung SK (2020) Learning soft mask based feature fusion with channel and spatial attention for robust visual object tracking. Sensors 20(14):4021

    Article  Google Scholar 

  32. Guo D, Wang J, Zhao W, Cui Y, Wang Z, Chen S (2021) End-to-end feature fusion siamese network for adaptive visual tracking. IET Image Processi 15(1):91–100

    Article  Google Scholar 

  33. Gao P, Zhang Q, Wang F, Xiao L, Fujita H, Zhang Y (2020) Learning reinforced attentional representation for end-to-end visual tracking. Inf Sci 517:52–67

    Article  Google Scholar 

  34. Sosnovik I, Moskalev A, Smeulders AW (2021) Scale equivariance improves siamese tracking. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 2765–2774

  35. Meng Y, Deng Z, Zhao K, Xu Y, Liu H (2021) Hierarchical correlation siamese network for real-time object tracking. Appl Intell 51(6):3202–3211

    Article  Google Scholar 

  36. Guan H, An Z (2019) Robust online visual tracking via stable and adaptive memories. J Intell Fuzzy Syst 36(6):5521–5531

    Article  Google Scholar 

  37. Zhang Z, Peng H (2019) Deeper and wider siamese networks for real-time visual tracking. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4591–4600

  38. Zhang Z, Zhang Y, Cheng X, Lu G (2021) Siamese network for object tracking with multi-granularity appearance representations. Pattern Recogn 118:108003

    Article  Google Scholar 

  39. Li C, Yang B (2019) Adaptive weighted cnn features integration for correlation filter tracking. IEEE Access 7:76416–76427

    Article  Google Scholar 

  40. Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the european conference on computer vision (ECCV), pp 3–19

  41. Henriques JF, Caseiro R, Martins P, Batista J (2014) High-speed tracking with kernelized correlation filters. IEEE Trans Pattern Anal Mach Intell 37(3):583–596

    Article  Google Scholar 

  42. Wang M, Liu Y, Huang Z (2017) Large margin object tracking with circulant feature maps. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4021–4029

  43. Yuan T, Yang W, Li Q, Wang Y (2021) An anchor-free siamese network with multi-template update for object tracking. Electronics 10(9):1067

    Article  Google Scholar 

  44. Wu Y, Lim J, Yang M-H (2015) Object tracking benchmark. IEEE Trans Pattern Anal Mach Intell 37(09):1834–1848

    Article  Google Scholar 

  45. Li P, Chen B, Ouyang W, Wang D, Yang X, Lu H (2019) Gradnet: gradient-guided network for visual object tracking. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6162–6171

  46. Danelljan M, Hager G, Shahbaz Khan F, Felsberg M (2015) Convolutional features for correlation filter based visual tracking. In: Proceedings of the IEEE international conference on computer vision workshops, pp 58–66

  47. Danelljan M, Hager G, Shahbaz Khan F, Felsberg M (2015) Learning spatially regularized correlation filters for visual tracking. In: Proceedings of the IEEE international conference on computer vision, pp 4310–4318

  48. Danelljan M, Bhat G, Khan FS, Felsberg M (2019) Atom: accurate tracking by overlap maximization. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4660–4669

  49. Liang P, Blasch E, Ling H (2015) Encoding color information for visual tracking: algorithms and benchmark. IEEE Trans Image Process 24(12):5630–5644

    Article  MathSciNet  MATH  Google Scholar 

  50. Guo D, Wang J, Cui Y, Wang Z, Chen S (2020) Siamcar: siamese fully convolutional classification and regression for visual tracking. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6269–6277

  51. Zhang T, Xu C, Yang M-H (2017) Multi-task correlation particle filter for robust object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4335–4343

  52. Song Y, Ma C, Gong L, Zhang J, Lau RW, Yang M-H (2017) Crest: convolutional residual learning for visual tracking. In: Proceedings of the IEEE international conference on computer vision, pp 2555–2564

  53. Kiani Galoogahi H, Fagg A, Lucey S (2017) Learning background-aware correlation filters for visual tracking. In: Proceedings of the IEEE international conference on computer vision, pp 1135–1143

  54. Mueller M, Smith N, Ghanem B (2016) A benchmark and simulator for uav tracking. In: European conference on computer vision. Springer, pp 445–461

  55. Kristan M, Leonardis A, Matas J, Felsberg M, Pflugfelder R, Čehovin Zajc L, Vojir T, Bhat G, Lukezic A, Eldesokey A et al (2016) The visual object tracking vot2016 challenge results. In: Proceedings of the european conference on computer vision (ECCV) workshops

  56. Danelljan M, Robinson A, Shahbaz Khan F, Felsberg M (2016) Beyond correlation filters: learning continuous convolution operators for visual tracking. In: European conference on computer vision. Springer, pp 472–488

  57. Zhang H, Chen J, Nie G, Lin Y, Yang G, Zhang WC (2021) Light regression memory and multi-perspective object special proposals for abrupt motion tracking. Knowl-Based Syst 226:107127

    Article  Google Scholar 

  58. Kristan M, Leonardis A, Matas J, Felsberg M, Pflugfelder R, Čehovin Zajc L, Vojir T, Bhat G, Lukezic A, Eldesokey A et al (2018) The sixth visual object tracking vot2018 challenge results. In: Proceedings of the european conference on computer vision (ECCV) workshops, pp 0–0

  59. Guo Q, Feng W, Zhou C, Huang R, Wan L, Wang S (2017) Learning dynamic siamese network for visual object tracking. In: Proceedings of the IEEE international conference on computer vision, pp 1763–1771

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China under Grant (62272423, 62102373, 62006213, 62072416, 61806181), Program for Science & Technology Innovation Talents in Universities of Henan Province, China (21HASTIT028), Natural Science Foundation of Henan Province, China (202300410495), Zhongyuan Science and Technology Innovation Leadership Program, China (214200510026).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianwei Zhang.

Ethics declarations

Conflicts of interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, H., Zhu, L., Wu, H. et al. Adaptive distractor-aware for siamese tracking via enhancement confidence evaluator. Appl Intell 53, 29223–29241 (2023). https://doi.org/10.1007/s10489-023-05068-4

Download citation

  • Accepted:

  • Published:

  • Version of record:

  • Issue date:

  • DOI: https://doi.org/10.1007/s10489-023-05068-4

Keywords

Profiles

  1. Huanlong Zhang