Abstract
Visual object tracking is a challenging problem in computer vision. Although the correlation filter-based trackers have achieved competitive results both on accuracy and robustness in recent years, there is still a need to improve the overall tracking performance. In this paper, to tackle the problems caused by Spatially Regularized Discriminative Correlation Filter (SRDCF), we suggest a single-sample-integrated training scheme which utilizes information of the previous frames and the current frame to improve the robustness of training samples. Moreover, manually designed features and deep convolutional features are integrated together to further boost the overall tracking capacity. To optimize the translation filter, we develop an alternating direction method of multipliers (ADMM) algorithm. Besides, we introduce a scale adaptively filter to directly learn the appearance changes induced by variations in the target scale. Extensive empirical evaluations on the TB-50, TB-100 and OTB-2013 datasets demonstrate that the proposed tracker is very promising for various challenging scenarios.
Supported by National Natural Science Foundation of China (No. 61672546 and No. 61573385), and Guangzhou Science and Technology Project (No. 201707010127).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bertinetto, L., Valmadre, J., Golodetz, S., Miksik, O., Torr, P.H.S.: Staple: complementary learners for real-time tracking. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, June 2016. https://doi.org/10.1109/cvpr.2016.156
Bolme, D., Beveridge, J.R., Draper, B.A., Lui, Y.M.: Visual object tracking using adaptive correlation filters. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, June 2010. https://doi.org/10.1109/cvpr.2010.5539960
Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets. In: Proceedings of the British Machine Vision Conference 2014. British Machine Vision Association (2014). https://doi.org/10.5244/c.28.6
Cimpoi, M., Maji, S., Vedaldi, A.: Deep filter banks for texture recognition and segmentation. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, June 2015. https://doi.org/10.1109/cvpr.2015.7299007
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2005). https://doi.org/10.1109/cvpr.2005.177
Danelljan, M., Hager, G., Khan, F.S., Felsberg, M.: Learning spatially regularized correlation filters for visual tracking. In: 2015 IEEE International Conference on Computer Vision (ICCV). IEEE, December 2015. https://doi.org/10.1109/iccv.2015.490
Danelljan, M., Hager, G., Khan, F.S., Felsberg, M.: Discriminative scale space tracking. IEEE Trans. Pattern Anal. Mach. Intell. 39(8), 1561–1575 (2017). https://doi.org/10.1109/tpami.2016.2609928
Danelljan, M., Häger, G., Khan, F.S., Felsberg, M.: Accurate scale estimation for robust visual tracking. In: Proceedings of the British Machine Vision Conference 2014. British Machine Vision Association (2014). https://doi.org/10.5244/c.28.65
Henriques, J.F., Caseiro, R., Martins, P., Batista, J.: Exploiting the circulant structure of tracking-by-detection with kernels. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7575, pp. 702–715. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33765-9_50
Henriques, J.F., Caseiro, R., Martins, P., Batista, J.: High-speed tracking with kernelized correlation filters. IEEE Trans. Pattern Anal. Mach. Intell. 37(3), 583–596 (2015). https://doi.org/10.1109/tpami.2014.2345390
Li, F., Tian, C., Zuo, W., Zhang, L., Yang, M.H.: Learning spatial-temporal regularized correlation filters for visual tracking. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, June 2018. https://doi.org/10.1109/cvpr.2018.00515
Li, Y., Zhu, J.: A scale adaptive kernel correlation filter tracker with feature integration. In: Agapito, L., Bronstein, M.M., Rother, C. (eds.) ECCV 2014. LNCS, vol. 8926, pp. 254–265. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16181-5_18
Liu, L., Shen, C., van den Hengel, A.: The treasure beneath convolutional layers: cross-convolutional-layer pooling for image classification. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, June 2015. https://doi.org/10.1109/cvpr.2015.7299107
Ma, C., Yang, X., Zhang, C., Yang, M.H.: Long-term correlation tracking. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, June 2015. https://doi.org/10.1109/cvpr.2015.7299177
Mueller, M., Smith, N., Ghanem, B.: Context-aware correlation filter tracking. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, July 2017. https://doi.org/10.1109/cvpr.2017.152
Sun, S., Guo, Q., Dong, F., Lei, B.: On-line boosting based real-time tracking with efficient HOG. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, May 2013. https://doi.org/10.1109/icassp.2013.6638064
van de Weijer, J., Schmid, C., Verbeek, J., Larlus, D.: Learning color names for real-world applications. IEEE Trans. Image Process. 18(7), 1512–1523 (2009). https://doi.org/10.1109/tip.2009.2019809
Wu, Y., Lim, J., Yang, M.H.: Online object tracking: a benchmark. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, June 2013. https://doi.org/10.1109/cvpr.2013.312
Wu, Y., Lim, J., Yang, M.H.: Object tracking benchmark. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1834–1848 (2015)
Yi, Y., Cheng, Y., Xu, C.: Visual tracking based on hierarchical framework and sparse representation. Multimed. Tools Appl. 77(13), 16267–16289 (2018). https://doi.org/10.1007/s11042-017-5198-4
Yi, Y., Luo, L., Zheng, Z.: Single online visual object tracking with enhanced tracking and detection learning. Multimed. Tools Appl. 78(9), 12333–12351 (2019). https://doi.org/10.1007/s11042-018-6787-6
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
She, Y., Yi, Y. (2020). Learning Multi-feature Based Spatially Regularized and Scale Adaptive Correlation Filters for Visual Tracking. In: Ro, Y., et al. MultiMedia Modeling. MMM 2020. Lecture Notes in Computer Science(), vol 11961. Springer, Cham. https://doi.org/10.1007/978-3-030-37731-1_39
Download citation
DOI: https://doi.org/10.1007/978-3-030-37731-1_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37730-4
Online ISBN: 978-3-030-37731-1
eBook Packages: Computer ScienceComputer Science (R0)