Abstract
The problem of pedestrian detection in images and video frames has been extensively investigated over the past decade. However, low performance in complex scenes shows that it remains an open problem. In this paper, we propose to cascade simple Aggregated Channel Features (ACF) and rich Deep Convolutional Neural Network (DCNN) features for efficient and effective pedestrian detection in complex scenes. The ACF-based detector generates candidate pedestrian windows, and the rich DCNN features are used for fine classification. Experiments show that the proposed approach achieves leading performance on the INRIA dataset and performance comparable to the state of the art on the Caltech and ETH datasets.
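The two-stage cascade described in the abstract can be sketched roughly as follows. This is a minimal illustration of the control flow only, not the authors' implementation: `cascade_detect`, `toy_propose`, `toy_classify`, and both thresholds are hypothetical names and values introduced here, and the toy functions merely stand in for a real ACF detector and a real DCNN-based classifier.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)
Image = Sequence[Sequence[float]]


def cascade_detect(
    image: Image,
    propose: Callable[[Image], List[Tuple[Box, float]]],
    classify: Callable[[Image, Box], float],
    proposal_threshold: float = 0.0,  # assumed value, for illustration only
    final_threshold: float = 0.5,     # assumed value, for illustration only
) -> List[Tuple[Box, float]]:
    """Two-stage cascade: cheap proposals first, costly classifier second."""
    detections = []
    for box, cheap_score in propose(image):
        # Stage 1: discard windows that the fast detector scores too low.
        if cheap_score < proposal_threshold:
            continue
        # Stage 2: run the expensive classifier only on surviving windows.
        score = classify(image, box)
        if score >= final_threshold:
            detections.append((box, score))
    # Highest-confidence detections first.
    detections.sort(key=lambda d: d[1], reverse=True)
    return detections


def toy_propose(image: Image) -> List[Tuple[Box, float]]:
    # Hypothetical stand-in for an ACF detector: fixed windows and scores.
    return [((0, 0, 2, 2), 0.9), ((1, 1, 2, 2), -0.5), ((2, 2, 2, 2), 0.3)]


def toy_classify(image: Image, box: Box) -> float:
    # Hypothetical stand-in for DCNN scoring: mean intensity in the window.
    x, y, w, h = box
    pixels = [image[r][c] for r in range(y, y + h) for c in range(x, x + w)]
    return sum(pixels) / len(pixels)


image = [
    [1.0, 1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
result = cascade_detect(image, toy_propose, toy_classify)
```

The point of the design is that the costly second stage runs only on the windows that survive the cheap first stage, which is where a cascade of this kind would be expected to gain its efficiency.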
Acknowledgement
This work was supported in part by the National Basic Research Program of China (973 Program) under Nos. 2011CB706900 and 2010CB731800, and by the National Science Foundation of China under Nos. 61039003, 61271433 and 61202323.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Chen, X., Wei, P., Ke, W., Ye, Q., Jiao, J. (2015). Pedestrian Detection with Deep Convolutional Neural Network. In: Jawahar, C., Shan, S. (eds) Computer Vision - ACCV 2014 Workshops. ACCV 2014. Lecture Notes in Computer Science, vol 9008. Springer, Cham. https://doi.org/10.1007/978-3-319-16628-5_26
DOI: https://doi.org/10.1007/978-3-319-16628-5_26
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16627-8
Online ISBN: 978-3-319-16628-5
eBook Packages: Computer Science (R0)