Traffic sign detection and recognition based on residual single shot multibox detector model

被引:2
|
作者
Zhang S.-F. [1 ]
Zhu T. [1 ]
机构
[1] School of Electronical and Information Engineering, Tianjin University, Tianjin
关键词
Coarse-to-Fine; Detection; Multi-scale block; Residual single shot multibox detector (SSD) model; Traffic sign;
D O I
10.3785/j.issn.1008-973X.2019.05.015
中图分类号
U491.4 [交通管制];
学科分类号
0306 ; 0838 ;
摘要
The existing target detection methods were only suitable for large size and few specific types of traffic signs, and showed poor performance on complex traffic scene images. The ResNet101 with strong anti-degradation performance was used as basic network, and then a residual single shot multibox detector (SSD) model added with a number of convolution layers was proposed, in order to conduct multi-scale block detection on high resolution traffic images. A strategy Coarse-to-Fine was adopted to omit the prediction of pure background image blocks, in order to speed up. The target range was narrowed by the initial detection results of the medium scale image block. The other blocks within the target range were detected. All the block results were mapped back to the original image and non-maximum suppression was used to realize accurate recognition. Experiment results showed that the proposed method achieved 94% overall accuracy and 95% overall recall on the public traffic sign dataset Tsinghua-Tencent 100K. The detection ability on traffic sign with different sizes and shapes in multi-resolution images was strong and the proposed model was robust. © 2019, Zhejiang University Press. All right reserved.
引用
收藏
页码:940 / 949
页数:9
相关论文
共 25 条
  • [1] Ruta A., Li Y.M., Liu X.H., Detection, tracking and recognition of traffic signs from video input, Proceedings of the 11th International IEEE Conference on Intelligent Transportation Systems, pp. 55-60, (2008)
  • [2] Abukhait J., Abdel-Qader I., Oh J.S., Et al., Road sign detection and shape recognition invariant to sign defects, Social Science Electronic Publishing, 61, 1, pp. 1-6, (2012)
  • [3] Liu F., Zou Q., Traffic sign detection based on visual attention mechanism, Computer Engineering, 39, 2, pp. 192-196, (2013)
  • [4] Huang Z.Y., Yu Y.L., Gu J., Et al., An efficient method for traffic sign recognition based on extreme learning machine, IEEE Transactions on Cybernetics, 47, 4, pp. 920-933, (2017)
  • [5] Krizhevsky A., Sutskever I., Hinton G.E., ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems 25, pp. 1097-1105, (2012)
  • [6] Girshick R., Donahue J., Darrell T., Et al., Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
  • [7] Girshick R., Fast R-CNN, Proceedings of the 2015 IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)
  • [8] Ren S.Q., He K.M., Girshick R., Et al., Faster R-CNN: towards real-time object detection with region proposal networks, Advances in Neural Information Processing Systems 28, pp. 91-99, (2015)
  • [9] Dai J.F., Li Y., He K.M., Et al., R-FCN: object detection via region-based fully convolutional networks, Advances in Neural Information Processing Systems 29, pp. 379-387, (2016)
  • [10] Redmon J., Divvala S., Girshick R., Et al., You only look once: unified, real-time object detection, Proceedings of the 2016 IEEE Conference on Computer Vision And Pattern, pp. 779-788, (2016)