Clothing Key Points Detection Algorithm Based on Cascade Convolutional Neural Network

被引:2
|
作者
Li Q. [1 ]
Yao L. [1 ]
Guan X. [1 ]
机构
[1] School of Microelectronics, Tianjin University, Tianjin
基金
中国国家自然科学基金;
关键词
Cascade convolution neural network; Dilated convolutions; Hourglass network; Key points detection;
D O I
10.11784/tdxbz201901054
中图分类号
学科分类号
摘要
With the development of deep learning, key points detection using deep convolutional neural networks(CNN) has attracted extensive attention. Although key points detection for human body posture and face recognition has developed rapidly, the application of this technology to clothing faces great challenges because of the variability of the background and posture in clothing pictures. Technology for clothing key points detection has great value for e-commerce and fashion collocation. To apply an algorithm for key point detection to fashion, in this paper, we propose an algorithm for clothing key points detection based on a cascade CNN. This algorithm first detects the key points of clothing and then adjusts difficult key points using a two-level cascade CNN. The first stage of the algorithm detects preliminary key points using ResNet to extract features, and then, to retain more detailed image information, uses dilated convolution to solve the problem of high receptive fields but low spatial resolution in the high-level feature map of a pyramid structure. Using the results from the first stage as a preliminary structure of key points, the accuracy is then improved in the second stage by adjusting the difficult key points by combining them with multi-scale features extracted by an hourglass network. We used the 2018 FashionAI clothing landmark dataset for training and testing in the experiment. The normalized error was reduced to 3.56% in the clothing landmark detection task, which verifies the effectiveness of the network. Compared with the existing algorithm for key points, the algorithm proposed in this paper achieves the best result in the task of clothing key points detection, especially in the detection of difficult key points. © 2020, Editorial Board of Journal of Tianjin University(Science and Technology). All right reserved.
引用
收藏
页码:229 / 236
页数:7
相关论文
共 17 条
  • [1] Liu Z., Yan S., Luo P., Et al., Fashion landmark detection in the wild, Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 229-245, (2016)
  • [2] Wei S.E., Ramakrishna V., Kanade T., Et al., Convolutional pose machines, Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 4724-4732, (2016)
  • [3] Newell A., Yang K., Deng J., Stacked hourglass networks for human pose estimation, Proceedings of 2016 European Conference on Computer Vision, pp. 483-499, (2016)
  • [4] Papandreou G., Zhu T., Kanazawa N., Et al., Towards accurate multi-person pose estimation in the wild, Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3711-3719, (2017)
  • [5] Ren S., He K., Girshick R., Et al., Faster R-CNN: Towards real-time object detection with region proposal networks, Proceedings of 2015 Neural Information Processing Systems, pp. 91-99, (2015)
  • [6] He K., Zhang X., Ren S., Et al., Deep residual learning for image recognition, Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)
  • [7] He K., Gkioxari G., Dollar P., Et al., Mask R-CNN, Proceedings of 2017 IEEE International Conference on Computer Vision, pp. 2980-2988, (2017)
  • [8] Fang H.S., Xie S., Tai Y.W., Et al., RMPE: Regional multi-person pose estimation, Proceedings of 2017 IEEE International Conference on Computer Vision, pp. 2353-2362, (2017)
  • [9] Liu W., Anguelov D., Erhan D., Et al., SSD: Single shot multiBox detector, Proceedings of 2016 European Conference on Computer Vision, pp. 21-37, (2016)
  • [10] Chen Y., Wang Z., Peng Y., Et al., Cascaded pyramid network for multi-person pose estimation, Proceedings of 2018 IEEE Conference on Computer Vision and Pattern Recognition, pp. 7103-7112, (2018)