Deep Metric Learning for Scene Text Detection

被引:0
|
作者
Zhu, Qi-Hai [1 ]
Zhu, Rui [1 ]
Li, Ning [1 ]
Yang, Yu-Bin [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The strong abilities of deep learning models have been shown in the area of text detection in natural scene images. In this paper, we introduce a new method called deep metric learning for scene text detection. We use the triplet loss [1] to replace the traditional loss function (Softmax) and learn a mapping from image regions to a compact Euclidean space where distances correspond to a measure of text similarity. By combining the CNN model with metric learning, we can make reliable binary classification between text regions and non-text ones. We show that the proposed model achieves competitive results on the ICDAR 2003, ICDAR 2011, and ICDAR 2013 datasets, with the F-measure of 0.74, 0.80, and 0.79.
引用
收藏
页码:1025 / 1029
页数:5
相关论文
共 50 条
  • [1] Deep Learning Based Scene Text Detection: A Survey
    Jiang, Wei
    Zhang, Chong-Sheng
    Yin, Xu-Cheng
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (05): : 1152 - 1161
  • [2] Scene Text Detection and Recognition: The Deep Learning Era
    Shangbang Long
    Xin He
    Cong Yao
    [J]. International Journal of Computer Vision, 2021, 129 : 161 - 184
  • [3] Scene Text Detection and Recognition: The Deep Learning Era
    Long, Shangbang
    He, Xin
    Yao, Cong
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (01) : 161 - 184
  • [4] Deep learning for detection of text polarity in natural scene images
    Perepu, Pavan Kumar
    [J]. NEUROCOMPUTING, 2021, 431 : 1 - 6
  • [5] Scene text detection and recognition with advances in deep learning: a survey
    Liu, Xiyan
    Meng, Gaofeng
    Pan, Chunhong
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (02) : 143 - 162
  • [6] Scene text detection and recognition with advances in deep learning: a survey
    Xiyan Liu
    Gaofeng Meng
    Chunhong Pan
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2019, 22 : 143 - 162
  • [7] Deep learning approaches to scene text detection: a comprehensive review
    Khan, Tauseef
    Sarkar, Ram
    Mollah, Ayatullah Faruk
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (05) : 3239 - 3298
  • [8] Deep learning approaches to scene text detection: a comprehensive review
    Tauseef Khan
    Ram Sarkar
    Ayatullah Faruk Mollah
    [J]. Artificial Intelligence Review, 2021, 54 : 3239 - 3298
  • [9] TextField: Learning a Deep Direction Field for Irregular Scene Text Detection
    Xu, Yongchao
    Wang, Yukang
    Zhou, Wei
    Wang, Yongpan
    Yang, Zhibo
    Bai, Xiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (11) : 5566 - 5579
  • [10] Deep Residual Text Detection Network for Scene Text
    Zhu, Xiangyu
    Jiang, Yingying
    Yang, Shuli
    Wang, Xiaobing
    Li, Wei
    Fu, Pei
    Wang, Hua
    Luo, Zhenbo
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 807 - 812