A comparative approach on detecting multi-lingual and multi-oriented text in natural scene images

被引:3
|
作者
Yegnaraman, Aparna [1 ]
Valli, S. [1 ]
机构
[1] Anna Univ, Coll Engn, Dept Comp Sci & Engn, Chennai 600025, Tamil Nadu, India
关键词
Scene text detection; PIoU loss; Genetic algorithm; You only look once; Differentiable binarization; Flexible threshold; LOCALIZATION; RECOGNITION;
D O I
10.1007/s10489-020-01972-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text helps to convey the intended message to users very accurately. Detecting text from natural scene images for quadrilateral-type and polygon-type datasets is the primary scope of this work. A regression-based method using modified You Only Look Once YOLOv4 network is used for quadrilateral-type datasets. Hyperparameters for training the network are optimized using the Genetic Algorithm which proves to be a suitable candidate than traditional methods. The Pixels-IoU (PIoU) loss is introduced to derive an accurate bounding box and it seems to be productive under various challenging scenarios with high aspect ratios and complex background. This yielded quick results for quadrilateral-type datasets but did not scale for arbitrarily-shaped and curved scene text. So the approach is changed to segmentation based for enhancing the results. This introduces binarization operation in a segmentation network to boost its detection accuracy for polygon-type datasets. The introduction of a new module DiffBiSeg (Differentiable Binarization in Segmentation network) facilitates post-processing and text detection performance by setting the thresholds flexibly for binarization in the segmentation network. The efficacy of both approaches is clearly seen in their respective experimental results.
引用
收藏
页码:3696 / 3717
页数:22
相关论文
共 50 条
  • [32] Detecting multi-oriented text with corner-based region proposals
    Deng, Linjie
    Gong, Yanxiang
    Lin, Yi
    Shuai, Jingwen
    Tu, Xiaoguang
    Zhang, Yuefei
    Ma, Zheng
    Xie, Mei
    NEUROCOMPUTING, 2019, 334 : 134 - 142
  • [33] Multi-Oriented Moving Text Detection
    Khare, Vijeta
    Shivakumara, Palaiahnakote
    Raveendran, Paramesaran
    2014 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2014, : 347 - 352
  • [34] REFINETEXT: REFINING MULTI-ORIENTED SCENE TEXT DETECTION WITH A FEATURE REFINEMENT MODULE
    Xie, Pengyuan
    Xiao, Jing
    Cao, Yang
    Zhu, Jia
    Khan, Asad
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1756 - 1761
  • [35] A New Technique for Multi-Oriented Scene Text Line Detection and Tracking in Video
    Wu, Liang
    Shivakumara, Palaiahnakote
    Lu, Tong
    Tan, Chew Lim
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (08) : 1137 - 1152
  • [36] SCALE-INVARIANT MULTI-ORIENTED TEXT DETECTION IN WILD SCENE IMAGE
    Dasgupta, Kinjal
    Das, Sudip
    Bhattacharya, Ujjwal
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2041 - 2045
  • [37] Multi-oriented scene text detection by fixed-width multi-ratio rotation anchors
    Zou, Beiji
    Yang, Wenjun
    Liu, Shu
    Jiang, Lingzi
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 95
  • [38] Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
    Lyu, Pengyuan
    Yao, Cong
    Wu, Wenhao
    Yan, Shuicheng
    Bai, Xiang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7553 - 7563
  • [39] Recognition of Multi-Oriented, Multi-Sized, and Curved Text
    Chiang, Yao-Yi
    Knoblock, Craig A.
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1399 - 1403
  • [40] Location Sensitive Regression Algorithm for Multi-Oriented Scene Text Detection with Focal Loss
    Kuang, Hailan
    Li, Zheng
    Ma, Xiaolin
    Liu, Xinhua
    2019 11TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2019), 2019, : 462 - 466