A comparative approach on detecting multi-lingual and multi-oriented text in natural scene images

被引:3
|
作者
Yegnaraman, Aparna [1 ]
Valli, S. [1 ]
机构
[1] Anna Univ, Coll Engn, Dept Comp Sci & Engn, Chennai 600025, Tamil Nadu, India
关键词
Scene text detection; PIoU loss; Genetic algorithm; You only look once; Differentiable binarization; Flexible threshold; LOCALIZATION; RECOGNITION;
D O I
10.1007/s10489-020-01972-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text helps to convey the intended message to users very accurately. Detecting text from natural scene images for quadrilateral-type and polygon-type datasets is the primary scope of this work. A regression-based method using modified You Only Look Once YOLOv4 network is used for quadrilateral-type datasets. Hyperparameters for training the network are optimized using the Genetic Algorithm which proves to be a suitable candidate than traditional methods. The Pixels-IoU (PIoU) loss is introduced to derive an accurate bounding box and it seems to be productive under various challenging scenarios with high aspect ratios and complex background. This yielded quick results for quadrilateral-type datasets but did not scale for arbitrarily-shaped and curved scene text. So the approach is changed to segmentation based for enhancing the results. This introduces binarization operation in a segmentation network to boost its detection accuracy for polygon-type datasets. The introduction of a new module DiffBiSeg (Differentiable Binarization in Segmentation network) facilitates post-processing and text detection performance by setting the thresholds flexibly for binarization in the segmentation network. The efficacy of both approaches is clearly seen in their respective experimental results.
引用
收藏
页码:3696 / 3717
页数:22
相关论文
共 50 条
  • [1] A comparative approach on detecting multi-lingual and multi-oriented text in natural scene images
    Aparna Yegnaraman
    S. Valli
    Applied Intelligence, 2021, 51 : 3696 - 3717
  • [2] Multi-Oriented and Multi-Lingual Scene Text Detection With Direct Regression
    He, Wenhao
    Zhang, Xu-Yao
    Yin, Fei
    Liu, Cheng-Lin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5406 - 5419
  • [3] MULTI-ORIENTED TEXT DETECTION IN SCENE IMAGES
    Basavanna, M.
    Shivakumara, P.
    Srivatsa, S. K.
    Kumar, G. Hemantha
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [4] Multi-oriented text detection and verification in video frames and scene images
    Sain, Aneeshan
    Bhunia, Ayan Kumar
    Roy, Partha Pratim
    Pal, Umapada
    NEUROCOMPUTING, 2018, 275 : 1531 - 1549
  • [5] Multi-lingual scene text detection and language identification
    Saha, Shaswata
    Chakraborty, Neelotpal
    Kundu, Soumyadeep
    Paul, Sayantan
    Mollah, Ayatullah Faruk
    Basu, Subhadip
    Sarkar, Ram
    PATTERN RECOGNITION LETTERS, 2020, 138 : 16 - 22
  • [6] Script independent approach for multi-oriented text detection in scene image
    Dey, Sounak
    Shivakumara, Palaiahnakote
    Raghunandan, K. S.
    Pal, Umapada
    Lu, Tong
    Kumar, G. Hemantha
    Chan, Chee Seng
    NEUROCOMPUTING, 2017, 242 : 96 - 112
  • [7] Language identification from multi-lingual scene text images: a CNN based classifier ensemble approach
    Neelotpal Chakraborty
    Soumyadeep Kundu
    Sayantan Paul
    Ayatullah Faruk Mollah
    Subhadip Basu
    Ram Sarkar
    Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 7997 - 8008
  • [8] Language identification from multi-lingual scene text images: a CNN based classifier ensemble approach
    Chakraborty, Neelotpal
    Kundu, Soumyadeep
    Paul, Sayantan
    Mollah, Ayatullah Faruk
    Basu, Subhadip
    Sarkar, Ram
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (07) : 7997 - 8008
  • [9] MOSTL: An Accurate Multi-Oriented Scene Text Localization
    Fatemeh Naiemi
    Vahid Ghods
    Hassan Khalesi
    Circuits, Systems, and Signal Processing, 2021, 40 : 4452 - 4473
  • [10] A Method for Multi-Oriented Thai Text Localization in Natural Scene Images using Convolutional Neural Network
    Kobchaisawat, Thananop
    Chalidabhongse, Thanarat H.
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (ICSIPA), 2015, : 220 - 225