A New Context-Based Method for Restoring Occluded Text in Natural Scene Images

被引:3
|
作者
Mittal, Ayush [1 ]
Shivakumara, Palaiahnakote [2 ]
Pal, Umapada [1 ]
Lu, Tong [3 ]
Blumenstein, Michael [4 ]
Lopresti, Daniel [5 ]
机构
[1] Indian Stat Inst, Comp Vision & Pattern Recognit Unit, Kolkata, India
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[3] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[4] Univ Technol Sydney, Fac Engn & Informat Technol, Ultimo, Australia
[5] Lehigh Univ, Comp Sci & Engn, Bethlehem, PA USA
来源
DOCUMENT ANALYSIS SYSTEMS | 2020年 / 12116卷
关键词
Text detection; Occluded image; Annotating natural scene images; Natural language processing; Text recognition; RECOGNITION; NETWORK;
D O I
10.1007/978-3-030-57058-3_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text recognition from natural scene images is an active research area because of its important real world applications, including multimedia search and retrieval, and scene understanding through computer vision. It is often the case that portions of text in images are missed due to occlusion with objects in the background. Therefore, this paper presents a method for restoring occluded text to improve text recognition performance. The proposed method uses the GOOGLE Vision API for obtaining labels for input images. We propose to use PixelLink-E2E methods for detecting text and obtaining recognition results. Using these results, the proposed method generates candidate words based on distance measures employing lexicons created through natural scene text recognition. We extract the semantic similarity between labels and recognition results, which results in a Global Context Score (GCS). Next, we use the Natural Language Processing (NLP) system known as BERT for extracting semantics between candidate words, which results in a Local Context Score (LCS). Global and local context scores are then fused for estimating the ranking for each candidate word. The word that gets the highest ranking is taken as the correction for text which is occluded in the image. Experimental results on a dataset assembled from standard natural scene datasets and our resources show that our approach helps to improve the text recognition performance significantly.
引用
收藏
页码:466 / 480
页数:15
相关论文
共 50 条
  • [31] Text Detection and Recognition in Natural Scene Images
    Pise, Amruta
    Ruikar, S. D.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [32] Scene Text Detection in Natural Images: A Review
    Cao, Dongping
    Zhong, Yong
    Wang, Lishun
    He, Yilong
    Dang, Jiachen
    SYMMETRY-BASEL, 2020, 12 (12): : 1 - 26
  • [33] Uyghur Text Detection in Natural Scene Images
    Li, Xinming
    Li, Junfang
    Gao, Qiag
    Yu, Xiao
    2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2019, : 1542 - 1547
  • [34] Text detection and restoration in natural scene images
    Ye, Qixiang
    Hao, Jianbin
    Huang, Jun
    Yu, Hua
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2007, 18 (06) : 504 - 513
  • [35] Automatic text location in natural scene images
    Li, CA
    Ding, XQ
    Wu, YS
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 1069 - 1073
  • [36] Context-based filtering of document images
    Ageenko, E
    Fränti, P
    PATTERN RECOGNITION LETTERS, 2000, 21 (6-7) : 483 - 491
  • [37] A New Method for Arabic Text Detection in Natural Scene Image Based on the Color Homogeneity
    Gaddour, Houda
    Kanoun, Slim
    Vincent, Nicole
    IMAGE AND SIGNAL PROCESSING (ICISP 2016), 2016, 9680 : 127 - 136
  • [38] A multiscale feature fusion method for cursive text detection in natural scene images
    Chandio, Asghar Ali
    Leghari, Mehwish
    Soomro, Muhammad Ali
    Nizamani, Shah Zaman
    Memon, Saifullah
    IMAGING SCIENCE JOURNAL, 2021, 69 (5-8): : 302 - 318
  • [39] A Method for Restoring ?-Radiation Scene Images Based on Spatial Axial Gradient Discrimination
    Li, Kun-Fang
    Feng, Jie
    Li, Yu-Dong
    Wen, Lin
    Kan, Yong-Jia
    Guo, Qi
    ELECTRONICS, 2023, 12 (17)
  • [40] Hierarchical Context-Based Emotion Recognition With Scene Graphs
    Wu, Shichao
    Zhou, Lei
    Hu, Zhengxi
    Liu, Jingtai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3725 - 3739