A New Context-Based Method for Restoring Occluded Text in Natural Scene Images

Cited by: 3
Authors
Mittal, Ayush [1]
Shivakumara, Palaiahnakote [2]
Pal, Umapada [1]
Lu, Tong [3]
Blumenstein, Michael [4]
Lopresti, Daniel [5]
Affiliations
[1] Indian Stat Inst, Comp Vision & Pattern Recognit Unit, Kolkata, India
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[3] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[4] Univ Technol Sydney, Fac Engn & Informat Technol, Ultimo, Australia
[5] Lehigh Univ, Comp Sci & Engn, Bethlehem, PA USA
Source
DOCUMENT ANALYSIS SYSTEMS | 2020 / Vol. 12116
Keywords
Text detection; Occluded image; Annotating natural scene images; Natural language processing; Text recognition; RECOGNITION; NETWORK;
DOI
10.1007/978-3-030-57058-3_33
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Text recognition from natural scene images is an active research area because of its important real-world applications, including multimedia search and retrieval, and scene understanding through computer vision. Portions of text in images are often missed due to occlusion by objects in the background. This paper therefore presents a method for restoring occluded text to improve text recognition performance. The proposed method uses the Google Vision API to obtain labels for input images, and PixelLink-E2E methods to detect text and obtain recognition results. Using these results, the method generates candidate words based on distance measures, employing lexicons created through natural scene text recognition. The semantic similarity between labels and recognition results is extracted, which yields a Global Context Score (GCS). Next, the Natural Language Processing (NLP) system known as BERT is used to extract semantics between candidate words, which yields a Local Context Score (LCS). The global and local context scores are then fused to estimate a ranking for each candidate word. The word with the highest ranking is taken as the correction for the text that is occluded in the image. Experimental results on a dataset assembled from standard natural scene datasets and our resources show that our approach improves text recognition performance significantly.
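The candidate generation and score fusion steps described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the distance measure or the fusion rule, so everything here is an assumption made for illustration: Levenshtein edit distance as the distance measure, a weighted sum as the fusion, and the names `edit_distance`, `generate_candidates`, `rank_candidates`, `max_dist`, and `alpha`. The GCS and LCS values are passed in as plain dictionaries; in the paper they come from label similarity and BERT respectively, which this sketch does not reproduce.

```python
from typing import Dict, List


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance (assumed here; the abstract only says 'distance measures')."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def generate_candidates(partial: str, lexicon: List[str], max_dist: int = 3) -> List[str]:
    """Keep lexicon words within max_dist edits of the partially recognized word."""
    return [w for w in lexicon
            if edit_distance(partial.lower(), w.lower()) <= max_dist]


def rank_candidates(candidates: List[str],
                    gcs: Dict[str, float],
                    lcs: Dict[str, float],
                    alpha: float = 0.5) -> List[str]:
    """Fuse global and local context scores with a weighted sum (an assumed rule)."""
    fused = {w: alpha * gcs.get(w, 0.0) + (1.0 - alpha) * lcs.get(w, 0.0)
             for w in candidates}
    return sorted(candidates, key=lambda w: fused[w], reverse=True)


# Hypothetical example: the recognizer read only "resta" from an occluded sign.
lexicon = ["restaurant", "restart", "pharmacy"]
candidates = generate_candidates("resta", lexicon, max_dist=5)
gcs = {"restaurant": 0.9, "restart": 0.2}  # made-up global context scores
lcs = {"restaurant": 0.6, "restart": 0.7}  # made-up local context scores
best = rank_candidates(candidates, gcs, lcs)[0]
```

With these made-up scores, `best` is `"restaurant"`: the global context (for example, image labels related to food) outweighs the slightly higher local score of `"restart"`. The weight `alpha` is a free parameter of this sketch, not a value reported by the authors.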
Pages: 466-480
Page count: 15