A New Context-Based Method for Restoring Occluded Text in Natural Scene Images

被引：3

作者：

Mittal, Ayush ^{[1
]}

Shivakumara, Palaiahnakote ^{[2
]}

Pal, Umapada ^{[1
]}

Lu, Tong ^{[3
]}

Blumenstein, Michael ^{[4
]}

Lopresti, Daniel ^{[5
]}

机构：

[1] Indian Stat Inst, Comp Vision & Pattern Recognit Unit, Kolkata, India

[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia

[3] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China

[4] Univ Technol Sydney, Fac Engn & Informat Technol, Ultimo, Australia

[5] Lehigh Univ, Comp Sci & Engn, Bethlehem, PA USA

来源：

DOCUMENT ANALYSIS SYSTEMS | 2020年 / 12116卷

关键词：

Text detection; Occluded image; Annotating natural scene images; Natural language processing; Text recognition; RECOGNITION; NETWORK;

D O I：

10.1007/978-3-030-57058-3_33

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Text recognition from natural scene images is an active research area because of its important real world applications, including multimedia search and retrieval, and scene understanding through computer vision. It is often the case that portions of text in images are missed due to occlusion with objects in the background. Therefore, this paper presents a method for restoring occluded text to improve text recognition performance. The proposed method uses the GOOGLE Vision API for obtaining labels for input images. We propose to use PixelLink-E2E methods for detecting text and obtaining recognition results. Using these results, the proposed method generates candidate words based on distance measures employing lexicons created through natural scene text recognition. We extract the semantic similarity between labels and recognition results, which results in a Global Context Score (GCS). Next, we use the Natural Language Processing (NLP) system known as BERT for extracting semantics between candidate words, which results in a Local Context Score (LCS). Global and local context scores are then fused for estimating the ranking for each candidate word. The word that gets the highest ranking is taken as the correction for text which is occluded in the image. Experimental results on a dataset assembled from standard natural scene datasets and our resources show that our approach helps to improve the text recognition performance significantly.

引用

页码：466 / 480

页数：15

共 50 条

[31] Text Detection and Recognition in Natural Scene Images
Pise, Amruta
Ruikar, S. D.
2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
[32] Scene Text Detection in Natural Images: A Review
Cao, Dongping
Zhong, Yong
Wang, Lishun
He, Yilong
Dang, Jiachen
SYMMETRY-BASEL, 2020, 12 (12): : 1 - 26
[33] Uyghur Text Detection in Natural Scene Images
Li, Xinming
Li, Junfang
Gao, Qiag
Yu, Xiao
2019 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2019, : 1542 - 1547
[34] Text detection and restoration in natural scene images
Ye, Qixiang
Hao, Jianbin
Huang, Jun
Yu, Hua
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2007, 18 (06) : 504 - 513
[35] Automatic text location in natural scene images
Li, CA
Ding, XQ
Wu, YS
SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 1069 - 1073
[36] Context-based filtering of document images
Ageenko, E
Fränti, P
PATTERN RECOGNITION LETTERS, 2000, 21 (6-7) : 483 - 491
[37] A New Method for Arabic Text Detection in Natural Scene Image Based on the Color Homogeneity
Gaddour, Houda
Kanoun, Slim
Vincent, Nicole
IMAGE AND SIGNAL PROCESSING (ICISP 2016), 2016, 9680 : 127 - 136
[38] A multiscale feature fusion method for cursive text detection in natural scene images
Chandio, Asghar Ali
Leghari, Mehwish
Soomro, Muhammad Ali
Nizamani, Shah Zaman
Memon, Saifullah
IMAGING SCIENCE JOURNAL, 2021, 69 (5-8): : 302 - 318
[39] A Method for Restoring ?-Radiation Scene Images Based on Spatial Axial Gradient Discrimination
Li, Kun-Fang
Feng, Jie
Li, Yu-Dong
Wen, Lin
Kan, Yong-Jia
Guo, Qi
ELECTRONICS, 2023, 12 (17)
[40] Hierarchical Context-Based Emotion Recognition With Scene Graphs
Wu, Shichao
Zhou, Lei
Hu, Zhengxi
Liu, Jingtai
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3725 - 3739

← 1 2 3 4 5 →