Improving Word Recognition using Multiple Hypotheses and Deep Embeddings

被引:0
|
作者
Bansal, Siddhant [1 ]
Krishnan, Praveen [1 ]
Jawahar, C., V [1 ]
机构
[1] IIIT, CVIT, Hyderabad, India
关键词
Word recognition; word image embedding; EmbedNet;
D O I
10.1109/ICPR48806.2021.9412417
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel scheme for improving the word recognition accuracy using word image embeddings. We use a trained text recognizer, which can predict multiple text hypothesis for a given word image. Our fusion scheme improves the recognition process by utilizing the word image and text embeddings obtained from a trained word image embedding network. We propose EmbedNet, which is trained using a triplet loss for learning a suitable embedding space where the embedding of the word image lies closer to the embedding of the corresponding text transcription. The updated embedding space thus helps in choosing the correct prediction with higher confidence. To further improve the accuracy, we propose a plug-and-play module called Confidence based Accuracy Booster (CAB). The CAB module takes in the confidence scores obtained from the text recognizer and Euclidean distances between the embeddings to generate an updated distance vector. The updated distance vector has lower distance values for the correct words and higher distance values for the incorrect words. We rigorously evaluate our proposed method systematically on a collection of books in the Hindi language. Our method achieves an absolute improvement of around 10% in terms of word recognition accuracy.
引用
收藏
页码:9499 / 9506
页数:8
相关论文
共 50 条
  • [31] Improving biterm topic model with word embeddings
    Jiajia Huang
    Min Peng
    Pengwei Li
    Zhiwei Hu
    Chao Xu
    World Wide Web, 2020, 23 : 3099 - 3124
  • [32] Improving biterm topic model with word embeddings
    Huang, Jiajia
    Peng, Min
    Li, Pengwei
    Hu, Zhiwei
    Xu, Chao
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (06): : 3099 - 3124
  • [33] Using Deep Learning Word Embeddings for Citations Similarity in Academic Papers
    Hourrane, Oumaima
    Mifrah, Sara
    Benlahmar, El Habib
    Bouhriz, Nadia
    Rachdi, Mohamed
    BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 185 - 196
  • [34] Using deep learning and word embeddings for predicting human agreeableness behavior
    Raed Alsini
    Anam Naz
    Hikmat Ullah Khan
    Amal Bukhari
    Ali Daud
    Muhammad Ramzan
    Scientific Reports, 14 (1)
  • [35] Improving semantic similarity retrieval with word embeddings
    Yan, Fengqi
    Fan, Qiaoqing
    Lu, Mingming
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (23):
  • [36] Word Embeddings for improving REST services discoverability
    Lizarralde, Ignacio
    Rodriguez, Juan Manuel
    Mateos, Cristian
    Zunino, Alejandro
    2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [37] Arabic Quran Verses Authentication Using Deep Learning and Word Embeddings
    Touati-Hamad, Zineb
    Laouar, Mohamed Ridda
    Bendib, Issam
    Hakak, Saqib
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2022, 19 (04) : 681 - 688
  • [38] Improving Document Ranking with Dual Word Embeddings
    Nalisnick, Eric
    Mitra, Bhaskar
    Craswell, Nick
    Caruana, Rich
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 83 - 84
  • [39] Word Spotting and Recognition using Deep Embedding
    Krishnan, Praveen
    Dutta, Kartik
    Jawahar, C. V.
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 1 - 6
  • [40] A deep learning-based bilingual Hindi and Punjabi named entity recognition system using enhanced word embeddings
    Goyal, Archana
    Gupta, Vishal
    Kumar, Manish
    KNOWLEDGE-BASED SYSTEMS, 2021, 234