Improving Word Recognition using Multiple Hypotheses and Deep Embeddings

被引:0
|
作者
Bansal, Siddhant [1 ]
Krishnan, Praveen [1 ]
Jawahar, C., V [1 ]
机构
[1] IIIT, CVIT, Hyderabad, India
关键词
Word recognition; word image embedding; EmbedNet;
D O I
10.1109/ICPR48806.2021.9412417
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel scheme for improving the word recognition accuracy using word image embeddings. We use a trained text recognizer, which can predict multiple text hypothesis for a given word image. Our fusion scheme improves the recognition process by utilizing the word image and text embeddings obtained from a trained word image embedding network. We propose EmbedNet, which is trained using a triplet loss for learning a suitable embedding space where the embedding of the word image lies closer to the embedding of the corresponding text transcription. The updated embedding space thus helps in choosing the correct prediction with higher confidence. To further improve the accuracy, we propose a plug-and-play module called Confidence based Accuracy Booster (CAB). The CAB module takes in the confidence scores obtained from the text recognizer and Euclidean distances between the embeddings to generate an updated distance vector. The updated distance vector has lower distance values for the correct words and higher distance values for the incorrect words. We rigorously evaluate our proposed method systematically on a collection of books in the Hindi language. Our method achieves an absolute improvement of around 10% in terms of word recognition accuracy.
引用
收藏
页码:9499 / 9506
页数:8
相关论文
共 50 条
  • [41] Supervised Author Recognition with Aggregated Word Embeddings
    Atar, Muhammed Selim
    Esen, Ersin
    Arabaci, Mehmet Ali
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [42] Improving Phrase Chunking by using Contextualized Word Embeddings for a Morphologically Rich Language
    Ehsan, Toqeer
    Khalid, Javairia
    Ambreen, Saadia
    Mustafa, Asad
    Hussain, Sarmad
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 9781 - 9799
  • [43] Using pseudo-senses for improving the extraction of synonyms from word embeddings
    Ferret, Olivier
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 351 - 357
  • [44] Improving Phrase Chunking by using Contextualized Word Embeddings for a Morphologically Rich Language
    Toqeer Ehsan
    Javairia Khalid
    Saadia Ambreen
    Asad Mustafa
    Sarmad Hussain
    Arabian Journal for Science and Engineering, 2022, 47 : 9781 - 9799
  • [45] Adjusting Word Embeddings by Deep Neural Networks
    Gao, Xiaoyang
    Ichise, Ryutaro
    ICAART: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2017, : 398 - 406
  • [46] A survey of word embeddings based on deep learning
    Wang, Shirui
    Zhou, Wenan
    Jiang, Chao
    COMPUTING, 2020, 102 (03) : 717 - 740
  • [47] Geographic Named Entity Recognition and Disambiguation in Mexican News using word embeddings
    Molina-Villegas, Alejandro
    Muniz-Sanchez, Victor
    Arreola-Trapala, Jean
    Alcantara, Filomeno
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176
  • [48] Improving semantic change analysis by combining word embeddings and word frequencies
    Englhardt, Adrian
    Willkomm, Jens
    Schaeler, Martin
    Boehm, Klemens
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2020, 21 (03) : 247 - 264
  • [49] A survey of word embeddings based on deep learning
    Shirui Wang
    Wenan Zhou
    Chao Jiang
    Computing, 2020, 102 : 717 - 740
  • [50] Improving semantic change analysis by combining word embeddings and word frequencies
    Adrian Englhardt
    Jens Willkomm
    Martin Schäler
    Klemens Böhm
    International Journal on Digital Libraries, 2020, 21 : 247 - 264