Improving Word Recognition using Multiple Hypotheses and Deep Embeddings

Cited by: 0
Authors
Bansal, Siddhant [1]
Krishnan, Praveen [1]
Jawahar, C. V. [1]
Affiliations
[1] IIIT, CVIT, Hyderabad, India
Keywords
Word recognition; word image embedding; EmbedNet
DOI
10.1109/ICPR48806.2021.9412417
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
We propose a novel scheme for improving word recognition accuracy using word image embeddings. We use a trained text recognizer that can predict multiple text hypotheses for a given word image. Our fusion scheme improves the recognition process by utilizing the word image and text embeddings obtained from a trained word image embedding network. We propose EmbedNet, which is trained using a triplet loss to learn an embedding space in which the embedding of a word image lies closer to the embedding of its corresponding text transcription. The updated embedding space thus helps in choosing the correct prediction with higher confidence. To further improve the accuracy, we propose a plug-and-play module called Confidence based Accuracy Booster (CAB). The CAB module takes the confidence scores obtained from the text recognizer and the Euclidean distances between the embeddings and generates an updated distance vector. The updated distance vector has lower distance values for the correct words and higher distance values for the incorrect words. We systematically evaluate our proposed method on a collection of books in the Hindi language. Our method achieves an absolute improvement of around 10% in word recognition accuracy.
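The selection step described in the abstract can be illustrated with a minimal sketch. It assumes embeddings are plain lists of floats and uses the standard triplet margin loss; the function names, the margin value, and the toy 2-D vectors are hypothetical and are not taken from the paper. The CAB module is not sketched here because the abstract does not state its formula.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss: max(0, d(a, p) - d(a, n) + margin).

    The margin of 1.0 is an assumed default, not a value from the paper.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def pick_hypothesis(image_embedding, hypotheses):
    """Pick the hypothesis whose text embedding is nearest to the image embedding.

    hypotheses: list of (text, text_embedding) pairs.
    Returns the chosen text and the list of all distances.
    """
    distances = [euclidean(image_embedding, emb) for _, emb in hypotheses]
    best = min(range(len(hypotheses)), key=distances.__getitem__)
    return hypotheses[best][0], distances


# Toy 2-D embeddings, made up purely for illustration.
image_emb = [0.0, 0.0]
hyps = [("word_a", [3.0, 4.0]), ("word_b", [0.6, 0.8]), ("word_c", [1.0, 1.0])]
best_text, dists = pick_hypothesis(image_emb, hyps)
print(best_text, dists)
```

In this toy setup the hypothesis with the smallest distance to the image embedding is selected; training with a triplet loss is what would pull the correct transcription's embedding closer than the incorrect ones.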
Pages: 9499-9506
Number of pages: 8