Improving Word Recognition using Multiple Hypotheses and Deep Embeddings

被引:0
|
作者
Bansal, Siddhant [1 ]
Krishnan, Praveen [1 ]
Jawahar, C., V [1 ]
机构
[1] IIIT, CVIT, Hyderabad, India
关键词
Word recognition; word image embedding; EmbedNet;
D O I
10.1109/ICPR48806.2021.9412417
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel scheme for improving the word recognition accuracy using word image embeddings. We use a trained text recognizer, which can predict multiple text hypothesis for a given word image. Our fusion scheme improves the recognition process by utilizing the word image and text embeddings obtained from a trained word image embedding network. We propose EmbedNet, which is trained using a triplet loss for learning a suitable embedding space where the embedding of the word image lies closer to the embedding of the corresponding text transcription. The updated embedding space thus helps in choosing the correct prediction with higher confidence. To further improve the accuracy, we propose a plug-and-play module called Confidence based Accuracy Booster (CAB). The CAB module takes in the confidence scores obtained from the text recognizer and Euclidean distances between the embeddings to generate an updated distance vector. The updated distance vector has lower distance values for the correct words and higher distance values for the incorrect words. We rigorously evaluate our proposed method systematically on a collection of books in the Hindi language. Our method achieves an absolute improvement of around 10% in terms of word recognition accuracy.
引用
收藏
页码:9499 / 9506
页数:8
相关论文
共 50 条
  • [21] Deep recurrent neural networks with word embeddings for Urdu named entity recognition
    Khan, Wahab
    Daud, Ali
    Alotaibi, Fahd
    Aljohani, Naif
    Arafat, Sachi
    ETRI JOURNAL, 2020, 42 (01) : 90 - 100
  • [22] Lexical Function Identification Using Word Embeddings and Deep Learning
    Hernandez-Miranda, Arturo
    Gelbukh, Alexander
    Kolesnikova, Olga
    ADVANCES IN SOFT COMPUTING, MICAI 2019, 2019, 11835 : 77 - 86
  • [23] Fall Detection in EHR using Word Embeddings and Deep Learning
    dos Santos, Henrique D. P.
    Silva, Amanda P.
    Maciel, Maria Carolina O.
    Burin, Haline Maria V.
    Urbanetto, Janete S.
    Vieira, Renata
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 265 - 268
  • [24] Automatic Idiom Recognition with Word Embeddings
    Peng, Jing
    Feldman, Anna
    INFORMATION MANAGEMENT AND BIG DATA, 2017, 656 : 17 - 29
  • [25] Improving the accuracy using pre-trained word embeddings on deep neural networks for Turkish text classification
    Aydogan, Murat
    Karci, Ali
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2020, 541
  • [26] DEEP CONVOLUTIONAL ACOUSTIC WORD EMBEDDINGS USING WORD-PAIR SIDE INFORMATION
    Kamper, Herman
    Wang, Weiran
    Livescu, Karen
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4950 - 4954
  • [27] Improving Unsupervised Acoustic Word Embeddings using Speaker and Gender Information
    van Staden, Lisa
    Kamper, Herman
    2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 533 - 538
  • [28] Improving Implicit Stance Classification in Tweets Using Word and Sentence Embeddings
    Schaefer, Robin
    Stede, Manfred
    ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2019, 2019, 11793 : 299 - 307
  • [29] Improving seller-customer communication process using word embeddings
    Missen, Malik Muhammad Saad
    Naeem, Aqsa
    Asmat, Hina
    Salamat, Nadeem
    Akhtar, Nadeem
    Coustaty, Mickael
    Prasath, V. B. Surya
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (02) : 2257 - 2272
  • [30] Improving Sentiment Analysis in Twitter Using Sentiment Specific Word Embeddings
    Othman, Rania
    Abdelsadek, Youcef
    Chelghoum, Kamel
    Kacem, Imed
    Faiz, Rim
    PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 2, 2019, : 854 - 858