Multilingual Word Spotting in Offline Handwritten Documents

被引:0
|
作者
Wshah, Safwan [1 ]
Kumar, Gaurav [1 ]
Govindaraju, Venu [1 ]
机构
[1] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
关键词
HIDDEN MARKOV-MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a novel multilingual word spotting framework based on Hidden Markov Models that works on corpus of multilingual handwritten documents and documents that contain more than one handwritten script. The system deals with large multilingual vocabularies without need for word or character segmentation. A keyword is represented by concatenating its character models. We propose and compare two systems: a script identifier based (IDB) and a script identifier free (IDF) system. IDB uses a HMM based script identifier before spotting a keyword. While, IDF does the spotting without the script identification. The system is evaluated on a mixed corpus of public dataset from several scripts such as IAM for English, AMA for Arabic and LAW for Devanagari and on synthetic dataset generated by concatenating words and lines from different scripts in a document image.
引用
收藏
页码:310 / 313
页数:4
相关论文
共 50 条
  • [1] Statistical script independent word spotting in offline handwritten documents
    Wshah, Safwan
    Kumar, Gaurav
    Govindaraju, Venu
    PATTERN RECOGNITION, 2014, 47 (03) : 1039 - 1050
  • [2] Word Spotting as a Service for Handwritten Documents
    Amanatiadis, Angelos
    Zagoris, Konstantinos
    Pratikakis, Ioannis
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [3] A Survey on handwritten documents word spotting
    Ahmed R.
    Al-Khatib W.G.
    Mahmoud S.
    International Journal of Multimedia Information Retrieval, 2017, 6 (1) : 31 - 47
  • [4] An overview on handwritten documents word spotting
    Boualam, Manal
    Khaissidi, Ghizlane
    Mrabti, Mostafa
    Elfakir, Youssef
    2019 INTERNATIONAL CONFERENCE ON WIRELESS TECHNOLOGIES, EMBEDDED AND INTELLIGENT SYSTEMS (WITS), 2019,
  • [5] Script Independent Word Spotting in Offline Handwritten Documents Based on Hidden Markov Models
    Wshah, Safwan
    Kumar, Gaurav
    Govindaraju, Venu
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 14 - 19
  • [6] Attribute CNNs for word spotting in handwritten documents
    Sebastian Sudholt
    Gernot A. Fink
    International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 199 - 218
  • [7] A segmentation free Word Spotting for handwritten documents
    Ghorbel, Adam
    Ogier, Lean-Marc
    Vincent, Nicole
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 346 - 350
  • [8] Attribute CNNs for word spotting in handwritten documents
    Sudholt, Sebastian
    Fink, Gernot A.
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (03) : 199 - 218
  • [9] Sequential Word Spotting in Historical Handwritten Documents
    Fernandez-Mota, David
    Llados, Josep
    Fornes, Alicia
    Manmatha, R.
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 101 - 105
  • [10] ON THE INFLUENCE OF WORD REPRESENTATIONS FOR HANDWRITTEN WORD SPOTTING IN HISTORICAL DOCUMENTS
    Llados, Josep
    Rusinol, Marcal
    Fornes, Alicia
    Fernandez, David
    Dutta, Anjan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (05)