Word Spotting based Retrieval of Urdu Handwritten Documents

被引:12
|
作者
Abidi, Ali [1 ]
Jamil, Akhtar [2 ]
Siddiqi, Imran [3 ]
Khurshid, Khurram [4 ]
机构
[1] Natl Univ Sci & Technol, Islamabad, Pakistan
[2] Comsats Univ, Abbotabad, Pakistan
[3] Bahria Univ, Islamabad, Pakistan
[4] Inst Space Technol, Islamabad, Pakistan
关键词
Urdu handwritten text detection; Partial Words; Run length smoothing alogrithm;
D O I
10.1109/ICFHR.2012.289
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Urdu being one of the most popular languages adopted during different swatches of history has a valuable collection of handwritten scripts in different state libraries of South Asia. Digitizing these collections can serve not only to preserve them but also to make them available to general public. Non existence of an Urdu OCR, however, limits the concept of a digital Urdu library to scanning and manual search of documents only. We present a word spotting based search method for Urdu handwritten text. The text is first segmented into partial words and a set of features is computed from each partial word. The user queries the system using word image. The partial words in the query image are then matched with those in the database and the matched partial words are merged into complete words. The proposed method evaluated on 90 handwritten documents reported encouraging precision and recall rates.
引用
收藏
页码:331 / 336
页数:6
相关论文
共 50 条
  • [1] Word Spotting as a Service for Handwritten Documents
    Amanatiadis, Angelos
    Zagoris, Konstantinos
    Pratikakis, Ioannis
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [2] A Survey on handwritten documents word spotting
    Ahmed R.
    Al-Khatib W.G.
    Mahmoud S.
    [J]. International Journal of Multimedia Information Retrieval, 2017, 6 (1) : 31 - 47
  • [3] An overview on handwritten documents word spotting
    Boualam, Manal
    Khaissidi, Ghizlane
    Mrabti, Mostafa
    Elfakir, Youssef
    [J]. 2019 INTERNATIONAL CONFERENCE ON WIRELESS TECHNOLOGIES, EMBEDDED AND INTELLIGENT SYSTEMS (WITS), 2019,
  • [4] Visual keyword based word-spotting in handwritten documents
    Kolcz, A
    Alspector, J
    Augusteijn, M
    Carlson, R
    Popescu, GV
    [J]. DOCUMENT RECOGNITION V, 1998, 3305 : 185 - 193
  • [5] Local Feature Based Word Spotting in Handwritten Archive Documents
    Czuni, Laszlo
    Kiss, Peter Jozsef
    Gal, Monika
    Lipovits, Agnes
    [J]. 2013 11TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI 2013), 2013, : 178 - 183
  • [6] Multilingual Word Spotting in Offline Handwritten Documents
    Wshah, Safwan
    Kumar, Gaurav
    Govindaraju, Venu
    [J]. 2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 310 - 313
  • [7] Attribute CNNs for word spotting in handwritten documents
    Sebastian Sudholt
    Gernot A. Fink
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 199 - 218
  • [8] A segmentation free Word Spotting for handwritten documents
    Ghorbel, Adam
    Ogier, Lean-Marc
    Vincent, Nicole
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 346 - 350
  • [9] Attribute CNNs for word spotting in handwritten documents
    Sudholt, Sebastian
    Fink, Gernot A.
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (03) : 199 - 218
  • [10] Sequential Word Spotting in Historical Handwritten Documents
    Fernandez-Mota, David
    Llados, Josep
    Fornes, Alicia
    Manmatha, R.
    [J]. 2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 101 - 105