Recognition of the Script in Serbian Documents Using Frequency Occurrence and Co-Occurrence Analysis

被引:8
|
作者
Brodic, Darko [1 ]
Milivojevic, Zoran N. [2 ]
Maluckov, Cedomir A. [1 ]
机构
[1] Univ Belgrade, Tech Fac Bor, Bor 19210, Serbia
[2] Tech Coll Nis, Nish 18000, Serbia
来源
关键词
IMAGE TEXTURE; FEATURES; MATRIX;
D O I
10.1155/2013/896328
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Any document in Serbian language can be written in two different scripts: Latin or Cyrillic. Although characteristics of these scripts are similar, some of their statistical measures are quite different. The paper proposed a method for the extraction of certain script from document according to the occurrence and co-occurrence of the script types. First, each letter is modeled with the certain script type according to characteristics concerning its position in baseline area. Then, the frequency analysis of the script types occurrence is performed. Due to diversity of Latin and Cyrillic script, the occurrence of modeled letters shows substantial statistics dissimilarity. Furthermore, the co-occurrence matrix is computed. The analysis of the co-occurrence matrix draws a strong margin as a criteria to distinguish and recognize the certain script. The proposed method is analyzed on the case of a database which includes different types of printed and web documents. The experiments gave encouraging results.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Co-occurrence analysis of scientific documents in citation networks
    Muppidi, Satish
    Reddy, K. Thammi
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2020, 24 (01) : 19 - 25
  • [2] Object recognition using Gabor co-occurrence similarity
    Zou, Jian
    Liu, Chuan-Cai
    Zhang, Yue
    Lu, Gui-Fu
    [J]. PATTERN RECOGNITION, 2013, 46 (01) : 434 - 448
  • [3] A proposal of co-occurrence frequency image
    Yamaashi, Kazuhiko
    Fujiwara, Takayuki
    Koshimizu, Hiroyasu
    [J]. IEEJ Transactions on Electronics, Information and Systems, 2007, 127 (04) : 528 - 536
  • [4] FACE RECOGNITION USING CO-OCCURRENCE HISTOGRAMS OF ORIENTED GRADIENTS
    Thanh-Toan Do
    Kijak, Ewa
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 1301 - 1304
  • [5] Textile recognition using Tchebichef moments of co-occurrence matrices
    Cheong, Marc
    Loke, Kar-Seng
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 1017 - +
  • [6] Enhancing object recognition using regency and co-occurrence heuristics
    Lee, JCM
    Pong, TC
    Esterline, A
    [J]. PATTERN RECOGNITION, 1998, 31 (09) : 1319 - 1336
  • [7] Handwritten arabic character recognition using co-occurrence matrices
    Assaleh, K
    Al-Rousan, M
    Ghazal, M
    [J]. 8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL VI, PROCEEDINGS: IMAGE, ACOUSTIC, SIGNAL PROCESSING AND OPTICAL SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2004, : 191 - 194
  • [8] Analysis of co-occurrence networks with clique occurrence information
    Shen, Bin
    Li, Yixiao
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2014, 25 (05):
  • [9] TEXTURE ANALYSIS USING GENERALIZED CO-OCCURRENCE MATRICES
    DAVIS, LS
    JOHNS, SA
    AGGARWAL, JK
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (03) : 251 - 259
  • [10] FACE RECOGNITION USING HISTOGRAM OF CO-OCCURRENCE GABOR PHASE PATTERNS
    Wang, Cong
    Chai, Zhenhua
    Sun, Zhenan
    [J]. 2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 2777 - 2781