A chinese document layout analysis method based on minimal spanning tree clustering

被引:3
|
作者
Tian, XD [1 ]
Zhang, C [1 ]
机构
[1] Hebei Univ, Fac Math & Comp Sci, Baoding 071002, Hebei, Peoples R China
关键词
document layout analysis; run-length smoothing; minimal spanning tree clustering;
D O I
10.1109/ICMLC.2003.1260127
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For adapting to some special characteristics of Chinese documents, a method based on minimal spanning tree clustering is presented. This method is a bottom-up approach. First apply run-length smoothing algorithm on the document in horizontal direction, and then in vertical direction. After that, minimal spanning tree clustering is applied. We can infer from experiments that the problem of Chinese document layout analysis can be resolved in a better way.
引用
收藏
页码:3183 / 3187
页数:5
相关论文
共 50 条
  • [31] Distributed Document Clustering Analysis Based on a Hybrid Method
    Judith, J. E.
    Jayakumari, J.
    CHINA COMMUNICATIONS, 2017, 14 (02) : 131 - 142
  • [32] Distributed Document Clustering Analysis Based on a Hybrid Method
    J.E.Judith
    J.Jayakumari
    中国通信, 2017, 14 (02) : 131 - 142
  • [33] XML Document Clustering Based on Spectral Analysis Method
    Li Xinye
    ADVANCED RESEARCH ON INFORMATION SCIENCE, AUTOMATION AND MATERIAL SYSTEM, PTS 1-6, 2011, 219-220 : 304 - 307
  • [34] A novel PAT-tree approach to Chinese document clustering
    Kwok, K
    Lyu, MR
    King, I
    ISE'2001: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON INFORMATION SYSTEMS AND ENGINEERING, 2001, : 85 - 91
  • [35] Document clustering based on constructing density tree
    Dai W.
    Wang W.
    Hou Y.
    Wang Y.
    Zhang L.
    Transactions of Tianjin University, 2008, 14 (1) : 21 - 26
  • [36] Document Clustering Based on Constructing Density Tree
    戴维迪
    王文俊
    侯越先
    王英
    张璐
    Transactions of Tianjin University, 2008, (01) : 21 - 26
  • [37] A Document Layout Analysis Method Based on Morphological Operators and Connected Components
    Alarcon Arenas, Sebastian W.
    Meza-Lovon, Graciela L.
    Yari, Yessenia
    2018 XLIV LATIN AMERICAN COMPUTER CONFERENCE (CLEI 2018), 2018, : 622 - 631
  • [38] Exploratory data analysis of evoked response single trials based on minimal spanning tree
    Laskaris, NA
    Ioannides, AE
    CLINICAL NEUROPHYSIOLOGY, 2001, 112 (04) : 698 - 712
  • [39] Meta clusters through minimum spanning tree based clustering for performance analysis of students
    Karthikeyan, T.
    Peter, S. John
    Chidambaranathan, S.
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2011, 14 (04): : 349 - 367