Adaptive document block segmentation and classification

被引:26
|
作者
Shih, FY
Chen, SS
机构
[1] Computer Vision Laboratory, Department of Computer and Information Science, New Jersey Institule of Technology, Newark
关键词
D O I
10.1109/3477.537322
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This correspondence presents an adaptive block segmentation and classification technique for daily-received office documents having complex layout structures such as multiple columns and mixed-mode contents of text, graphics, and pictures. First, an improved two-step block segmentation algorithm is performed based on run-length smoothing for decomposing any document into single-mode blocks. Then, a rule-based block classification is used for classifying each block into the test, horizontal/vertical line, graphics, or picture type. The document features and rules used are independent of character font and size and the scanning resolution. Experimental results show that our algorithms are capable of correctly segmenting and classifying different types of mixed-mode printed documents.
引用
收藏
页码:797 / 802
页数:6
相关论文
共 50 条
  • [21] Page segmentation and classification algorithm for skewed document images with graph regions
    Wang, JJ
    Huang, XW
    Zhong, XR
    Guo, WW
    THIRD INTERNATIONAL SYMPOSIUM ON MULTISPECTRAL IMAGE PROCESSING AND PATTERN RECOGNITION, PTS 1 AND 2, 2003, 5286 : 645 - 648
  • [22] Using Multi-level Segmentation Features for Document Image Classification
    Kaddas, Panagiotis
    Gatos, Basilis
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 702 - 712
  • [23] Automated Segmentation and Classification of Chemical and other Equations from Document Images
    Jana, Prerana
    Majumdar, Anubhab
    Layek, Ashish Kumar
    Mandal, Sekhar
    Das, Amit Kumar
    2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 127 - +
  • [24] Skew detection, page segmentation, and script classification of printed document images
    Waked, B
    Bergler, S
    Suen, CY
    Khoury, S
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 4470 - 4475
  • [25] CLASSIFICATION CONSTRAINED DISCRIMINATOR FOR DOMAIN ADAPTIVE SEMANTIC SEGMENTATION
    Chen, Tao
    Zhang, Jian
    Xie, Guo-Sen
    Yao, Yazhou
    Huang, Xiaoshui
    Tang, Zhenmin
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [26] Alternating Segmentation and Simulation for Contrast Adaptive Tissue Classification
    Pham, Dzung L.
    Roy, Snehashis
    MEDICAL IMAGING 2018: BIOMEDICAL APPLICATIONS IN MOLECULAR, STRUCTURAL, AND FUNCTIONAL IMAGING, 2018, 10578
  • [27] BPFormNet: a lightweight block pyramid network for form segmentation and classification
    Hanyang Lin
    Yongzhao Zhan
    Chongshu Wu
    International Journal on Document Analysis and Recognition (IJDAR), 2024, 27 : 1 - 17
  • [28] BPFormNet: a lightweight block pyramid network for form segmentation and classification
    Lin, Hanyang
    Zhan, Yongzhao
    Wu, Chongshu
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024, 27 (01) : 1 - 17
  • [29] GLCM Inspired Fingerprints Segmentation Algorithm with Adaptive Block Size
    Li, Huina
    Luo, JunLi
    MEASUREMENT TECHNOLOGY AND ITS APPLICATION, PTS 1 AND 2, 2013, 239-240 : 1456 - 1461
  • [30] Web document classification technique based on the adaptive neural network
    Lei, Jingsheng
    Zhong, Sheng
    Journal of Information and Computational Science, 2004, 1 (03): : 135 - 139