An automatic histogram detection and information extraction from document images

被引:0
|
作者
P. H. Anagha
A. Baskar
机构
[1] Amrita Vishwa Vidyapeetham,Dept of Computer Science and Engineering, Amrita School of Engineering
关键词
Histogram; Hough line detector; Morphological operator; Information; Extraction;
D O I
暂无
中图分类号
学科分类号
摘要
Histogram is an important data chart that is commonly present in scientific documents. In this paper, an automatic histogram detection and information extraction methodology, based on Hough line detector and Morphological operator, is proposed. The proffered system is comprised of three steps: pre-processing, axis detection, and chart pattern extraction. In the pre-processing step, the RGB image pattern of a histogram is converted into a binary image. Next, in the axis detection step, horizontal axis, vertical axis and title of the histogram are extracted. In this step Hough line detector methodology was applied to detect horizontal and vertical lines in the image patterns. From the set of identified vertical lines, both the endpoints of a line, having the same minimum values of x co-ordinate was considered as a vertical axis. Similarly, from the set of identified horizontal lines, the two endpoints of a line having the same maximum values of y co-ordinate were considered as a horizontal axis. With respect to the dimensions of the horizontal axis and vertical axis, a rectangular region containing horizontal axis values and label, vertical axis values and label and title are extracted. In the final chart pattern extraction step, using morphological operations, the frequency of data present in the histogram was identified. Verification and validation tests of the propounded system yielded promising results, indicative of efficient approach for extraction of histogram information.
引用
收藏
页码:77 / 85
页数:8
相关论文
共 50 条
  • [41] Automatic Information Extraction from Heatmaps
    Markowska-Kaczmar, Urszula
    Szymanska, Agnieszka
    Culer, Lukasz
    5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, 2014, : 267 - +
  • [42] Automatic extraction of ontologies from teaching document metadata
    Kay, J
    Holden, S
    INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION, VOLS I AND II, PROCEEDINGS, 2002, : 1555 - 1556
  • [43] AUTOMATIC TOPIC DETECTION STRATEGY FOR INFORMATION RETRIEVAL IN SPOKEN DOCUMENT
    Jin, Shan
    Misra, Hemant
    Sikora, Thomas
    Jose, Joemon
    2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES, 2009, : 300 - +
  • [44] Automatic keyphrases extraction from document using backpropagation
    Wang, JB
    Peng, H
    Hu, JS
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 3770 - 3774
  • [45] Automatic correction of distorted aerial images and extraction of multiangular information
    Guo, J
    Li, XW
    IGARSS '98 - 1998 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, PROCEEDINGS VOLS 1-5: SENSING AND MANAGING THE ENVIRONMENT, 1998, : 2047 - 2049
  • [46] Automatic presentation of wide dynamic images for optimal, information extraction
    Kattnig, AP
    OPTICS FOR THE QUALITY OF LIFE, PTS 1 AND 2, 2003, 4829 : 228 - 229
  • [47] Automatic Extraction of Document Topics
    Teixeira, Luis
    Lopes, Gabriel
    Ribeiro, Rita A.
    TECHNOLOGICAL INNOVATION FOR SUSTAINABILITY, 2011, 349 : 101 - +
  • [48] EXTRACTION OF BINARY CHARACTER GRAPHICS IMAGES FROM GRAYSCALE DOCUMENT IMAGES
    KAMEL, M
    ZHAO, A
    CVGIP-GRAPHICAL MODELS AND IMAGE PROCESSING, 1993, 55 (03): : 203 - 217
  • [49] Automatic road extraction from aerial images
    Trinder, JC
    Wang, YD
    DIGITAL SIGNAL PROCESSING, 1998, 8 (04) : 215 - 224
  • [50] Automatic vessel extraction from angiogram images
    Guo, DB
    Richardson, P
    COMPUTERS IN CARDIOLOGY 1998, VOL 25, 1998, 25 : 441 - 444