An automatic histogram detection and information extraction from document images

Cited by: 0
Authors
P. H. Anagha
A. Baskar
Affiliation
[1] Amrita Vishwa Vidyapeetham, Dept. of Computer Science and Engineering, Amrita School of Engineering
Keywords
Histogram; Hough line detector; Morphological operator; Information extraction
DOI
Not available
Abstract
The histogram is an important type of data chart that commonly appears in scientific documents. This paper proposes an automatic histogram detection and information extraction method based on the Hough line detector and morphological operators. The proposed system comprises three steps: pre-processing, axis detection, and chart pattern extraction. In the pre-processing step, the RGB image of a histogram is converted into a binary image. In the axis detection step, the horizontal axis, the vertical axis, and the title of the histogram are extracted. Here the Hough line detector is applied to detect the horizontal and vertical lines in the image. From the set of detected vertical lines, the line whose two endpoints share the minimum x coordinate is taken as the vertical axis. Similarly, from the set of detected horizontal lines, the line whose two endpoints share the maximum y coordinate is taken as the horizontal axis. Relative to the positions and extents of the two axes, rectangular regions containing the horizontal axis values and label, the vertical axis values and label, and the title are extracted. In the final chart pattern extraction step, morphological operations are used to identify the frequency of the data shown in the histogram. Verification and validation tests of the proposed system yielded promising results, indicating that the approach is effective for extracting histogram information.
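The axis selection rule described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `select_axes`, the tolerance `tol`, and the sample segments are hypothetical, and the segments are assumed to be endpoint tuples of the kind a probabilistic Hough line detector would return (for example, OpenCV's `cv2.HoughLinesP`). Image coordinates are assumed, so y grows downward and the maximum y corresponds to the bottom of the chart.

```python
from typing import List, Optional, Tuple

# A line segment as (x1, y1, x2, y2) in image coordinates (y grows downward).
Segment = Tuple[int, int, int, int]


def select_axes(
    segments: List[Segment], tol: int = 2
) -> Tuple[Optional[Segment], Optional[Segment]]:
    """Pick (vertical_axis, horizontal_axis) from detected line segments.

    tol is an assumed pixel tolerance for treating a segment as
    vertical (x1 ~ x2) or horizontal (y1 ~ y2).
    """
    vertical = [s for s in segments if abs(s[0] - s[2]) <= tol]
    horizontal = [s for s in segments if abs(s[1] - s[3]) <= tol]

    # Vertical axis: the vertical segment with the minimum x coordinate.
    v_axis = min(vertical, key=lambda s: min(s[0], s[2]), default=None)
    # Horizontal axis: the horizontal segment with the maximum y coordinate.
    h_axis = max(horizontal, key=lambda s: max(s[1], s[3]), default=None)
    return v_axis, h_axis


# Hypothetical detector output for a small histogram image.
segments = [
    (50, 20, 50, 300),     # left edge of the plot: vertical axis candidate
    (120, 150, 120, 300),  # side of a bar
    (200, 80, 200, 300),   # side of another bar
    (50, 300, 400, 300),   # bottom edge of the plot: horizontal axis candidate
    (120, 150, 200, 150),  # top of a bar
]
v_axis, h_axis = select_axes(segments)
print("vertical axis:", v_axis)
print("horizontal axis:", h_axis)
```

In this sketch the bar edges are also detected as vertical and horizontal lines, which is why the extreme-coordinate rule is needed to separate the axes from the bars. A real pipeline would also have to handle chart borders or grid lines, which this example does not address.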
Pages: 77-85
Number of pages: 8