Segmentation and Text extraction from Document Images: Survey

被引:0
|
作者
Mukarambi, Gururaj [1 ]
Gaikwad, Hema [1 ]
Dhandra, B., V [1 ]
机构
[1] Symbiosis Int Deemed Univ, Symbiosis Inst Comp Studies & Res, Pune, Maharashtra, India
关键词
GLCM; RBSC; DPTE and PLA; CLASSIFICATION;
D O I
10.1109/i-pact44901.2019.8960097
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Segmentation and text extraction from complex document image helps in analyzing, storing, retrieving and auto indexing of required information. In this paper, we considered 23 existing methods of segmentation and text extraction for complex document images. After review of the existing methods, we found that connected component method [1],[2],[5],[8],[10],[13] are more suitable for segmentation of text and non-text from document and also LSTM &RNN found that potential methods for extraction of text from complex document[15].
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Segmentation of text and graphics from document images
    Chowdhury, S. P.
    Mandal, S.
    Das, A. K.
    Chanda, Bhabatosh
    [J]. ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 619 - +
  • [2] Text region extraction and text segmentation on camera-captured document style images
    Song, YJ
    Kim, KC
    Choi, YW
    Byun, HR
    Kim, SH
    Chi, SY
    Jang, DK
    Chung, YK
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 172 - 176
  • [3] Text extraction from complex document images using the multi-plane segmentation technique
    Chen, Yen-Lin
    Wu, Bing-Fei
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 3540 - +
  • [4] Word Extraction and Character Segmentation from Text Lines of Unconstrained Handwritten Bangla Document Images
    Sarkar, Ram
    Malakar, Samir
    Das, Nibaran
    Basu, Subhadip
    Kundu, Mahantapas
    Nasipuri, Mita
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2011, 20 (03) : 227 - 260
  • [5] Script-Independent Text Segmentation from Document Images
    Sahare, Parul
    Tembhurne, Jitendra V.
    Parate, Mayur R.
    Diwan, Tausif
    Dhok, Sanjay B.
    [J]. International Journal of Ambient Computing and Intelligence, 2022, 13 (01)
  • [6] Text Line Extraction in Document Images
    Wang, Liuan
    Fan, Wei
    Sun, Jun
    Naoi, Satshi
    Tanaka, Hiroshi
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 191 - 195
  • [7] Text segmentation in degraded historical document images
    Kavitha, A. S.
    Shivakumara, P.
    Kumar, G. H.
    Lu, Tong
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2016, 17 (02) : 189 - 197
  • [8] Text region extraction from quality degraded document images
    Abirami, S.
    Manjula, D.
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2007, 4815 : 519 - 527
  • [9] Text Extraction from Document Images using Edge Information
    Grover, Sachin
    Arora, Kushal
    Mitra, Suman K.
    [J]. 2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 582 - +
  • [10] Text line extraction for historical document images
    Saabni, Raid
    Asi, Abedelkadir
    El-Sana, Jihad
    [J]. PATTERN RECOGNITION LETTERS, 2014, 35 : 23 - 33