Segmenting and Indexing Old Documents Using a Letter Extraction

被引:0
|
作者
Coustaty, Mickael [1 ]
Dubois, Sloven [1 ]
Ogier, Jean-Marc [1 ]
Menard, Michel [1 ]
机构
[1] L3i Lab, F-17042 La Rochelle, France
关键词
TOTAL VARIATION MINIMIZATION; IMAGE DECOMPOSITION; RESTORATION; ALGORITHMS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a new method to extract areas of interest in drop caps and particularly the most important shape: Letter itself. This method relies on a combination of a Aujol and Chambolle algorithm and a Segmentation using a Zipf Law and can be enhanced as a three-step process: 1)Decomposition in layers 2)Segmentation using a Zipf Law 3)Selection of the connected components.
引用
收藏
页码:142 / 149
页数:8
相关论文
共 50 条
  • [1] Indexing and retrieval of words in old documents
    Marinai, S
    Marino, E
    Soda, G
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 223 - 227
  • [2] Analyzing Old Documents Using a Complex Approach: Application to Lettrines Indexing
    Coustaty, Mickael
    Courboulay, Vincent
    Ogier, Jean-Marc
    ADVANCES IN KNOWLEDGE DISCOVERY AND MANAGEMENT, VOL 2, 2012, 398 : 155 - 171
  • [3] Indexing and segmenting colour images using neighbourhood sequences
    Hajdu, A
    Nagy, B
    Zörgö, Z
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 1, PROCEEDINGS, 2003, : 957 - 960
  • [4] Automatic Indexing of Financial Documents via Information Extraction
    Ramamurthy, Rajkumar
    Luebbering, Max
    Bell, Thiago
    Gebauer, Michael
    Ulusay, Bilge
    Uedelhoven, Daniel
    Khameneh, Tim Dilmaghani
    Loitz, Ruediger
    Pielka, Maren
    Bauckhage, Christian
    Sifa, Rafet
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [5] CLUSTERING AND INDEXING OF MULTIPLE DOCUMENTS USING FEATURE EXTRACTION THROUGH APACHE HADOOP ON BIG DATA
    Lydia, E. Laxmi
    Moses, G. Jose
    Varadarajan, Vijayakumar
    Nonyelu, Fredi
    Maseleno, Andino
    Perumal, Eswaran
    Shankar, K.
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2020, : 108 - 123
  • [6] Semantic Indexing for XML Documents using RDBMS
    Ihsan, Imran
    Kiyani, Faisal Fayyaz
    Qadir, M. Abdul
    Rehman, Mohib ur
    2015 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICICT), 2015,
  • [7] USING UDC FOR COORDINATE INDEXING AND RETRIEVAL OF DOCUMENTS
    DMITRIEVSKII, NN
    NAUCHNO-TEKHNICHESKAYA INFORMATSIYA SERIYA 1-ORGANIZATSIYA I METODIKA INFORMATSIONNOI RABOTY, 1968, (01): : 14 - +
  • [8] Variable indexing method in rule documents for ship design using extraction of portable document format elements
    Kong, Min-Chul
    Roh, Myung-Il
    Kim, Ki-Su
    Kim, Jongoh
    Kim, Ju-Sung
    Park, Hogyun
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2022, 9 (06) : 2556 - 2573
  • [9] Handwriting documents denoising and indexing using Hermite transform
    Bres, S
    Eglin, V
    Rivero, C
    PATTERN RECOGNITION AND DATA MINING, PT 1, PROCEEDINGS, 2005, 3686 : 664 - 673
  • [10] Categorization of Malay Documents using Latent Semantic Indexing
    Ab Samat, Nordianah
    Murad, Masrah Azrifah Azmi
    Atan, Rodziah
    Abdullah, Muhammad Taufik
    KMICE 2008 - KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE, 2008 - TRANSFERRING, MANAGING AND MAINTAINING KNOWLEDGE FOR NATION CAPACITY DEVELOPMENT, 2008, : 87 - 91