Assessing the impact of graphical quality on automatic text recognition in digital maps

被引:18
|
作者
Chiang, Yao-Yi [1 ]
Leyk, Stefan [2 ]
Nazari, Narges Honarvar [3 ,4 ]
Moghaddam, Sima [3 ,4 ]
Tan, Tian Xiang [3 ,4 ]
机构
[1] Univ Southern Calif, Spatial Sci Inst, Los Angeles, CA 90089 USA
[2] Univ Colorado, Dept Geog, Boulder, CO 80309 USA
[3] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
[4] Spatial Sci Inst, Los Angeles, CA USA
关键词
Digital map processing; Scanned maps; Geographic information system; Text recognition; Optical character recognition; Accuracy assessment; EXTRACTION; FEATURES;
D O I
10.1016/j.cageo.2016.04.013
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Converting geographic features (e.g., place names) in map images into a vector format is the first step for incorporating cartographic information into a geographic information system (GIS). With the advancement in computational power and algorithm design, map processing systems have been considerably improved over the last decade. However, the fundamental map processing techniques such as color image segmentation, (map) layer separation, and object recognition are sensitive to minor variations in graphical properties of the input image (e.g., scanning resolution). As a result, most map processing results would not meet user expectations if the user does not "properly" scan the map of interest, preprocess the map image (e.g., using compression or not), and train the processing system, accordingly. These issues could slow down the further advancement of map processing techniques as such unsuccessful attempts create a discouraged user community, and less sophisticated tools would be perceived as more viable solutions. Thus, it is important to understand what kinds of maps are suitable for automatic map processing and what types of results and process-related errors can be expected. In this paper, we shed light on these questions by using a typical map processing task, text recognition, to discuss a number of map instances that vary in suitability for automatic processing. We also present an extensive experiment on a diverse set of scanned historical maps to provide measures of baseline performance of a standard text recognition tool under varying map conditions (graphical quality) and text representations (that can vary even within the same map sheet). Our experimental results help the user understand what to expect when a fully or semi-automatic map processing system is used to process a scanned map with certain (varying) graphical properties and complexities in map content. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:21 / 35
页数:15
相关论文
共 50 条
  • [21] Text processing by digital voice recognition
    Meijer, GA
    Baak, JPA
    vanDiest, PJ
    vanHattum, AH
    vanderLinden, HC
    Koevoets, JJM
    ANALYTICAL AND QUANTITATIVE CYTOLOGY AND HISTOLOGY, 1996, 18 (04): : 261 - 266
  • [22] TEXT NORMALIZATION FOR AUTOMATIC SPEECH RECOGNITION SYSTEMS
    Vasile, Alin-Florentin
    Boros, Tiberiu
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE 'LINQUISTIC RESOURCES AND TOOLS FOR PROCESSING THE ROMANIAN LANGUAGE', 2016, : 121 - 128
  • [23] Automatic Text Recognition Using Difference Ratio
    Anwar, Shamama
    SMART COMPUTING AND INFORMATICS, 2018, 77 : 691 - 699
  • [24] Automatic Genre Recognition and Adaptive Text Summarization
    Yatsko, V. A.
    Starikov, M. S.
    Butakov, A. V.
    AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2010, 44 (03) : 111 - 120
  • [25] Robust Scene Text Recognition with Automatic Rectification
    Shi, Baoguang
    Wang, Xinggang
    Lyu, Pengyuan
    Yao, Cong
    Bai, Xiang
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4168 - 4176
  • [26] Automatic text detection and tracking in digital video
    Li, HP
    Doermann, D
    Kia, O
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (01) : 147 - 156
  • [27] SAR Automatic Target Recognition Using Discriminative Graphical Models
    Srinivas, Umamahesh
    Monga, Vishal
    Raj, Raghu G.
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2014, 50 (01) : 591 - 606
  • [28] Automatic diatom recognition on digital images
    Forero, MG
    Alvarado, JE
    Tamayo, AL
    Perez, GC
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXV, 2002, 4790 : 133 - 142
  • [29] DIGITAL AUTOMATIC WORD RECOGNITION PROCEDURE
    PETRICK, SR
    WILLETT, HM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1960, 32 (11): : 1517 - 1517
  • [30] Assessing text representations with recognition: The interaction of domain knowledge and text coherence
    Long, Debra L.
    Wilson, Jeannette
    Hurley, Ryan
    Prat, Chantel S.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2006, 32 (04) : 816 - 827