Assessing the impact of graphical quality on automatic text recognition in digital maps

被引:18
|
作者
Chiang, Yao-Yi [1 ]
Leyk, Stefan [2 ]
Nazari, Narges Honarvar [3 ,4 ]
Moghaddam, Sima [3 ,4 ]
Tan, Tian Xiang [3 ,4 ]
机构
[1] Univ Southern Calif, Spatial Sci Inst, Los Angeles, CA 90089 USA
[2] Univ Colorado, Dept Geog, Boulder, CO 80309 USA
[3] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
[4] Spatial Sci Inst, Los Angeles, CA USA
关键词
Digital map processing; Scanned maps; Geographic information system; Text recognition; Optical character recognition; Accuracy assessment; EXTRACTION; FEATURES;
D O I
10.1016/j.cageo.2016.04.013
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Converting geographic features (e.g., place names) in map images into a vector format is the first step for incorporating cartographic information into a geographic information system (GIS). With the advancement in computational power and algorithm design, map processing systems have been considerably improved over the last decade. However, the fundamental map processing techniques such as color image segmentation, (map) layer separation, and object recognition are sensitive to minor variations in graphical properties of the input image (e.g., scanning resolution). As a result, most map processing results would not meet user expectations if the user does not "properly" scan the map of interest, preprocess the map image (e.g., using compression or not), and train the processing system, accordingly. These issues could slow down the further advancement of map processing techniques as such unsuccessful attempts create a discouraged user community, and less sophisticated tools would be perceived as more viable solutions. Thus, it is important to understand what kinds of maps are suitable for automatic map processing and what types of results and process-related errors can be expected. In this paper, we shed light on these questions by using a typical map processing task, text recognition, to discuss a number of map instances that vary in suitability for automatic processing. We also present an extensive experiment on a diverse set of scanned historical maps to provide measures of baseline performance of a standard text recognition tool under varying map conditions (graphical quality) and text representations (that can vary even within the same map sheet). Our experimental results help the user understand what to expect when a fully or semi-automatic map processing system is used to process a scanned map with certain (varying) graphical properties and complexities in map content. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:21 / 35
页数:15
相关论文
共 50 条
  • [1] Automatic text recognition in digital videos
    Lienhart, R
    Stuber, F
    IMAGE AND VIDEO PROCESSING IV, 1996, 2666 : 180 - 188
  • [2] Digital Text Quality: A Model for Assessing the Quality of Digital Texts
    Stoner, Angelika
    DEUTSCHE SPRACHE, 2020, 48 (02): : 101 - 125
  • [3] Automatic Feature Extraction and Text Recognition From Scanned Topographic Maps
    Pezeshk, Aria
    Tutwiler, Richard L.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2011, 49 (12): : 5047 - 5063
  • [4] Text independent automatic speaker recognition using self-organizing maps
    Mafra, AT
    Simoes, MG
    CONFERENCE RECORD OF THE 2004 IEEE INDUSTRY APPLICATIONS CONFERENCE, VOLS 1-4: COVERING THEORY TO PRACTICE, 2004, : 1503 - 1510
  • [5] Graphical models and automatic speech recognition
    Bilmes, JA
    MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 191 - 245
  • [6] Automatic interpretation of digital maps
    Walter, Volker
    Luo, Fen
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2011, 66 (04) : 519 - 528
  • [7] DIGITAL TOPOGRAPHIC MAPS - PRODUCTION PROBLEMS AND THEIR IMPACT ON QUALITY AND COST
    RAMIREZ, JR
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 1991, 57 (07): : 973 - 976
  • [8] Automatic text segmentation and text recognition for video indexing
    Lienhart, R
    Effelsberg, W
    MULTIMEDIA SYSTEMS, 2000, 8 (01) : 69 - 81
  • [9] Automatic text segmentation and text recognition for video indexing
    Rainer Lienhart
    Wolfgang Effelsberg
    Multimedia Systems, 2000, 8 : 69 - 81
  • [10] Automatic edge match of digital maps
    Hua, Hui
    Tong, Xiaohua
    Tongji Daxue Xuebao/Journal of Tongji University, 2000, 28 (01): : 33 - 36