Assessing the impact of graphical quality on automatic text recognition in digital maps

被引:18
|
作者
Chiang, Yao-Yi [1 ]
Leyk, Stefan [2 ]
Nazari, Narges Honarvar [3 ,4 ]
Moghaddam, Sima [3 ,4 ]
Tan, Tian Xiang [3 ,4 ]
机构
[1] Univ Southern Calif, Spatial Sci Inst, Los Angeles, CA 90089 USA
[2] Univ Colorado, Dept Geog, Boulder, CO 80309 USA
[3] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
[4] Spatial Sci Inst, Los Angeles, CA USA
关键词
Digital map processing; Scanned maps; Geographic information system; Text recognition; Optical character recognition; Accuracy assessment; EXTRACTION; FEATURES;
D O I
10.1016/j.cageo.2016.04.013
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Converting geographic features (e.g., place names) in map images into a vector format is the first step for incorporating cartographic information into a geographic information system (GIS). With the advancement in computational power and algorithm design, map processing systems have been considerably improved over the last decade. However, the fundamental map processing techniques such as color image segmentation, (map) layer separation, and object recognition are sensitive to minor variations in graphical properties of the input image (e.g., scanning resolution). As a result, most map processing results would not meet user expectations if the user does not "properly" scan the map of interest, preprocess the map image (e.g., using compression or not), and train the processing system, accordingly. These issues could slow down the further advancement of map processing techniques as such unsuccessful attempts create a discouraged user community, and less sophisticated tools would be perceived as more viable solutions. Thus, it is important to understand what kinds of maps are suitable for automatic map processing and what types of results and process-related errors can be expected. In this paper, we shed light on these questions by using a typical map processing task, text recognition, to discuss a number of map instances that vary in suitability for automatic processing. We also present an extensive experiment on a diverse set of scanned historical maps to provide measures of baseline performance of a standard text recognition tool under varying map conditions (graphical quality) and text representations (that can vary even within the same map sheet). Our experimental results help the user understand what to expect when a fully or semi-automatic map processing system is used to process a scanned map with certain (varying) graphical properties and complexities in map content. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:21 / 35
页数:15
相关论文
共 50 条
  • [41] Intelligent digital notepad with handwritten text recognition
    Senda, S
    Yamada, K
    NEC RESEARCH & DEVELOPMENT, 2002, 43 (01): : 53 - 56
  • [42] Assessing the accuracy of automatic speech recognition for psychotherapy
    Adam S. Miner
    Albert Haque
    Jason A. Fries
    Scott L. Fleming
    Denise E. Wilfley
    G. Terence Wilson
    Arnold Milstein
    Dan Jurafsky
    Bruce A. Arnow
    W. Stewart Agras
    Li Fei-Fei
    Nigam H. Shah
    npj Digital Medicine, 3
  • [43] Assessing the accuracy of automatic speech recognition for psychotherapy
    Miner, Adam S.
    Haque, Albert
    Fries, Jason A.
    Fleming, Scott L.
    Wilfley, Denise E.
    Wilson, G. Terence
    Milstein, Arnold
    Jurafsky, Dan
    Arnow, Bruce A.
    Agras, W. Stewart
    Li Fei-Fei
    Shah, Nigam H.
    NPJ DIGITAL MEDICINE, 2020, 3 (01)
  • [44] Automatic Building Reconstruction with Satellite Images and Digital Maps
    Lee, Dong-Cheon
    Yom, Jae-Hong
    Shin, Sung Woong
    Oh, Jaehong
    Park, Kisurk
    ETRI JOURNAL, 2011, 33 (04) : 537 - 546
  • [45] Digital Maps and Automatic Narratives for the Interactive Global Histories
    Cheong, Siew Ann
    Nanetti, Andrea
    Fhilippov, Mikhail
    ASIAN REVIEW OF WORLD HISTORIES, 2016, 4 (01): : 83 - 123
  • [46] Automatic digital biometry analysis based on depth maps
    Reyes, Miguel
    Clapes, Albert
    Ramirez, Jose
    Revilla, Juan R.
    Escalera, Sergio
    COMPUTERS IN INDUSTRY, 2013, 64 (09) : 1316 - 1325
  • [47] Text Independent Automatic Speaker Recognition System in Malayalam
    Selvan, Karthik
    Babu, Anish K. K.
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013,
  • [48] An automatic taxi receipt text recognition application system
    Liu, Weiliang
    Yuan, Xueguang
    Zhang, Yangan
    Wu, Mengqi
    Du, Hang
    Cui, Yakun
    2020 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2020,
  • [49] Almost Unsupervised Text to Speech and Automatic Speech Recognition
    Ren, Yi
    Tan, Xu
    Qin, Tao
    Zhao, Sheng
    Zhao, Zhou
    Liu, Tie-Yan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [50] Automatic Entity Recognition and Typing in Massive Text Data
    Ren, Xiang
    El-Kishky, Ahmed
    Ji, Heng
    Han, Jiawei
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2235 - 2239