Robust Toponym Resolution Based on Surface Statistics

被引:1
|
作者
Sano, Tomohisa [1 ]
Nobesawa, Shiho Hoshi [2 ]
Okamoto, Hiroyuki [1 ]
Susuki, Hiroya [1 ]
Matsubara, Masaki [1 ]
Saito, Hiroaki [1 ]
机构
[1] Keio Univ, Yokohama, Kanagawa 2238522, Japan
[2] Tokyo City Univ, Tokyo 1588557, Japan
关键词
natural language processing; toponym resolution; area identification; statistical information;
D O I
10.1587/transinf.E92.D.2313
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Toponyms and other named entities are main issues in unknown word processing problem. Our purpose is to salvage unknown toponyms, not only for avoiding noises but also providing them information of area candidates to where they may belong. Most of previous toponym resolution methods were targeting disambiguation among area candidates. which is caused by the multiple existence of a toponym. These approaches were mostly based on gazetteers and contexts. When it comes to the documents which may contain toponyms worldwide, like newspaper articles, toponym resolution is not just an ambiguity resolution, but an area candidate selection from all the areas on Earth. Thus we propose an automatic toponym resolution method which enables to identify its, area candidates based only on their Surface statistics, in place of dictionary-lookup approaches. Our method combines two modules, area candidate reduction and area candidate examination which uses block-unit data, to obtain high accuracy without reducing recall rate. Our empirical result showed 85.54% precision rate, 91.92% recall rate and .89 F-measure value on average. This method is it flexible and robust approach for toponym resolution targeting unrestricted number of areas.
引用
收藏
页码:2313 / 2320
页数:8
相关论文
共 50 条
  • [41] Impulse noise detection based on robust statistics and genetic programming
    Petrovic, N
    Crnojevic, V
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2005, 3708 : 643 - 649
  • [42] Toponym Based Community Search in Large Social Network
    Chen, Zhenghui
    Xu, Jian
    Bao, Qing
    Yang, Xue
    Zhang, Huaixiang
    Xu, Ming
    2021 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND INTELLIGENT CONTROL (ICCEIC 2021), 2021, : 57 - 63
  • [43] Robust Optimization-Based Alignment Method Based on Projection Statistics Algorithm
    Zhu, Bing
    Li, Jingshu
    Cui, Mei
    Cui, Guoheng
    IEEE SENSORS JOURNAL, 2021, 21 (15) : 16538 - 16546
  • [44] Robust Super-resolution Based on Selective Samples
    Peng, Cong
    Luo, Yu-Pin
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 72 - 76
  • [45] ROBUST LEARNING-BASED SUPER-RESOLUTION
    Kim, Changhyun
    Choi, Kyuha
    Lee, Ho-young
    Hwang, Kyuyoung
    Ra, Jong Beom
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 2017 - 2020
  • [46] A Chinese Toponym Recognition Method Based on Conditional Random Field
    Wu L.
    Liu L.
    Li H.
    Gao Y.
    Gao, Yong (gaoyong@pku.edu.can), 2017, Editorial Board of Medical Journal of Wuhan University (42): : 150 - 156
  • [47] A Statistics-Based Watermarking Scheme Robust to Print-and-Scan
    Cheng, Daofang
    Li, Xiaolong
    Qi, Wenfa
    Yang, Bin
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 894 - 898
  • [48] Study on Fresh Breast Tissues by Raman Spectroscopy Based on Robust Statistics
    Zheng Chao
    Zhang Haipeng
    Han Bing
    Liang Lijia
    Zou Yabin
    Xu Shuping
    Lin Heping
    Xu Weiqing
    Fan Zhimin
    CHEMICAL JOURNAL OF CHINESE UNIVERSITIES-CHINESE, 2015, 36 (01): : 74 - 80
  • [50] SOME RESULTS ON ROBUST ESTIMATION BASED ON KOLMOGOROV-SMIRNOV STATISTICS
    RAO, PV
    LITTELL, RC
    ANNALS OF MATHEMATICAL STATISTICS, 1971, 42 (06): : 2178 - &