High-Precision Person Name Extraction from Turkish Texts Using Wikipedia

被引:1
|
作者
Kucuk, Dilek [1 ]
Kucuk, Dogan [2 ]
机构
[1] TUBITAK Energy Inst, Ankara, Turkey
[2] Gazi Univ, Ankara, Turkey
关键词
Person name extraction; Turkish; Wikipedia; Named entity; ENTITY RECOGNITION;
D O I
10.1007/978-3-319-19581-0_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus on person name extraction from diverse text types in Turkish and have compiled a large set of person names from Turkish Wikipedia. After automated post-processing to clean and extend it, we have performed extraction experiments using this resource on data sets of considerable sizes and achieved high precision rates. Next, we have shown that the use of non-local dependencies together with this Wikipedia resource improves recall, and hence F-Measure, considerably. Finally, we have tested the contribution of the resource and the scheme based on non-local dependencies to the person name extraction performance of a full-fledged named entity recognizer.
引用
收藏
页码:347 / 354
页数:8
相关论文
共 50 条
  • [1] Improving Information Extraction from Wikipedia Texts using Basic English
    Rodriguez-Ferreira, Teresa
    Rabadan, Adrian
    Hervas, Raquel
    Diaz, Alberto
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 395 - 400
  • [2] Person Name Extraction From Turkish Financial News Text Using Local Grammar-Based Approach
    Bayraktar, Oezkan
    Temizel, Tugba Taskaya
    [J]. 23RD INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2008, : 225 - 228
  • [3] High-Precision Extraction of Emerging Concepts from Scientific Literature
    King, Daniel
    Downey, Doug
    Weld, Daniel S.
    [J]. PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1549 - 1552
  • [4] With a Little Help from my Neighbors: Person Name Linking Using the Wikipedia Social Network
    Geiss, Johanna
    Gertz, Michael
    [J]. PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 985 - 990
  • [5] Modelling of high-precision edge extraction phenomena
    Tretjakov, E. V.
    Simkin, B. E.
    [J]. PERCEPTION, 1996, 25 : 80 - 80
  • [6] High-Precision Orientation and Skew Detection for Texts in Scanned Documents
    Boiangiu, Costin-Anton
    Raducanu, Bogdan
    Spataru, Andrei-Cristian
    [J]. 2009 IEEE 5TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING, PROCEEDINGS, 2009, : 145 - 148
  • [7] Collocation Extraction in Turkish Texts Using Statistical Methods
    Metin, Senem Kumova
    Karaoglan, Bahar
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, 2010, 6233 : 238 - +
  • [8] HIGH-PRECISION BIO-MOLECULAR EVENT EXTRACTION FROM TEXT USING PARALLEL BINARY CLASSIFIERS
    Van Landeghem, Sofie
    De Baets, Bernard
    Van de Peer, Yves
    Saeys, Yvan
    [J]. COMPUTATIONAL INTELLIGENCE, 2011, 27 (04) : 645 - 664
  • [9] INFORMATION EXTRACTION AS A BASIS FOR HIGH-PRECISION TEXT CLASSIFICATION
    RILOFF, E
    LEHNERT, W
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1994, 12 (03) : 296 - 333
  • [10] Research on a high-precision extraction method of industrial cuboid
    Liu, Qi
    Zhu, Zijian
    Huo, Ju
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132