Mention detection in Turkish coreference resolution

被引:0
|
作者
Demir, Seniz [1 ]
Akdag, Hanifi Ibrahim [1 ]
机构
[1] MEF Univ, Dept Comp Engn, Istanbul, Turkiye
关键词
Coreference resolution; mention detection; neural network; language model; Turkish;
D O I
10.55730/1300-0632.4095
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A crucial step in understanding natural language is detecting mentions that refer to real-world entities in a text and correctly identifying their boundaries. Mention detection is commonly considered a preprocessing step in coreference resolution which is shown to be helpful in several language processing applications such as machine translation and text summarization. Despite recent efforts on Turkish coreference resolution, no standalone neural solution to mention detection has been proposed yet. In this article, we present two models designed for detecting Turkish mentions by using feed-forward neural networks. Both models extract all spans up to a fixed length from input text as candidates and classify them as mentions or not mentions. The models differ in terms of how candidate text spans are represented. The first model represents a span by focusing on its first and last words, whereas the representation also covers the preceding and proceeding words of a span in the second model. Mention span representations are formed by using contextual embeddings, part-of-speech embeddings, and named-entity embeddings of words in interest where contextual embeddings are obtained from pretrained Turkish language models. In our evaluation studies, we not only assess the impact of mention representation strategies on system performance but also demonstrate the usability of different pretrained language models in resolution task. We argue that our work provides useful insights to the existing literature and the first step in understanding the effectiveness of neural architectures in Turkish mention detection.
引用
收藏
页码:682 / 697
页数:17
相关论文
共 50 条
  • [21] Do UD Trees Match Mention Spans in Coreference Annotations?
    Popel, Martin
    Zabokrtsky, Zdenek
    Nedoluzhko, Anna
    Novak, Michal
    Zeman, Daniel
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3570 - 3576
  • [22] Joint Anaphoricity Detection and Coreference Resolution with Constrained Latent Structures
    Lassalle, Emmanuel
    Denis, Pascal
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2274 - 2280
  • [23] Exploration of coreference resolution: The ACE entity detection and recognition task
    Chen, Ying
    Hacioglu, Kadri
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 301 - 308
  • [24] Coreference Resolution for Latvian
    Znotins, Arturs
    Paikens, Peteris
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 3209 - 3213
  • [25] Signed Coreference Resolution
    Yin, Kayo
    DeHaan, Kenneth
    Alikhani, Malihe
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 4950 - 4961
  • [26] Multilingual coreference resolution
    Harabagiu, SM
    Maiorano, SJ
    [J]. 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : 142 - 149
  • [27] Multilingual coreference resolution
    Kuebler, Sandra
    Zhekova, Desislava
    [J]. LANGUAGE AND LINGUISTICS COMPASS, 2016, 10 (11): : 614 - 631
  • [28] Arukikata Travelogue Dataset with Geographic Entity Mention, Coreference, and Link Annotation
    Higashiyama, Shohei
    Ouchi, Hiroki
    Teranishi, Hiroki
    Otomo, Hiroyuki
    Ide, Yusuke
    Yamamoto, Aitaro
    Shindo, Hiroyuki
    Matsuda, Yuki
    Wakamiya, Shoko
    Inoue, Naoya
    Yamada, Ikuya
    Watanabe, Taro
    [J]. arXiv, 2023,
  • [29] NOMINAL COREFERENCE RESOLUTION FOR POLISH
    Ogrodniczuk, Maciej
    [J]. POZNAN STUDIES IN CONTEMPORARY LINGUISTICS, 2019, 55 (02): : 367 - 396
  • [30] Approaches to biomedical coreference resolution
    Mondal, Ishani
    [J]. PROCEEDINGS OF THE 7TH ACM IKDD CODS AND 25TH COMAD (CODS-COMAD 2020), 2020, : 343 - 344