Generating flexible proper name references in text: Data, models and evaluation

Authors
Ferreira, Thiago Castro [1]
Krahmer, Emiel [1]
Wubben, Sander [1]
Affiliations
[1] Tilburg Univ, Tilburg Ctr Cognit & Commun TiCC, Tilburg, Netherlands
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This study introduces a statistical model able to generate variations of a proper name by taking into account the person to be mentioned, the discourse context and variation. The model relies on the REGnames corpus, a dataset with 53,102 proper name references to 1,000 people in different discourse contexts. We evaluate the versions of our model from the perspective of how human writers produce proper names, and also how human readers process them. The corpus and the model are publicly available.
Pages: 655-664
Page count: 10
    RAIRO-RECHERCHE OPERATIONNELLE-OPERATIONS RESEARCH, 1998, 32 (02): : 145 - 192