Studying the Role of Named Entities for Content Preservation in Text Style Transfer

被引:0
|
作者
Babakov, Nikolay [1 ]
Dale, David [1 ]
Logacheva, Varvara [1 ]
Krotova, Irina [2 ]
Panchenko, Alexander [1 ]
机构
[1] Skolkovo Inst Sci & Technol, Moscow, Russia
[2] Mobile TeleSyst MTS, Moscow, Russia
关键词
Text style transfer; Content preservation; Named entities;
D O I
10.1007/978-3-031-08473-7_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text style transfer techniques are gaining popularity in Natural Language Processing, finding various applications such as text detoxification, sentiment, or formality transfer. However, the majority of the existing approaches were tested on such domains as online communications on public platforms, music, or entertainment yet none of them were applied to the domains which are typical for task-oriented production systems, such as personal plans arrangements (e.g. booking of flights or reserving a table in a restaurant). We fill this gap by studying formality transfer in this domain. We noted that, the texts in this domain are full of named entities, which are very important for keeping the original sense of the text. Indeed, if for example, someone communicates destination city of a flight is must not be altered. Thus, we concentrate on the role of named entities in content preservation for formality text style transfer. We collect a new dataset for the evaluation of content similarity measures in text style transfer. It is taken from a corpus of task-oriented dialogues and contains many important entities related to realistic requests that make this dataset particularly useful for testing style transfer models before using them in production. Besides, we perform an error analysis of a pre-trained formality transfer model and introduce a simple technique to use information about named entities to enhance the performance of baseline content similarity measures used in text style transfer.
引用
收藏
页码:437 / 448
页数:12
相关论文
共 50 条
  • [1] Processing Named Entities in Text
    McNamee, Paul
    Mayfield, James C.
    Piatko, Christine D.
    [J]. JOHNS HOPKINS APL TECHNICAL DIGEST, 2011, 30 (01): : 31 - 40
  • [2] Measuring Content Preservation in Textual Style Transfer
    Fitzpatrick, Stuart
    Park, Laurence
    Obst, Oliver
    [J]. DATA MINING, AUSDM 2022, 2022, 1741 : 3 - 14
  • [3] Enhancing Content Preservation in Text Style Transfer Using Reverse Attention and Conditional Layer Normalization
    Lee, Dongkyu
    Tian, Zhiliang
    Xue, Lanqing
    Zhang, Nevin L.
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 93 - 102
  • [4] Building Language Models for Text with Named Entities
    Parvez, Md Rizwan
    Chakraborty, Saikat
    Ray, Baishakhi
    Chang, Kai-Wei
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2373 - 2383
  • [5] Locating Complex Named Entities in Web Text
    Downey, Doug
    Broadhead, Matthew
    Etzioni, Oren
    [J]. 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2733 - 2739
  • [6] A large-scale computational study of content preservation measures for text style transfer and paraphrase generation
    Babakov, Nikolay
    Dale, David
    Logacheva, Varvara
    Panchenko, Alexander
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022, : 300 - 321
  • [7] Style Transfer with Content Preservation from Multiple Images
    Liu, Dilin
    Yu, Wei
    Yao, Hongxun
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 783 - 791
  • [8] A lightweight approach to coreference resolution for named entities in text
    Dimitrov, M
    Bontcheva, K
    Cunningham, H
    Maynard, D
    [J]. ANAPHORA PROCESSING: LINGUISTIC, COGNITIVE AND COMPUTATIONAL MODELLING, 2004, 263 : 97 - 111
  • [9] Named Entities as Privileged Information for Hierarchical Text Clustering
    Sinoara, Roberta A.
    Sundermann, Camila V.
    Marcacini, Ricardo M.
    Domingues, Marcos A.
    Rezende, Solange O.
    [J]. PROCEEDINGS OF THE 18TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM (IDEAS14), 2014, : 57 - 66
  • [10] Temporal Role Annotation for Named Entities
    Koutraki, Maria
    Bakhshandegan-Moghaddam, Farshad
    Sack, Harald
    [J]. PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC SYSTEMS, 2018, 137 : 223 - 234