Reuse and plagiarism in Speech and Natural Language Processing publications

被引:5
|
作者
Mariani, Joseph [1 ]
Francopoulo, Gil [1 ,2 ]
Paroubek, Patrick [1 ]
机构
[1] Univ Paris Saclay, CNRS, LIMSI, Orsay, France
[2] Tagmatica, Paris, France
关键词
Plagiarism; Detection; Text reuse; Natural Language Processing; Speech Processing; Scientometrics; Informetrics;
D O I
10.1007/s00799-017-0211-0
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
The aim of this experiment is to present an easy way to compare fragments of texts in order to detect (supposed) results of copy and paste operations between articles in the domain of Natural Language Processing (NLP), including Speech Processing. The search space of the comparisons is a corpus labeled as NLP4NLP, which includes 34 different conferences and journals and gathers a large part of the NLP activity over the past 50 years. This study considers the similarity between the papers of each individual event and the complete set of papers in the whole corpus, according to four different types of relationship (self-reuse, self-plagiarism, reuse and plagiarism) and in both directions: a paper borrowing a fragment of text from another paper of the corpus (that we will call the source paper), or in the reverse direction, fragments of text from the source paper being borrowed and inserted in another paper of the corpus. The results show that self-reuse is rather a common practice, but that plagiarism seems to be very unusual, and that both stay within legal and ethical limits.
引用
收藏
页码:113 / 126
页数:14
相关论文
共 50 条
  • [21] Plagiarism in Scientific Publications
    Molina Gomez, Ana Maria
    Selin Ganen, Marina
    MEDISUR-REVISTA DE CIENCIAS MEDICAS DE CIENFUEGOS, 2016, 14 (01): : 7 - 9
  • [22] Speech and natural language
    1600, Morgan Kaufmann Publ Inc, San Mateo, CA, USA (02):
  • [23] A Critical Survey on the use of Fuzzy Sets in Speech and Natural Language Processing
    Carvalho, Joao P.
    Batista, Fernando
    Coheur, Luisa
    2012 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2012,
  • [24] Combining automatic speech recognition with semantic natural language processing in schizophrenia
    Ciampelli, S.
    Voppel, A. E.
    de Boer, J. N.
    Koops, S.
    Sommer, I. E. C.
    PSYCHIATRY RESEARCH, 2023, 325
  • [25] Note from the editor: “Rethinking natural language processing for speech technology”
    Amy Neustein
    International Journal of Speech Technology, 2008, 11 (3-4)
  • [26] An overview of natural language processing techniques in text-to-speech systems
    Külekci, MO
    Oflazer, K
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 454 - 457
  • [27] Note from the editor: "Rethinking natural language processing for speech technology"
    Neustein, Amy
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2008, 11 (3-4) : 107 - 108
  • [28] Analysis of spontaneous speech in Parkinson's disease by natural language processing
    Yokoi, Katsunori
    Iribe, Yurie
    Kitaoka, Norihide
    Tsuboi, Takashi
    Hiraga, Keita
    Satake, Yuki
    Hattori, Makoto
    Tanaka, Yasuhiro
    Sato, Maki
    Hori, Akihiro
    Katsuno, Masahisa
    PARKINSONISM & RELATED DISORDERS, 2023, 113
  • [29] Natural language acquisition and gestalt language processing: A critical analysis of their application to autism and speech language therapy
    Hutchins, Tiffany L.
    Knox, Sophie E.
    Fletcher, E. Cheryl
    AUTISM & DEVELOPMENTAL LANGUAGE IMPAIRMENTS, 2024, 9
  • [30] From communicative context to speech: Integrating dialogue processing, speech production and natural language generation
    Teich, E
    Hagen, E
    Grote, B
    Bateman, J
    SPEECH COMMUNICATION, 1997, 21 (1-2) : 73 - 99